Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

ctiwari7
by New Contributor II
  • 506 Views
  • 2 replies
  • 1 kudos

get job run link based on the job name or the submit body

This is the current code (ignore the indentation) that I am using; it takes the list of all running jobs and then filters that list to get the run ID of the matching job name. I want to know if there is a better way to optimise this. Legacy d...

Latest Reply
ctiwari7
New Contributor II
  • 1 kudos

Even the REST API provides the job details based on the job ID, which I would need to get from the job_name that I have. This seems like the only possible solution, since job_id is the true identifier of any workflow job, considering we can have mu...

1 More Replies
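A hedged sketch of the approach discussed in this thread, using the Databricks SDK for Python instead of filtering every running job client-side (the job name is a placeholder; jobs.list, jobs.list_runs, and run_page_url are the SDK pieces assumed here):

# Sketch: resolve job_id from a job name, then fetch the run page URL of its active run.
# Assumes databricks-sdk is installed and credentials are picked up from the environment.
from databricks.sdk import WorkspaceClient

w = WorkspaceClient()
job_name = "my_workflow_job"  # placeholder job name

jobs = list(w.jobs.list(name=job_name))  # server-side name filter, no full scan needed
if not jobs:
    raise ValueError(f"No job found with name {job_name}")
job_id = jobs[0].job_id

runs = list(w.jobs.list_runs(job_id=job_id, active_only=True))  # only currently running runs
if runs:
    print(runs[0].run_id, runs[0].run_page_url)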
ctiwari7
by New Contributor II
  • 7 Views
  • 0 replies
  • 0 kudos

Databricks workflow job

Hi team, I am trying to execute a workflow job which takes a parameter as a unique identifier. I am using this job parameter to push down to tasks. I was hoping there is a way for me to use Python's uuid4() function to generate a unique ID every tim...

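One possible pattern for the question above (a sketch under assumptions, since the thread has no reply yet): generate the UUID once in a small first task and share it with downstream tasks via task values. The task and key names here are hypothetical.

# First task of the workflow: generate a unique ID once per run and publish it as a task value.
import uuid

run_uuid = str(uuid.uuid4())
dbutils.jobs.taskValues.set(key="run_uuid", value=run_uuid)  # dbutils is available in Databricks notebooks

# In a downstream task, read it back (taskKey is the name of the task that set the value):
# run_uuid = dbutils.jobs.taskValues.get(taskKey="generate_uuid", key="run_uuid", default=None)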
Thor
by New Contributor III
  • 5 Views
  • 0 replies
  • 0 kudos

Asynchronous progress tracking with foreachBatch

Hello, currently the doc says that async progress tracking is available only for the Kafka sink: https://docs.databricks.com/en/structured-streaming/async-progress-checking.html I would like to know if it would work for any sink that is "exactly once"? I exp...

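For reference, a sketch of how the option is enabled today on the supported Kafka sink per the linked doc (servers, topic, path, and interval are placeholders; assumes df is an existing streaming DataFrame). Whether the same flag behaves correctly with foreachBatch or other exactly-once sinks is exactly the open question above.

# Enabling async progress tracking on a Kafka sink, per the linked documentation.
query = (
    df.writeStream
      .format("kafka")
      .option("kafka.bootstrap.servers", "broker:9092")
      .option("topic", "events")
      .option("checkpointLocation", "/Volumes/main/default/chk/events")
      .option("asyncProgressTrackingEnabled", "true")
      .option("asyncProgressTrackingCheckpointIntervalMs", "5000")
      .start()
)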
Frustrated_DE
by New Contributor III
  • 17 Views
  • 2 replies
  • 1 kudos

Data comparison

Hi, are there any tools within Databricks for large-volume data comparisons? I appreciate there are methods for DataFrame comparisons for unit testing (assertDataFrameEqual), but it is my understanding these are for testing transformations on smallish...

Latest Reply
Frustrated_DE
New Contributor III
  • 1 kudos

Thanks Szymon, I will give these a try!

1 More Replies
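For very large tables, one common Spark-native approach (a sketch; the table names are placeholders) is a symmetric difference with exceptAll rather than an assertion helper:

# Anything returned by either exceptAll is a row that does not match between the two tables.
source = spark.table("main.bronze.source_snapshot")   # placeholder names
target = spark.table("main.silver.target_snapshot")

only_in_source = source.exceptAll(target)
only_in_target = target.exceptAll(source)

print(only_in_source.count(), only_in_target.count())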
Isa1
by Visitor
  • 55 Views
  • 6 replies
  • 3 kudos

Resolved! Moving existing Delta Live Table to Asset Bundle

Hi! I am creating an Asset Bundle, which also includes my streaming Delta Live Table pipelines. I want to move these DLT pipelines to the Asset Bundle without having to run my DLT streaming pipeline on all historical files (this takes a lot of comput...

Latest Reply
Walter_C
Databricks Employee
  • 3 kudos

When you change the path to the notebook or the name of your Delta Live Tables (DLT) pipeline, it can indeed cause issues. Specifically, either change can lead to the recreation of the pi...

5 More Replies
shadowinc
by New Contributor III
  • 24 Views
  • 1 reply
  • 2 kudos

Delete Partition Folders

Hello team, as Databricks moved away from Hive-style partitioning, we can see some 2-letter partition folders created, and I have observed that VACUUM doesn't delete these folders (even though they are empty). Is there any way to delete those usi...

Labels: Data Engineering, delta, vacuum
Latest Reply
Alberto_Umana
Databricks Employee
  • 2 kudos

Hello @shadowinc, VACUUM is used to clean up unused and stale data files that are no longer referenced by a Delta table and are older than a specified retention period (default is 7 days). It does not remove empty directories. I think manual cleanup ...

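If manual cleanup is the route taken, a minimal sketch under assumptions (the table path is a placeholder; verify the directories are truly unreferenced before deleting anything under a live table):

# List first-level entries under the table location and remove directories that are empty.
table_path = "abfss://container@account.dfs.core.windows.net/tables/my_table"  # placeholder

for entry in dbutils.fs.ls(table_path):
    if entry.isDir() and not dbutils.fs.ls(entry.path):  # directory with no contents
        dbutils.fs.rm(entry.path, recurse=True)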
Hubert-Dudek
by Esteemed Contributor III
  • 10749 Views
  • 6 replies
  • 17 kudos

Resolved! Optimize and Vacuum - which is the best order of operations?

Optimize -> Vacuum, or Vacuum -> Optimize?

Latest Reply
shadowinc
New Contributor III
  • 17 kudos

What about REORG on a Delta table? https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/delta-reorg-table Does it help or make sense to add REORG, then Optimize - Vacuum, every week? Reorganize a Delta Lake table by rewriting files to purge ...

5 More Replies
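A hedged sketch of the weekly maintenance sequence discussed in this thread (the table name is a placeholder; REORG ... APPLY (PURGE) is the syntax from the linked page):

# Compact small files, purge soft-deleted data, then vacuum files no longer referenced.
table = "main.silver.events"  # placeholder table name

spark.sql(f"OPTIMIZE {table}")
spark.sql(f"REORG TABLE {table} APPLY (PURGE)")
spark.sql(f"VACUUM {table}")  # default 7-day retention applies unless overridden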
Kamal2
by New Contributor II
  • 15733 Views
  • 3 replies
  • 4 kudos

Resolved! PDF Parsing in Notebook

I have PDF files stored in Azure ADLS. I want to parse the PDF files into PySpark DataFrames. How can I do that?

Latest Reply
Mykola_Melnyk
  • 4 kudos

Please look at the PDF DataSource for Apache Spark. This project provides a custom data source for Apache Spark that allows you to read PDF files into a Spark DataFrame. Here is a notebook with an example of usage: df = spark.read.format("pdf") \ ...

2 More Replies
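A hedged reconstruction of the truncated snippet above, based on that project's documented usage (the options, output column names, and ADLS path are assumptions, not verified here):

# Requires the third-party PDF data source library to be installed on the cluster.
df = (
    spark.read.format("pdf")
    .option("imageType", "BINARY")   # assumed option from the project's examples
    .option("resolution", "200")     # assumed option from the project's examples
    .load("abfss://container@account.dfs.core.windows.net/pdfs/")  # placeholder path
)
df.select("path", "page_number", "text").show()  # assumed output columns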
Erik
by Valued Contributor III
  • 70 Views
  • 1 reply
  • 2 kudos

Managing streaming checkpoints with unity catalog

This is partly a question, partly a feature request: how do you guys handle streaming checkpoints in combination with Unity Catalog managed tables? It seems like the only way is to create a volume and manually specify paths in it as streaming checkpo...

Latest Reply
michelle653burk
  • 2 kudos

@Erik wrote: This is partly a question, partly a feature request: how do you guys handle streaming checkpoints in combination with Unity Catalog managed tables? It seems like the only way is to create a volume and manually specify paths in it as strea...

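A minimal sketch of the volume-based approach described in this thread (catalog, schema, volume, and table names are placeholders; assumes df is an existing streaming DataFrame):

# Write a stream into a Unity Catalog managed table while keeping the checkpoint in a UC volume.
checkpoint = "/Volumes/main/default/checkpoints/orders_stream"  # placeholder volume path

query = (
    df.writeStream
      .option("checkpointLocation", checkpoint)
      .trigger(availableNow=True)
      .toTable("main.default.orders")  # managed table; placeholder name
)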
ayush19
by New Contributor III
  • 38 Views
  • 2 replies
  • 0 kudos

Running a jar on Databricks shared cluster using Airflow

Hello, I have a requirement to run a JAR already installed on a Databricks cluster, and it needs to be orchestrated using Apache Airflow. I followed the docs for the operator which can be used to do so: https://airflow.apache.org/docs/apache-airflow-provid...

Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Hello @ayush19, here are some suggestions, but I would need to check how the parameters are configured. Use an existing cluster: instead of creating a new cluster each time, configure the DatabricksSubmitRunOperator to use an existing cluster. This can...

1 More Replies
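A hedged sketch of the existing-cluster suggestion above, using DatabricksSubmitRunOperator from apache-airflow-providers-databricks (the connection ID, cluster ID, and class name are placeholders; whether a JAR task is allowed on a shared-access-mode cluster depends on the cluster configuration):

# Airflow task that submits a spark_jar_task run against an already-running cluster.
from airflow.providers.databricks.operators.databricks import DatabricksSubmitRunOperator

run_jar = DatabricksSubmitRunOperator(
    task_id="run_installed_jar",
    databricks_conn_id="databricks_default",        # placeholder connection
    existing_cluster_id="1234-567890-abcde123",     # placeholder cluster ID
    spark_jar_task={
        "main_class_name": "com.example.Main",      # placeholder main class
        "parameters": ["--env", "dev"],
    },
    # libraries=[{"jar": "dbfs:/path/to/app.jar"}], # only if the jar is not already on the cluster
)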
htu
by New Contributor III
  • 4768 Views
  • 8 replies
  • 20 kudos

Installing Databricks Connect breaks pyspark local cluster mode

Hi, it seems that when databricks-connect is installed, pyspark is modified at the same time so that it no longer works with a local master node. Local mode has been especially useful in testing, when running unit tests for Spark-related code without any remot...

Latest Reply
lukany
Visitor
  • 20 kudos

Hi, we are facing this issue as well, i.e. the RuntimeError as reported in this comment. We use the workaround with poetry groups as suggested in this comment. The workaround introduces unnecessary and non-intuitive complexity to dependency management and ...

7 More Replies
NhanNguyen
by Contributor II
  • 42 Views
  • 1 reply
  • 0 kudos

ConcurrentAppendException after enabling Liquid Clustering and row-level concurrency on a Delta table

Every time I run parallel jobs, they fail with this error: ConcurrentAppendException: Files were added to the root of the table by a concurrent update. Please try the operation again. I did a lot of research and also created a liquid clustering table an...

Latest Reply
NhanNguyen
Contributor II
  • 0 kudos

Note: I tried both DBR 13.3.x and 14.3.x, but it still failed with the same error.

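A common mitigation for this kind of conflict (a sketch under assumptions, not a confirmed fix for this thread; the exception import comes from the delta-spark Python package and the table name is a placeholder) is to retry the conflicting append with backoff:

# Retry a write that can hit ConcurrentAppendException when several jobs append at once.
import time
from delta.exceptions import ConcurrentAppendException

def append_with_retry(df, table_name: str, max_attempts: int = 5) -> None:
    for attempt in range(1, max_attempts + 1):
        try:
            df.write.format("delta").mode("append").saveAsTable(table_name)
            return
        except ConcurrentAppendException:
            if attempt == max_attempts:
                raise
            time.sleep(2 ** attempt)  # simple exponential backoff before retrying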
dixonantony
by New Contributor II
  • 56 Views
  • 3 replies
  • 0 kudos

Not able to create a table from PySpark SQL using Databricks Unity Catalog open APIs

I was trying to access Databricks and do DDL/DML operations using the Databricks Unity Catalog open APIs. Create schema and select on tables are working, but create table is not working due to the issues below; could you please help? I was using PySpark SQL ...

Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Hello @dixonantony, can you try running this command? spark.sql("create table datatest.dischema.demoTab1(id int, name VARCHAR(10), age int)") Ensure that you have the necessary permissions to create tables in Unity Catalog. You need the CREATE TABLE ...

2 More Replies
isai-ds
by New Contributor
  • 108 Views
  • 1 reply
  • 0 kudos

Salesforce LakeFlow Connect - deletion of Salesforce records

Hello, I am new to Databricks and to data engineering. I am running a POC to sync data between a Salesforce sandbox and Databricks using LakeFlow Connect. I have already made the connection and successfully synced data between Salesforce and Databr...

Latest Reply
cgrant
Databricks Employee
  • 0 kudos

Right now, the Salesforce connector only supports SCD Type 1. Please be on the lookout for SCD Type 2 functionality in the near future.


Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group