Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

Naga05
by New Contributor III
  • 429 Views
  • 4 replies
  • 2 kudos

Databricks app with parameters from a Databricks asset bundle

Hello! I tried setting up a Databricks App using an asset bundle, where I was able to successfully parameterize the SQL warehouse ID specified on specific targets. However, I was unable to get the values of other variables from the targets, the...

Latest Reply
Naga05
New Contributor III

I found that this is an implementation in progress in the Databricks CLI: https://github.com/databricks/cli/issues/3679

3 More Replies
smoortema
by Contributor
  • 356 Views
  • 2 replies
  • 3 kudos

Resolved! Handling both PySpark and Python exceptions

In a Python notebook, I am using error handling according to the official documentation:

    try:
        [some data transformation steps]
    except PySparkException as ex:
        [logging steps to log the error condition and error message in a table]

However, this catches o...

Latest Reply
mark_ott
Databricks Employee

To handle both PySpark exceptions and general Python exceptions without double-logging or overwriting error details, the recommended approach is to use multiple except clauses that distinguish the exception type clearly. In Python, exception handlers...
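A minimal sketch of that two-clause pattern, assuming a Databricks notebook where spark is already defined; the table names and the log_error helper are placeholders for illustration, not something from this thread:

    from pyspark.errors import PySparkException  # available in PySpark 3.4+

    def log_error(error_class: str, message: str) -> None:
        # Hypothetical helper: append the error details to a logging table.
        spark.createDataFrame([(error_class, message)], "error_class string, message string") \
            .write.mode("append").saveAsTable("ops.error_log")

    try:
        # [some data transformation steps]
        df = spark.table("bronze.events")
        df.write.mode("append").saveAsTable("silver.events")
    except PySparkException as ex:
        # Spark-side failures; this clause must come before the generic one,
        # because PySparkException is itself a subclass of Exception.
        log_error(ex.getErrorClass() or "PYSPARK_EXCEPTION", str(ex))
        raise
    except Exception as ex:
        # Plain Python errors land here, so nothing is logged twice.
        log_error(type(ex).__name__, str(ex))
        raise

Keeping the Spark-specific clause first is what prevents the generic handler from catching and re-logging PySpark errors.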

1 More Replies
tom_1
by New Contributor III
  • 1314 Views
  • 5 replies
  • 1 kudos

Resolved! BUG in Job Task of Type DBT

Hi, just wanted to let the Databricks team know that there is a bug in the task UI. Currently it is not possible to save a task of "Type: dbt" if the "SQL Warehouse" is set to "None (Manual)". Some weeks ago this was possible, also the "Profiles Direc...

Latest Reply
Aishu95
New Contributor II

I am still facing this bug. I don't want to select any SQL warehouse; what do I do, and from where can I pass the profiles directory?

4 More Replies
Navi991100
by New Contributor II
  • 241 Views
  • 3 replies
  • 1 kudos

Resolved! I recently made a new account on Databricks under the Free edition

By default it made a SQL warehouse compute, but I want all-purpose compute, as I want to test and learn the capabilities of PySpark and Databricks. I can't connect with the serverless compute in the notebook; it gives a mean error as follows: "An error occurr...

Latest Reply
belforte
New Contributor II

In the free Databricks edition, to use PySpark you need to create and start a cluster, since the SQL warehouse is only for SQL queries. Go to Compute > Create Cluster, set up a free cluster, click Start, and then attach your notebook to it; this will ...

2 More Replies
yit
by Contributor III
  • 289 Views
  • 1 reply
  • 1 kudos

How to implement MERGE operations in Lakeflow Declarative Pipelines

Hey everyone, we've been using Autoloader extensively for a while, and now we're looking to transition to full Lakeflow Declarative Pipelines. From what I've researched, the reader part seems straightforward and clear. For the writer, I understand that...

Latest Reply
saurabh18cs
Honored Contributor II

Hi @yit, Lakeflow supports upsert/merge semantics natively for Delta tables, unlike forEachBatch. Instead of writing custom forEachBatch code, you declare the merge keys and update logic in your pipeline configuration. Lakeflow will automatically generate...
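For reference, a minimal sketch of that declarative approach with the dlt Python API; the source path, key column, and sequence column below are placeholder assumptions, not taken from this thread:

    import dlt
    from pyspark.sql.functions import col

    @dlt.view
    def updates():
        # Source stream, e.g. Auto Loader (path and format are placeholders)
        return (
            spark.readStream.format("cloudFiles")
            .option("cloudFiles.format", "json")
            .load("/Volumes/main/raw/events/")
        )

    # Declare the target table once; the pipeline manages its creation
    dlt.create_streaming_table("silver_events")

    # Declarative merge: keys and ordering replace a hand-written forEachBatch MERGE
    dlt.apply_changes(
        target="silver_events",
        source="updates",
        keys=["event_id"],            # merge key(s), placeholder column name
        sequence_by=col("ingest_ts"), # resolves out-of-order updates, placeholder column
        stored_as_scd_type=1,         # 1 = plain upsert, 2 = keep change history
    )

The pipeline then generates and runs the underlying merge for each micro-batch, which is the part you would otherwise hand-write in forEachBatch.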

vishal_balaji
by New Contributor II
  • 494 Views
  • 2 replies
  • 1 kudos

Unable to access metrics from Driver node on localhost:4040

Greetings, I am trying to set up monitoring in Grafana for all my Databricks clusters. I have added two things as part of this. Under Compute > Configuration > Advanced > Spark > Spark Config, I have added spark.ui.prometheus.enabled true. Under init_scripts, I...

Latest Reply
szymon_dybczak
Esteemed Contributor III

Hi @vishal_balaji, you're following guides that were prepared for OSS Apache Spark. localhost won't work in this case because in Databricks all compute is cloud-based. Please follow the guide below on how to configure it properly on Databricks: Data...

1 More Replies
saurabh18cs
by Honored Contributor II
  • 465 Views
  • 4 replies
  • 3 kudos

Autoloader - File Notification Mode

Hello all, we have started consuming source messages/files via Autoloader directory listing mode and want to convert this to file notification mode instead, so consumption can be faster with no more scanning of entire directories/folders. I ...

Latest Reply
saurabh18cs
Honored Contributor II

Hi @K_Anudeep @szymon_dybczak, how do I understand a situation where 100 jobs are running in parallel with minimal latency needed? Does Autoloader directly connect to the cloud queue service, or does Databricks store and manage detected files somewhere? ...

3 More Replies
jmeer
by New Contributor II
  • 1595 Views
  • 5 replies
  • 2 kudos

Cannot click Compute tab

Hi, I want to change the cluster I am using. However, when I click on the "Compute" tab on the platform, I get automatically redirected to the "SQL Warehouses" page. I am not able to click and enter the "Compute" page. How can I solve this? Thank you

Latest Reply
efchea
New Contributor II

I had the same problem, but I figured it out. You might be working in a serverless space of Databricks, so you don't have the option to create a cluster. To fix the compute issue, create a new notebook. After that, look at the right of your screen af...

4 More Replies
RobsonNLPT
by Contributor III
  • 360 Views
  • 2 replies
  • 0 kudos

Foreign Catalog Wrong Mapping - Azure SQL Database Binary Column

Hi all. I've used foreign catalogs attached to Azure SQL databases and never had problems except in two situations: 1) Foreign catalogs don't support SQL schemas/objects like [xxxx.yyyy].tablename; the workaround is creating views on the SQL database. 2) This i...

Latest Reply
Isi
Honored Contributor III

Hello @RobsonNLPT, one thing that might help to narrow this down: could you check whether the problem occurs for the entire column, or if some of the batches you receive actually contain the full (non-truncated) value? If some batches are complete but ...

1 More Replies
DataDev
by New Contributor
  • 470 Views
  • 5 replies
  • 3 kudos

Schedule Databricks jobs based on a custom calendar

I want to schedule Databricks jobs based on a custom calendar, e.g. skip the job run on random days or holidays. #databricks @DataBricks @DATA

Latest Reply
Advika
Databricks Employee

Hello @DataDev! Did the suggestions shared above help address your question? If so, please consider marking one or more responses as the accepted solution. If you found another approach that worked for you, sharing it with the community would be real...

4 More Replies
shan-databricks
by New Contributor III
  • 275 Views
  • 3 replies
  • 3 kudos

How to load all the previous day's data only into the newly added column of the existing delta table

How to load all the previous day's data only into the newly added column of the existing delta table? Is there any option available to do that without writing any logic?

Latest Reply
Advika
Databricks Employee

Hello @shan-databricks! Did the suggestions shared above help resolve your concern? If so, please consider marking one of the responses as the accepted solution. If you found a different approach that worked for you, it would be great if you could sh...

2 More Replies
philsch
by New Contributor III
  • 1073 Views
  • 8 replies
  • 3 kudos

Resolved! How to create a managed iceberg table via REST catalog

We're using Iceberg's Java lib to write managed Iceberg tables in Databricks. We can actually create these tables using Databricks as an Iceberg REST catalog, but this only works when we provide a partitioning spec. This is then picked up as cluster_columns f...

Latest Reply
liko
Databricks Employee

Why are you using the iceberg-core Java library instead of an existing open source Iceberg client (like Apache Spark)? Any of these can create a table with partitions when using Unity Catalog.
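For comparison, a rough sketch of the Spark route, assuming OSS Spark with the Iceberg runtime on the classpath and Unity Catalog's Iceberg REST endpoint; the endpoint path, token, and table/column names are assumptions to verify against the Databricks docs rather than confirmed values:

    from pyspark.sql import SparkSession

    # Placeholder URI, token and catalog name; adjust to your workspace.
    spark = (
        SparkSession.builder
        .config("spark.sql.catalog.uc", "org.apache.iceberg.spark.SparkCatalog")
        .config("spark.sql.catalog.uc.type", "rest")
        .config("spark.sql.catalog.uc.uri", "https://<workspace-host>/api/2.1/unity-catalog/iceberg")
        .config("spark.sql.catalog.uc.token", "<personal-access-token>")
        .config("spark.sql.catalog.uc.warehouse", "<uc-catalog-name>")
        .getOrCreate()
    )

    # A partitioned table created through the REST catalog; the partition spec is
    # what the thread above saw surfaced as cluster_columns in Databricks.
    spark.sql("""
        CREATE TABLE uc.analytics.events (
            id BIGINT,
            event_ts TIMESTAMP
        )
        USING iceberg
        PARTITIONED BY (days(event_ts))
    """)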

7 More Replies
chirag_nagar
by New Contributor
  • 1883 Views
  • 1 reply
  • 1 kudos

Resolved! Guidance Required for Informatica to Databricks Workflow Migration Using AI

Hi Team, I am currently exploring approaches to convert Informatica PowerCenter workflows into Databricks-compatible code using AI capabilities. As part of this effort, I would like to highlight that Informatica generates individual XML files for each...

Latest Reply
Louis_Frolio
Databricks Employee

Greetings @chirag_nagar, as you can imagine or know, migrations are extremely complex and time-consuming. There are a few approaches to migrations, but I want to focus on one: Bladebridge. This is a free tool provided by Databricks that is AI powe...

nikhilshetty4
by New Contributor III
  • 978 Views
  • 8 replies
  • 6 kudos

Issue with Autoloader cleanSource=MOVE Not Working as Expected

Hi everyone, I've been trying to explore the cleanSource option in Autoloader to move files from the source to an archive location after they're processed and loaded into a table. I used the following simple code to test this functionality. While the ...

Latest Reply
Ozear
New Contributor II

Any update on this?

7 More Replies
mgcasas-aws
by New Contributor
  • 1886 Views
  • 1 reply
  • 1 kudos

Resolved! Azure Databricks Serverless private connection to S3 bucket

I'm looking for technical references to connect an Azure Databricks serverless workspace to an S3 bucket over a private site-to-site VPN connection. Found the following to connect AWS (consumer) to Azure (provider), but I'm looking for the other way....

Latest Reply
Sai_Ponugoti
Databricks Employee

Hello @mgcasas-aws, thank you for your question! We're currently working on a solution for private cross-cloud Delta Sharing (Azure → AWS). In the meantime, here's a possible approach: update your Azure Storage Account network settings from private e...

