Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

samaiyanik
by New Contributor
  • 472 Views
  • 1 reply
  • 0 kudos

Resolved! Databricks Free Edition | RETRIES_EXCEEDED issue

Hi Team, I am not able to run the command below; I keep getting an error:

%sql
CREATE SCHEMA IF NOT EXISTS workspace.gold;

Error: The maximum number of retries has been exceeded.

I have tried all the available options but nothing worked.

Thanks,
Nikhil Samaiya

Latest Reply
Advika
Community Manager
  • 0 kudos

Hello @samaiyanik! Could you please try the suggestions shared in the post below and let us know if that helps resolve the issue?
Similar post: error: [RETRIES_EXCEEDED] The maximum number of retries has been exceeded

Subha0920
by Databricks Partner
  • 1448 Views
  • 3 replies
  • 1 kudos

Databricks recommended Approach to load data vault 2.0

Hi, please share the recommended approach to load Data Vault 2.0.

Overview
1. Current landscape: Lakehouse (Bronze/Silver/Gold)
2. Data Vault 2.0 to be created in the Silver layer.
3. Bronze data will be made available in Delta tables using ETL.

Questions
1. ...

Latest Reply
Subha0920
Databricks Partner
  • 1 kudos

Kindly provide your valuable input and suggestions on the questions above.
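While waiting for input, here is a minimal insert-only hub-load sketch in the spirit of Data Vault 2.0, run from a Databricks notebook. All table and column names (bronze.customers, silver.hub_customer, customer_id) are hypothetical examples, not a Databricks-recommended pattern:

```python
# Hypothetical sketch of an insert-only Data Vault 2.0 hub load in the Silver
# layer. The hash key is derived deterministically from the business key, and
# only keys not yet present in the hub are inserted.
HUB_LOAD_SQL = """
INSERT INTO silver.hub_customer (hub_customer_hk, customer_id, load_dts, record_source)
SELECT
    sha2(upper(trim(src.customer_id)), 256) AS hub_customer_hk,
    src.customer_id,
    current_timestamp()                     AS load_dts,
    'bronze.customers'                      AS record_source
FROM bronze.customers AS src
LEFT ANTI JOIN silver.hub_customer AS hub   -- keep only business keys new to the hub
    ON hub.hub_customer_hk = sha2(upper(trim(src.customer_id)), 256)
"""

def load_hub(spark):
    """Run the hub load; requires an active Spark session on Databricks."""
    spark.sql(HUB_LOAD_SQL)
```

Links and satellites would follow the same shape: hash the key (or key combination), anti-join against what already exists, and insert only the delta.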

2 More Replies
camilo_s
by Databricks Partner
  • 4606 Views
  • 5 replies
  • 0 kudos

Spark SQL vs serverless SQL

Are there any benchmarks showing performance and cost differences between running SQL workloads on Spark SQL vs Databricks SQL (especially serverless SQL)? Our customer is hesitant about getting locked into Databricks SQL as opposed to being able to ru...

Latest Reply
maxwarior
New Contributor II
  • 0 kudos

Spark SQL serves as the SQL interface for Spark applications, whereas Databricks SQL is a more advanced, warehouse-optimized product built around SQL Warehouses, which utilize multiple Spark clusters. This architectural difference can lead to noticea...

4 More Replies
habyphilipose
by New Contributor II
  • 1151 Views
  • 3 replies
  • 4 kudos

DLT table deletion

If we delete the DLT pipeline, the tables get deleted. But in a DLT pipeline which creates 5 tables, if I comment out the logic of 1 table, that table is not deleted from the catalog, even though a full refresh of the pipeline is done. Does anyone kno...

Latest Reply
MartinIsti
Databricks Partner
  • 4 kudos

Don't confuse DLT and LDP (Lakeflow Declarative Pipelines): although behind the scenes they work very similarly, the UI and the developer experience have changed immensely, and very important new features have been added. I used DLT extensively and in ...

2 More Replies
ChristianRRL
by Honored Contributor
  • 539 Views
  • 1 reply
  • 0 kudos

Troubleshooting AutoLoader

Hi there, I am running into a bit of an issue displaying some AutoLoader readStream data. Can I get some assistance to understand how to properly troubleshoot this? I've looked at logs before, but frankly it's not clear where to look exactly: First, "...

Latest Reply
MartinIsti
Databricks Partner
  • 0 kudos

I'm also working with AutoLoader these days to create an ingestion pattern, and troubleshooting it can be tricky. I wonder if you could pick a single file (whose full path / location / URI you exactly know) and read it without Autoloader, just with spa...
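The single-file check suggested above might look roughly like this, assuming Parquet input; the paths are placeholders:

```python
def read_single_file(spark, path):
    """Read one known file directly, bypassing Auto Loader, to verify that
    the file itself is readable and its schema matches expectations."""
    df = spark.read.format("parquet").load(path)
    df.printSchema()  # compare against the schema Auto Loader inferred
    return df

def autoloader_stream(spark, source_dir, schema_location):
    """The equivalent Auto Loader stream, for side-by-side comparison."""
    return (
        spark.readStream.format("cloudFiles")
        .option("cloudFiles.format", "parquet")
        .option("cloudFiles.schemaLocation", schema_location)
        .load(source_dir)
    )
```

If the plain read succeeds but the stream shows nothing, the problem is likely in the stream setup (checkpoint/schema location, file discovery) rather than in the data itself.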

ManojkMohan
by Honored Contributor II
  • 446 Views
  • 1 reply
  • 2 kudos

Resolved! Sample Data Reflecting but Uploaded File Not Reflecting

Step 1: I uploaded a CSV file manually in Databricks.
Step 2: Connector created and active between Salesforce and Databricks.
Step 3: Creating Data Streams in Salesforce Data Cloud.
Sample topics are reflecting, matching between what I see in Databricks ...

Latest Reply
ManojkMohan
Honored Contributor II
  • 2 kudos

I resolved it myself:
Step 1: Workspace --> manage permissions
Step 2: Chose all permissions
Step 3: Went to the raw uploaded file and shared it via Delta Sharing
Step 4: In the Salesforce data stream I got the raw file

Shruti12
by Databricks Partner
  • 2641 Views
  • 2 replies
  • 1 kudos

Does Databricks support updating multiple target rows with a single matching source row in a merge query?

Hi, I am getting this error in a merge statement: DeltaUnsupportedOperationException: Cannot perform Merge as multiple source rows matched and attempted to modify the same target row in the Delta table in possibly conflicting ways. Does Databricks suppor...

Latest Reply
Shruti12
Databricks Partner
  • 1 kudos

Hi @szymon_dybczak, thanks for your reply. The above code is working fine, which means multiple updates can be done from a single source row. So it may be that when there are complex matching conditions/values, the merge query gives an error. I cannot send you...
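For reference, the usual workaround for that DeltaUnsupportedOperationException is to collapse the source to one row per merge key before merging, so no two source rows can target the same target row. A sketch with made-up table and column names (target, source, key1, key2, value, updated_at):

```python
# Hypothetical sketch: deduplicate the source on the merge join columns,
# keeping the newest row per key, before running the MERGE.
DEDUP_MERGE_SQL = """
MERGE INTO target AS t
USING (
  SELECT key1, key2, value, updated_at
  FROM (
    SELECT s.*,
           ROW_NUMBER() OVER (
             PARTITION BY s.key1, s.key2    -- the merge join columns
             ORDER BY s.updated_at DESC     -- keep the newest source row
           ) AS rn
    FROM source AS s
  )
  WHERE rn = 1
) AS s
ON t.key1 = s.key1 AND t.key2 = s.key2
WHEN MATCHED THEN UPDATE SET t.value = s.value, t.updated_at = s.updated_at
WHEN NOT MATCHED THEN INSERT (key1, key2, value, updated_at)
                      VALUES (s.key1, s.key2, s.value, s.updated_at)
"""

def run_merge(spark):
    spark.sql(DEDUP_MERGE_SQL)
```

The reverse case (one source row updating multiple target rows) is allowed; it is only multiple source rows hitting the same target row that Delta rejects.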

1 More Replies
arsamkull
by New Contributor III
  • 8084 Views
  • 6 replies
  • 6 kudos

Usage of Azure DevOps System.AccessToken as PAT in Databricks

Hi there! I'm trying to use an Azure DevOps pipeline to automate the Azure Databricks Repos API. I'm using the following workflow:
- Get an access token for a Databricks service principal using a certificate (which works great)
- Use the REST API to generate Git cre...

Latest Reply
Srihasa_Akepati
Databricks Employee
  • 6 kudos

@Adrian Ehrsam The PAT limit has been increased to 2048 now. Please check.

5 More Replies
filipniziol
by Esteemed Contributor
  • 1391 Views
  • 1 reply
  • 2 kudos

Merge slows down when the table grows with liquid clustering enabled.

Hi everyone, I have a source table, a target table, and a MERGE statement that is inserting/updating records every couple of minutes. The clustering keys are set up to match the 2 merge join columns. I noticed that over time the processing time increase...

Latest Reply
kerem
Contributor
  • 2 kudos

Hi @filipniziol, I dealt with a large table of about a TB in size with liquid clustering enabled. Even with liquid clustering, selects and joins on the clustered columns took longer as the table grew. So I don't think it performs as fast as the table...
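One thing worth checking in this situation: with liquid clustering, files written by the frequent MERGEs are only clustered when OPTIMIZE runs, so a scheduled OPTIMIZE job may keep the merge from degrading as the table grows. A sketch, with a placeholder table name:

```python
# Hypothetical maintenance sketch for a liquid-clustered table that receives
# frequent MERGEs. OPTIMIZE incrementally clusters files written since the
# last run; on Databricks it can be scheduled as its own job.
MAINTENANCE_STATEMENTS = [
    # Incrementally cluster files written since the last OPTIMIZE run.
    "OPTIMIZE my_catalog.my_schema.target_table",
    # Optionally inspect table details (clustering columns, file counts).
    "DESCRIBE DETAIL my_catalog.my_schema.target_table",
]

def run_maintenance(spark):
    for stmt in MAINTENANCE_STATEMENTS:
        spark.sql(stmt)
```

How often to schedule it depends on the write rate; the frequency here is a tuning decision, not a fixed recommendation.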

vamsi_simbus
by Databricks Partner
  • 1704 Views
  • 5 replies
  • 0 kudos

Databricks System Table system.billing.usage Not Capturing Job Data in Real-Time

We’ve observed that the system.billing.usage table in Databricks is not capturing job usage data in real-time. There appears to be a noticeable delay between when jobs are executed and when their corresponding usage records appear in the system table...

Latest Reply
vamsi_simbus
Databricks Partner
  • 0 kudos

Hi @szymon_dybczak, is there any alternative approach to find the DBU usage of currently running jobs?

4 More Replies
malla_aayush
by Databricks Partner
  • 810 Views
  • 2 replies
  • 1 kudos

Resolved! Not able to find lab for Data Engineering Learning Path

I am not able to find the Data Engineering learning path. I opened the Partner Databricks Academy lab, which redirected to Uplimit, where I also enrolled myself in an instructor-led course, but I am not able to see any labs.

Latest Reply
junaid-databrix
New Contributor III
  • 1 kudos

You are right, the self-paced e-learning courses do not include any labs. However, labs are available in the instructor-led courses on Uplimit. I recently enrolled in one and here is how it worked for me: 1. On the Uplimit portal, enroll for an upc...

1 More Replies
susanne
by Databricks Partner
  • 1641 Views
  • 3 replies
  • 0 kudos

Resolved! Authentication failure Lakeflow SQL Server Ingestion

Hi all, I am trying to create a Lakeflow ingestion pipeline for SQL Server, but I am running into the following authentication error when using my Databricks database user for the connection: Gateway is stopping. Authentication failure while obtaining ...

Latest Reply
susanne
Databricks Partner
  • 0 kudos

Hi @szymon_dybczak, thanks a lot, that did the trick!

2 More Replies
Alena
by New Contributor II
  • 714 Views
  • 1 reply
  • 0 kudos

Programmatically set minimum workers for a job cluster based on file size?

I’m running an ingestion pipeline with a Databricks job:A file lands in S3A Lambda is triggeredThe Lambda runs a Databricks jobThe incoming files vary a lot in size, which makes processing times vary as well. My job cluster has autoscaling enabled, b...

Latest Reply
kerem
Contributor
  • 0 kudos

Hi Alena, the Jobs API has update functionality to be able to do that: https://docs.databricks.com/api/workspace/jobs_21/update. If for some reason you can't update your pipeline before you trigger it, you can also consider creating a new job with desired c...
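The Lambda-side logic might look roughly like this. The size thresholds, `job_cluster_key`, host, and token handling are all assumptions, and whether a partial `new_cluster` in jobs/update merges with or fully replaces the existing cluster spec should be verified against the Jobs API docs:

```python
import json
import urllib.request

def pick_min_workers(file_size_bytes):
    """Map incoming file size to a minimum worker count. The thresholds
    here are made-up examples; tune them for your own workload."""
    gib = file_size_bytes / (1024 ** 3)
    if gib < 1:
        return 1
    if gib < 10:
        return 4
    return 8

def update_job_min_workers(host, token, job_id, min_workers, max_workers=16):
    """Patch the job cluster's autoscale range via the Jobs 2.1 update API;
    the run itself is then triggered separately (e.g. from the Lambda)."""
    payload = {
        "job_id": job_id,
        "new_settings": {
            "job_clusters": [{
                "job_cluster_key": "main",  # must match the key in your job spec
                "new_cluster": {
                    "autoscale": {"min_workers": min_workers,
                                  "max_workers": max_workers},
                },
            }],
        },
    }
    req = urllib.request.Request(
        f"{host}/api/2.1/jobs/update",
        data=json.dumps(payload).encode(),
        headers={"Authorization": f"Bearer {token}",
                 "Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return resp.status
```

Usage would be: read the file size from the S3 event, call `pick_min_workers`, call `update_job_min_workers`, then trigger the run via jobs/run-now.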

Nick_Pacey
by New Contributor III
  • 908 Views
  • 2 replies
  • 0 kudos

Question on best method to deliver Azure SQL Server data into Databricks Bronze and Silver.

Hi, we have an Azure SQL Server (replicating from an on-prem SQL Server) that is required to be in Databricks Bronze and beyond. This database has 100s of tables that are all required. Size of tables will vary from very small up to the biggest tables 1...

Latest Reply
kerem
Contributor
  • 0 kudos

Hey Nick, have you tried the SQL Server connector with Lakeflow Connect? This should provide a native connection to your SQL Server, potentially allowing for incremental updates and CDC setup. https://learn.microsoft.com/en-us/azure/databricks/ingestion...

1 More Replies
yit
by Databricks Partner
  • 563 Views
  • 1 reply
  • 0 kudos

Unable to Upcast DECIMAL Field in Autoloader

I’m using Autoloader to read Parquet files and write them to a Delta table. I want to enforce a schema in which Column1 is defined as DECIMAL(10,2). However, in the Parquet files being ingested, Column1 is defined as DECIMAL(8,2).When Autoloader read...

Latest Reply
kerem
Contributor
  • 0 kudos

Hi Yit, to potentially simplify your issue, why not read this column as String in your stream and then cast it to DECIMAL(10,2) afterwards? That should eliminate the rescue behaviour. Kerem Durak
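A sketch of that suggestion, with placeholder paths. Whether `cloudFiles.schemaHints` can override a Parquet decimal as a string is worth verifying against the Auto Loader docs; passing a full explicit schema is an alternative:

```python
def stream_with_cast(spark, source_dir, schema_location):
    """Read Column1 as a string in the Auto Loader stream, then cast it to
    DECIMAL(10,2) so the DECIMAL(8,2) source values upcast cleanly instead
    of landing in _rescued_data."""
    from pyspark.sql import functions as F  # local import: sketch only

    df = (
        spark.readStream.format("cloudFiles")
        .option("cloudFiles.format", "parquet")
        .option("cloudFiles.schemaLocation", schema_location)
        .option("cloudFiles.schemaHints", "Column1 STRING")  # force string on read
        .load(source_dir)
    )
    return df.withColumn("Column1", F.col("Column1").cast("decimal(10,2)"))
```

The returned stream can then be written to the Delta target as usual with writeStream.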
