Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

slimbnsalah
by New Contributor II
  • 2271 Views
  • 2 replies
  • 0 kudos

Use Salesforce Lakeflow Connector with a Salesforce Connected App

Hello, I'm trying to use the new Salesforce Lakeflow connector to ingest data into my Databricks account. However, I only see the option to connect using a normal user, whereas I want to use a Salesforce App, just like how it is described here Run fede...

Latest Reply
Ajay-Pandey
Databricks MVP
  • 0 kudos

@slimbnsalah Please select the connection type "Salesforce Data Cloud"; you will then be asked for the details.

1 More Replies
ManojkMohan
by Honored Contributor II
  • 955 Views
  • 4 replies
  • 2 kudos

Resolved! Silver to Gold Layer | Running ML - Debug Help Needed

Problem I am solving: read the raw IPL sports data CSV → bronze layer; clean and aggregate → silver layer; summarize team stats → gold layer; prepare ML-ready features and train a Random Forest classifier to predict match winners. Getting error: [PARS...

ManojkMohan_0-1756389913835.png
Latest Reply
BS_THE_ANALYST
Databricks Partner
  • 2 kudos

@ManojkMohan thanks for sharing this. I'm looking at starting an ML project in the coming weeks; I might have to bring this forward. Feeling motivated with that confusion matrix in your output. Congrats on getting it working! All the best, BS

3 More Replies
Srinivas5
by New Contributor II
  • 1068 Views
  • 6 replies
  • 3 kudos

Jar File Upload To Workspace

#dbfs I am unable to upload a jar file to DBFS for a job cluster as it's deprecated now. I need to upload it to the workspace and install it on the cluster; however, my jar is 70 MB and I can't upload it through the API or CLI, as the max size is 50 MB. Is there alternati...

Latest Reply
Advika
Community Manager
  • 3 kudos

Hi @Srinivas5! Were you able to find a solution or approach that worked? If so, please mark the helpful reply as the Accepted Solution, or share your approach so others can benefit as well.

5 More Replies
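One workaround often used for artifacts over the workspace upload limit is to place them in a Unity Catalog volume through the Files API, which accepts a streamed request body. A minimal stdlib sketch, assuming a hypothetical workspace URL, token, and volume path (verify the endpoint shape against the current Files API docs):

```python
import urllib.request

def files_api_request(host, token, volume_path, data):
    """Build a PUT request for the Databricks Files API, which streams the
    body and so is not subject to the workspace-import size cap.
    host, token, and volume_path are hypothetical placeholders."""
    url = f"{host}/api/2.0/fs/files{volume_path}?overwrite=true"
    req = urllib.request.Request(url, data=data, method="PUT")
    req.add_header("Authorization", f"Bearer {token}")
    req.add_header("Content-Type", "application/octet-stream")
    return req

# Usage (hypothetical workspace URL and volume path):
# with open("app.jar", "rb") as f:
#     req = files_api_request("https://adb-1234567890.azuredatabricks.net",
#                             token, "/Volumes/main/default/jars/app.jar",
#                             f.read())
#     urllib.request.urlopen(req)
```

The jar can then be referenced from the volume path in the cluster's library configuration.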
ShankarM
by Databricks Partner
  • 517 Views
  • 2 replies
  • 0 kudos

Notebook exposure

I have created a notebook as per client requirements. I have to migrate the notebook to the client env for testing with live data, but do not want to expose the Databricks notebook code to the testers in the client env. Is there a way to package the not...

Latest Reply
WiliamRosa
Databricks Partner
  • 0 kudos

Hi @ShankarM, I've had to do something similar: packaging a Python class as a wheel. This documentation might help: https://docs.databricks.com/aws/en/dev-tools/bundles/python-wheel

1 More Replies
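The wheel-packaging approach in the reply can be sketched roughly as follows. The module, function, and paths are hypothetical placeholders; note that a wheel still contains .py source, so this limits exposure by keeping logic out of the testers' notebooks rather than truly hiding code.

```python
# my_pkg/transforms.py -- hypothetical module holding the logic that used to
# live in the notebook. The notebook deployed to the client env then only
# imports and calls it, so the notebook itself stays trivial.

def winsorize(values, lo, hi):
    """Clamp each value into [lo, hi]; a stand-in for the real notebook logic."""
    return [min(max(v, lo), hi) for v in values]

# Build locally, then install in the client workspace:
#   python -m build                  # -> dist/my_pkg-0.1.0-py3-none-any.whl
#   %pip install /Workspace/Shared/my_pkg-0.1.0-py3-none-any.whl
```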
DatabricksEngi1
by Contributor
  • 1700 Views
  • 2 replies
  • 1 kudos

Resolved! databricks assets bundles issue

Hi all, I'm working with Databricks Asset Bundles (DAB) and trying to move from a single repository-level bundle to a structure where each workflow (folder under resources/jobs) has its own bundle. My repository contains: shared src/variables.yml a...

Latest Reply
DatabricksEngi1
Contributor
  • 1 kudos

I solved it. For some reason, the Terraform folder created under the bundles wasn't set up correctly. I copied it from a working bundle, and everything completed successfully.

1 More Replies
JPNP
by Databricks Partner
  • 1499 Views
  • 3 replies
  • 1 kudos

Not able to create Secret scope in Azure Databricks

Hello, I am trying to create an Azure Key Vault-backed secret scope, but it is failing with the below error. I have tried clearing the cache, logging out, and using an incognito browser as well, but I am not able to create a scope. Can you please help here?

JPNP_0-1755692310711.jpeg
Latest Reply
Yogesh_Verma_
Contributor II
  • 1 kudos

If the UI keeps failing with that vague error, the CLI approach suggested above is the best next step, since it usually gives a clearer error message. Also make sure that: the service principal you're using to create the scope has Key Vault Administra...

2 More Replies
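For reference, the CLI route suggested in the reply maps onto the secrets scopes/create REST endpoint. A minimal stdlib sketch that only builds the request (field names per the Secrets API docs; the host, token, and vault details are placeholders, so verify against current documentation):

```python
import json
import urllib.request

def create_kv_scope_request(host, token, scope, kv_resource_id, kv_dns_name):
    """Build the Secrets API call that creates an Azure Key Vault-backed
    scope. All argument values are hypothetical placeholders."""
    payload = {
        "scope": scope,
        "scope_backend_type": "AZURE_KEYVAULT",
        "backend_azure_keyvault": {
            "resource_id": kv_resource_id,  # full ARM resource ID of the vault
            "dns_name": kv_dns_name,        # e.g. https://<vault>.vault.azure.net/
        },
    }
    return urllib.request.Request(
        f"{host}/api/2.0/secrets/scopes/create",
        data=json.dumps(payload).encode(),
        method="POST",
        headers={"Authorization": f"Bearer {token}"},
    )
```

A failure here usually returns a more specific error body than the UI shows.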
jar
by Contributor
  • 476 Views
  • 1 reply
  • 0 kudos

Excluding job update from DAB .yml deployment

Hi. We have a range of scheduled jobs and _one_ continuous job, all defined in .yml and deployed with DAB. The continuous job is paused by default and we use a scheduled notebook job to pause and unpause it so that it only runs during business ho...

Latest Reply
Yogesh_Verma_
Contributor II
  • 0 kudos

You’re running into this because DAB treats the YAML definition as the source of truth — so every time you redeploy, it will reset the job state (including the paused/running status) back to what’s defined in the file. Unfortunately, there isn’t curr...

karthik_p
by Databricks Partner
  • 16533 Views
  • 5 replies
  • 1 kudos

Does Delta Live Tables support identity columns?

We are able to test identity columns using SQL/Python, but when we try the same using DLT, we are not seeing values under the identity column. It is always empty for the column we created: "id BIGINT GENERATED ALWAYS AS IDENTITY"

Latest Reply
Gowrish
New Contributor II
  • 1 kudos

Hi, I see from the following Databricks documentation - https://docs.databricks.com/aws/en/dlt/limitations - it states the following, which kind of gives the impression that you can define an identity column on a streaming table: Identity columns might be recom...

4 More Replies
mtreigelman
by New Contributor III
  • 717 Views
  • 1 reply
  • 3 kudos

First Lakeflow (DLT) Pipeline Best Practice Question

Hi, I am writing my first streaming pipeline and trying to ensure it is set up to work as a "Lakeflow" pipeline. It is connecting an external Oracle database with some external Azure Blob storage data (all managed in the same Unity Catalog). The pipe...

Latest Reply
BS_THE_ANALYST
Databricks Partner
  • 3 kudos

@mtreigelman thanks for providing the update. If you wouldn't mind, could you explain why you think the first way didn't work and why the second way did? Then you can mark your response as a solution to the question. I found this article to be useful ...

ck7007
by Contributor II
  • 746 Views
  • 1 reply
  • 2 kudos

Cost

Reduced Monthly Databricks Bill from $47K to $12.7K. The Problem: we were scanning 2.3 TB for queries needing only 8 GB of data. Three Quick Wins: 1. Multi-dimensional Partitioning (30% savings). # Before: df.write.partitionBy("date").parquet(path) # After: parti...

Latest Reply
BS_THE_ANALYST
Databricks Partner
  • 2 kudos

@ck7007 thanks so much for sharing! That's such a saving, by the way. Congrats. Out of curiosity, did you consider using Liquid Clustering, which was meant to replace partitioning and Z-ORDER: https://docs.databricks.com/aws/en/delta/clustering I found...

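For context, the Liquid Clustering suggestion in the reply looks roughly like this in SQL. This is only a sketch: the table and column names are invented, and actually running the DDL requires a Databricks SQL context, so the helper just builds the statement string.

```python
def liquid_cluster_ddl(table, cluster_cols, source):
    """Build a CREATE TABLE ... CLUSTER BY statement (Delta Liquid
    Clustering), the suggested successor to partitionBy/ZORDER tuning.
    All names here are hypothetical placeholders."""
    return (f"CREATE TABLE {table} CLUSTER BY ({', '.join(cluster_cols)}) "
            f"AS SELECT * FROM {source}")

# On Databricks you would run, e.g.:
#   spark.sql(liquid_cluster_ddl("events_clustered",
#                                ["event_date", "region"], "events_raw"))
```

Unlike static partitioning, the clustering keys can later be changed with ALTER TABLE without rewriting the layout by hand.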
AbhishekNakka15
by Databricks Partner
  • 702 Views
  • 1 reply
  • 1 kudos

Resolved! Unable to login to partner account

When I try to log in to the partner account with my office email, it says "The service is currently unavailable. Please try again later." It says "You are not authorized to access https://partner-academy.databricks.com. Please select a platform you ca...

Latest Reply
Advika
Community Manager
  • 1 kudos

Hello @AbhishekNakka15! Please raise a ticket with the Databricks Support Team, and include your email address so they can review your account and provide further assistance.

viralpatel
by New Contributor II
  • 1229 Views
  • 2 replies
  • 1 kudos

Lakebridge Synapse Conversion to DBX and Custom transpiler

I have 2 questions about the Lakebridge solution. Synapse with dedicated pool conversion: we were conducting a PoC for Synapse to DBX migration using Lakebridge. What we have observed is that the conversions are not correct. I was anticipating all tables wi...

Latest Reply
yourssanjeev
Databricks Partner
  • 1 kudos

We are also checking on this use case, but got it confirmed from Databricks that it does not work for this use case yet; not sure whether it is on their roadmap.

1 More Replies
vishalv4476
by New Contributor III
  • 624 Views
  • 1 reply
  • 0 kudos

Databricks job runs failures Py4JJavaError: An error occurred while calling o404.sql. : java.util.No

Hi, we had a successfully running pipeline, but it started failing since 20th August; no changes were published. Can you please guide me to resolve this issue? I've tried increasing 'delta.deletedFileRetentionDuration' = 'interval 365 days' but it didn't hel...

Latest Reply
SP_6721
Honored Contributor II
  • 0 kudos

Hi @vishalv4476 ,The error is likely due to a corrupted Delta transaction log or files deleted manually/outside of Delta. Check the table history and verify that no user or automated process removed data files. If issues are found, restore the table ...

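The check-history-then-restore advice above corresponds to Delta's DESCRIBE HISTORY and RESTORE commands. A small sketch with a hypothetical table name; the helper only builds the SQL string, since running it needs a Databricks session:

```python
def restore_sql(table, version):
    """Delta RESTORE statement rolling a table back to an earlier version.
    The table name and version are hypothetical placeholders."""
    return f"RESTORE TABLE {table} TO VERSION AS OF {version}"

# Inspect the history first, then restore a known-good version:
#   spark.sql("DESCRIBE HISTORY sales_bronze")
#   spark.sql(restore_sql("sales_bronze", 42))
```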
anazen13
by New Contributor III
  • 1805 Views
  • 9 replies
  • 2 kudos

databricks api to create a serverless job

I am trying to follow your documentation on how to create a serverless job via API: https://docs.databricks.com/api/workspace/jobs/create#environments-spec-environment_version So I see that sending the JSON request resulted in me seeing a serverless clus...

Latest Reply
siennafaleiro
New Contributor II
  • 2 kudos

It looks like you're hitting one of the current limitations of Databricks serverless jobs. Even though the API supports passing an environments object, only certain fields are honored right now. In particular: the environment_version parameter will de...

8 More Replies
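The environments behavior described in the reply can be illustrated with a sketch of the jobs/create payload. Field names follow the jobs/create documentation cited in the question; the job name, notebook path, and environment values are placeholders, not a definitive spec.

```python
def serverless_job_payload(name, notebook_path):
    """Jobs API create payload for a serverless notebook task: the task
    omits any cluster reference and instead points at an 'environments'
    entry via environment_key. Values are hypothetical placeholders."""
    return {
        "name": name,
        "tasks": [
            {
                "task_key": "main",
                "notebook_task": {"notebook_path": notebook_path},
                "environment_key": "default",  # links the task to the env below
            }
        ],
        "environments": [
            {
                "environment_key": "default",
                "spec": {"environment_version": "2"},
            }
        ],
    }

# POST this as JSON (with a bearer token) to the workspace's jobs/create
# endpoint; as noted above, expect some spec fields to be ignored today.
```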
zero234
by New Contributor III
  • 7317 Views
  • 3 replies
  • 1 kudos

I have created a materialized view using a Delta Live Tables pipeline and it's not appending data

I have created a materialized view table using a Delta Live Tables pipeline; for some reason it is overwriting data every day. I want it to append data to the table instead of doing a full refresh. Suppose I had 8 million records in the table and if I run the...

Latest Reply
UMAREDDY06
New Contributor II
  • 1 kudos

[expect_table_not_view.no_alternative] 'insert' expects a table but dim_airport_unharmonised is a view. Can you please help how to resolve this? Thanks, Uma Devi

2 More Replies