Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

tarunnagpal
by New Contributor III
  • 1033 Views
  • 7 replies
  • 3 kudos

Lakebridge questions

We have a few questions before we propose Lakebridge as the migration tooling for one of our customers, where the requirement is to migrate from Redshift to Databricks. We would appreciate a quick response so we can proceed with the next steps: Our u...

Latest Reply
sky_bricks
Visitor
  • 3 kudos

Hi community, We're currently planning a migration from an on-premises SQL Server data warehouse (with associated SSIS packages) to Databricks Unity Catalog. As part of this effort, we're evaluating the use of Lakebridge for assessment, conversion, and...

6 More Replies
Jonathan_
by New Contributor III
  • 355 Views
  • 7 replies
  • 6 kudos

Slow PySpark operations after long DAG that contains many joins and transformations

We are using PySpark and notice that when we do many transformations/aggregations/joins on the data, at some point the execution time of simple tasks (count, display, union of two tables, ...) becomes very slow, even though we have small data (ex...

Latest Reply
Jonathan_
New Contributor III
  • 6 kudos

It's a cluster with 128 GB of memory; looking in the Spark UI there is 54 GB for storage memory. Honestly I don't think it's a memory issue. Like I said, it's small data, and if we checkpoint at some point and then continue, we don't have the problem afte...

6 More Replies
rajg
by New Contributor
  • 128 Views
  • 1 reply
  • 1 kudos

Cannot export embedded dashboard widget as CSV or other formats except PNG

I've integrated a Databricks dashboard into my web application for all my users, following the guidelines in this article: Embedding Databricks Dashboards. This integration worked perfectly initially. However, I'm now encountering an issue with exporti...

Latest Reply
stbjelcevic
Databricks Employee
  • 1 kudos

Hi @rajg , Based on the link you shared, it looks to me like you have an external embedding situation? If so, this is a feature that is not currently available, but it is a commonly requested feature. External dashboard embedding is currently in Publ...

Leladams
by New Contributor III
  • 13898 Views
  • 10 replies
  • 2 kudos

What is the best way to read an MS Access .accdb database into Databricks from a mounted drive?

I am currently trying to read in .accdb files from a mounted drive. Based on my research, it looks like I would have to use a package like JayDeBeApi with UCanAccess drivers, or pyodbc with MS Access drivers. Will this work? Thanks for any help.

Latest Reply
Anonymous
Not applicable
  • 2 kudos

Hi @Leland Adams, hope you are doing well. Thank you for posting your question and giving us additional information. Were you able to solve the query? We'd love to hear from you.

9 More Replies
DBU100725
by New Contributor II
  • 287 Views
  • 2 replies
  • 0 kudos

URGENT: Delta writes to S3 fail after workspace migrated to Premium

Delta writes to S3 fail after the workspace migrated to Premium (401 "Credential was not sent or unsupported type"). Summary: After our Databricks workspace migrated from Standard to Premium, all Delta writes to S3 started failing with: com.databricks.s3commi...

Latest Reply
dkushari
Databricks Employee
  • 0 kudos

Hi @DBU100725 - Are you using a No isolation shared cluster? Can you check if this was turned ON in your account?  

1 More Replies
Shefali
by New Contributor
  • 421 Views
  • 1 reply
  • 1 kudos

Lakebridge conversion tool: Incorrect Databricks SQL script generated

Hi Team, I was able to successfully install and use the Lakebridge code conversion tool to convert my SQL Server script into a Databricks SQL script. However, the generated script contains several syntax errors. Could you please let me know if I might...

Latest Reply
AbhaySingh
New Contributor
  • 1 kudos

Hi there! Known Lakebridge issues are listed here: https://github.com/databrickslabs/lakebridge/issues. Do any of these apply to your use case?
1. Variable scope errors in WHERE clauses or subqueries
2. DELETE/UPDATE FROM statements incorrectly converted ...

Oumeima
by New Contributor
  • 1819 Views
  • 1 reply
  • 1 kudos

I can't use my own .whl package in Databricks app with databricks asset bundles

I am building a Databricks app using Databricks Asset Bundles. I need to use a helpers package that I built as an artifact and use in other resources outside the app. The only way to use it is to have the built package inside the app source code f...

Latest Reply
stbjelcevic
Databricks Employee
  • 1 kudos

Hi @Oumeima , One potential way around this is to upload the wheel file into a Unity Catalog volume or workspace file. For the volume route, reference it directly in your app’s requirements.txt using an absolute /Volumes/<catalog>/<schema>/<volume>/....

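For anyone following the volume route described above, a minimal sketch of the app's requirements file (the catalog/schema/volume and wheel file names below are hypothetical placeholders, not taken from this thread):

```
# requirements.txt inside the app source folder (hypothetical paths)
/Volumes/main/default/libs/helpers-0.1.0-py3-none-any.whl
```

The app deployment then installs the wheel from the Unity Catalog volume, so the built package does not need to be copied into the app source tree on every build.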
Davila
by New Contributor II
  • 1524 Views
  • 1 reply
  • 1 kudos

Issue with Root Folder Configuration in Databricks Asset Bundles for DLT Pipelines

I'm currently working with Databricks Asset Bundles to deploy my DLT pipelines, but I've encountered an issue I can't resolve. The problem is that I'm unable to configure the root folder within the Asset Bundle in a way that lets me define a custom pa...

Latest Reply
Louis_Frolio
Databricks Employee
  • 1 kudos

Hey @Davila, I did some digging and have come up with some things you can think about as you work through your issue. Here's a clear way to think about what you're seeing and how to proceed. What's going on: that "Root folder" field in the DLT UI is in...

lauraxyz
by Contributor
  • 1985 Views
  • 6 replies
  • 0 kudos

Notebook in path workspace/repos/.internal/**_commits/** was unable to be accessed

I have a workflow job (source is Git) to access a notebook and execute it. From the job, it failed with error: Py4JJavaError: An error occurred while calling o466.run. : com.databricks.WorkflowException: com.databricks.NotebookExecutionException: FAI...

Latest Reply
lauraxyz
Contributor
  • 0 kudos

Just some clarification: the caller notebook can be found with no issues, whether the task's source is GIT or WORKSPACE. However, the callee notebook, which is invoked by the caller notebook with dbutils.notebook.run(), cannot be found if the call...

5 More Replies
JordanYaker
by Contributor
  • 2325 Views
  • 2 replies
  • 0 kudos

Integration options for Databricks Jobs and DataDog?

I know that there is already the Databricks (technically Spark) integration for DataDog. Unfortunately, that integration only covers the cluster execution itself, meaning only Cluster Metrics and Spark Jobs and Tasks. I'm looking for somethin...

Latest Reply
greg-0935
New Contributor
  • 0 kudos

Personally, I'm using their Data Jobs Monitoring product (https://docs.datadoghq.com/data_jobs/databricks/), which works great and gives the right insights both for high-level job execution stats and deeper Spark metrics.

1 More Replies
Dhruv-22
by Contributor
  • 74 Views
  • 2 replies
  • 1 kudos

Resolved! Can't mergeSchema handle int and bigint?

I have a table which has a column of data type 'bigint'. While overwriting it with new data (I do full loads), I used 'mergeSchema' to handle schema changes. The new data's datatype was int. I thought mergeSchema could easily handle that, but...

Latest Reply
Chiran-Gajula
New Contributor
  • 1 kudos

Hi Dhruv, Delta won't automatically upcast unless you explicitly handle it. Cast the column Lob_Pk to LongType (which maps to BIGINT in SQL/Delta). Try the snippet below: from pyspark.sql.functions import col; from pyspark.sql.types import LongType; crm_retail_...

1 More Replies
saicharandeepb
by New Contributor III
  • 79 Views
  • 3 replies
  • 1 kudos

How to Retrieve DBU Count per Compute Type for Accurate Cost Calculation?

Hello everyone, We are currently working on a cost analysis initiative to gain deeper insights into our Databricks usage. As part of this effort, we are trying to calculate the hourly cost of each Databricks compute instance by utilizing the Azure Ret...

Latest Reply
nayan_wylde
Honored Contributor III
  • 1 kudos

1. Is there a documented way to retrieve the DBU count per VM or compute type? Yes, but it's not directly exposed via a single API or table. The DBU consumption rate depends on:
  • Compute type (Jobs Compute, All-Purpose Compute, SQL Compute, etc.)
  • VM inst...

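To make the cost arithmetic above concrete, here is a toy sketch combining a VM rate with a DBU emission rate; all three rates are made-up placeholders (real values come from the Azure Retail Prices API and the Databricks pricing sheet for your SKU, workload type, and tier):

```python
# Hourly cluster cost = VM infrastructure cost + DBU licensing cost.
# All rates below are hypothetical placeholders for illustration only.
VM_PRICE_PER_HOUR = 0.50   # $/hour for the VM SKU (assumed)
DBU_PER_HOUR = 0.75        # DBUs emitted per node-hour for that SKU (assumed)
DBU_PRICE = 0.55           # $/DBU for the workload type and tier (assumed)

def hourly_cost(num_nodes: int) -> float:
    """Estimate the total hourly cost of a cluster with num_nodes nodes."""
    infra = num_nodes * VM_PRICE_PER_HOUR          # pay the cloud provider
    dbus = num_nodes * DBU_PER_HOUR * DBU_PRICE    # pay Databricks
    return round(infra + dbus, 4)

print(hourly_cost(4))  # 4 nodes: 4*0.50 + 4*0.75*0.55 = 3.65
```

The same shape works per compute type: swap in the DBU emission rate for the VM size and the $/DBU rate for the workload, then multiply by actual node-hours from your usage logs.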
2 More Replies
Marthinus
by New Contributor III
  • 101 Views
  • 4 replies
  • 1 kudos

[Databricks Asset Bundles] Bug: driver_node_type_id not updated

Working with Databricks Asset Bundles (using the new Python-based definition), if you have a job_cluster defined with driver_node_type_id and then update it to define only node_type_id, the driver node type never gets update...

Latest Reply
Chiran-Gajula
New Contributor
  • 1 kudos

There is no built-in way in Databricks Asset Bundles or Terraform to have driver_node_type_id automatically pick up the value of node_type_id on update. "You must set both explicitly in your configuration." You can always see your updated resource detail from the ...

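A minimal bundle sketch of that workaround, with both node type fields set explicitly (the resource key, Spark version, and node types below are illustrative, not taken from the thread):

```yaml
# databricks.yml fragment -- set both node type fields explicitly,
# since driver_node_type_id is not re-derived from node_type_id on update.
resources:
  jobs:
    example_job:
      name: example-job
      job_clusters:
        - job_cluster_key: main
          new_cluster:
            spark_version: 15.4.x-scala2.12
            node_type_id: Standard_DS3_v2
            driver_node_type_id: Standard_DS3_v2  # keep in sync with node_type_id
            num_workers: 2
```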
3 More Replies
Dhruv-22
by Contributor
  • 1687 Views
  • 2 replies
  • 0 kudos

Resolved! Understanding least common type in databricks

I was reading the data type rules and found out about the least common type. I have a question: what is the least common type of STRING and INT? The referenced link gives the following example, saying the least common type is BIGINT. -- The least common type between...

Latest Reply
Dhruv-22
Contributor
  • 0 kudos

The question is solved here - link

1 More Replies
