Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

himoshi
by New Contributor II
  • 1293 Views
  • 1 reply
  • 1 kudos

Notebook execution keeps showing "Fetching result" endlessly

Hello, I am executing a very simple notebook with only two cells. In the first cell, I'm just defining some variables and printing the result. The second cell is more complex and it basically grabs those variables, parses a yaml file, and prints the ...

Latest Reply
Vidhi_Khaitan
Databricks Employee
  • 1 kudos

Hi @himoshi, good day! 1. Could you use print() instead of display()? 2. If you're printing a large YAML or dictionary object directly (e.g. print(parsed_yaml) or display(parsed_yaml)), try: import json; print(json.dumps(parsed_yaml, indent=2)[:500]) # Print...
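The truncated suggestion above can be expanded into a small runnable sketch. `parsed_yaml` here is a hypothetical stand-in for whatever dict the notebook actually builds from its YAML file:

```python
import json

# Hypothetical stand-in for the dict the notebook parses out of its YAML file
parsed_yaml = {"jobs": [{"name": f"job_{i}", "enabled": True} for i in range(100)]}

# Serialize and truncate to the first 500 characters, so the cell emits a
# small text result instead of asking the notebook UI to render a huge
# object (one reason a cell can appear to hang on "Fetching result")
preview = json.dumps(parsed_yaml, indent=2)[:500]
print(preview)
```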

omeryasirkucuk
by New Contributor II
  • 1199 Views
  • 3 replies
  • 2 kudos

Where are the path settings for the New SQL Editor?

Hi everyone, I have switched on the "New SQL Editor" on the SQL Editor side. When I opened this new feature, every new query was saved automatically under my user. This is crazy; I cannot manage my folders like this. But I didn't find any article or message about ...

Latest Reply
omeryasirkucuk
New Contributor II
  • 2 kudos

Hi @lingareddy_Alva, many thanks for your response. Currently I'm using the "Save As" method that you mentioned above. But I'm looking for the default path settings, because with my method I'm using a lot of Editor querying tables. I know that's not ...

2 More Replies
data-enthu
by New Contributor II
  • 1468 Views
  • 1 reply
  • 0 kudos

Accessing dbt artifacts, runs, and tests from a Databricks workflow using an automated script

I am running dbt on a Databricks job. It saves all documentation (manifest.json, run_results.json, etc.) under "Download Artifacts" on the job. I am not able to find a way to read those in code, transform them, and save them on Databricks. Tried the Jobs API. The arti...

Latest Reply
rokata
New Contributor II
  • 0 kudos

I know it is a late thread, but did you solve this? I am running into the same challenge you have. It seems you can use get/job and expand the field "tasks" to get the task IDs, and then you can use the get-output from that task ID. If that helps: htt...
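A minimal sketch of the approach described above, using the Jobs REST API endpoints `/api/2.1/jobs/runs/get` (to expand a run's tasks) and `/api/2.1/jobs/runs/get-output` (per task run). The host and token are placeholders, and exactly which dbt artifact fields get-output surfaces should be verified against your workspace:

```python
import json
import urllib.request

def run_get_url(host: str, run_id: int) -> str:
    # Returns the job run, including a "tasks" array with per-task run_ids
    return f"{host}/api/2.1/jobs/runs/get?run_id={run_id}"

def task_output_url(host: str, task_run_id: int) -> str:
    # Returns the output of a single task run
    return f"{host}/api/2.1/jobs/runs/get-output?run_id={task_run_id}"

def fetch_json(url: str, token: str) -> dict:
    # token is a placeholder for a Databricks personal access token
    req = urllib.request.Request(url, headers={"Authorization": f"Bearer {token}"})
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)

# Hypothetical usage:
# run = fetch_json(run_get_url("https://<workspace-host>", 123), token)
# for task in run.get("tasks", []):
#     out = fetch_json(task_output_url("https://<workspace-host>", task["run_id"]), token)
```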

VKe
by New Contributor III
  • 5240 Views
  • 6 replies
  • 5 kudos

Issue with HTML Table Styling in Databricks Alerts

Hi Community,I’m trying to create an alert in Databricks with a custom email notification that includes the results of a SQL query displayed in an HTML table. However, I am facing issues with styling the table, specifically with adding borders and ba...

Latest Reply
longchass1
New Contributor II
  • 5 kudos

We are experiencing the same problem with alert v2

5 More Replies
Kishori
by New Contributor II
  • 3391 Views
  • 3 replies
  • 1 kudos

lab mismatch with the course

Hi, I am taking a lab-included course, "Data Ingestion with Lakeflow Connect", and the labs shown in the course don't match the lab opened in Vocareum. The title of the Vocareum lab does match the course title, but the demos and labs are different. ...

Latest Reply
Advika
Community Manager
  • 1 kudos

@Kishori, could you please file a support ticket? The team will be able to review the course details and assist you directly. You can raise a ticket here: https://help.databricks.com/s/contact-us?ReqType=training

2 More Replies
rakeshsekar2025
by New Contributor III
  • 988 Views
  • 2 replies
  • 0 kudos

Not able to read sample data in Databricks on a shared cluster, but able to on a single-user cluster

I'm not able to view sample data using a shared cluster. Error: "Error getting sample data: socket closed". But when I use single cluster mode, I'm able to read the data.

Latest Reply
rakeshsekar2025
New Contributor III
  • 0 kudos

I've enabled outbound traffic on port 8443, but it's still not working. Please help me out here.

1 More Replies
pjruhnke
by New Contributor
  • 1345 Views
  • 2 replies
  • 0 kudos

Newest version of dbx-workspace always returns NoneType

I just updated the `databricks-sdk` library to the newest version on PyPI, and for some reason I am almost always getting this error: File "/home/site/wwwroot/.python_packages/lib/site-packages/databricks/sdk/credentials_provider.py", line 283, in to...

Latest Reply
nayan_wylde
Esteemed Contributor II
  • 0 kudos

It seems your issue is in getting the AAD token. You are using an SPN to authenticate. You can try updating the Azure packages too: azure-identity>=1.21.0, azure-core>=1.32.0, azure-mgmt-core>=1.6.0, databricks-sdk>=0.57.0

1 More Replies
laus
by New Contributor III
  • 43365 Views
  • 4 replies
  • 2 kudos

Resolved! How to solve Py4JJavaError: An error occurred while calling o5082.csv. : org.apache.spark.SparkException: Job aborted. when writing to csv

Hi, I get the error "Py4JJavaError: An error occurred while calling o5082.csv. : org.apache.spark.SparkException: Job aborted." when writing to CSV. Screenshot below with the detailed error. Any idea how to solve it? Thanks!

Latest Reply
Noopur_Nigam
Databricks Employee
  • 2 kudos

Please try output.coalesce(1).write.option("header","true").format("csv").save("path"). It seems to be the same as https://community.databricks.com/s/topic/0TO3f000000CjVqGAK/py4jjavaerror

3 More Replies
JMartins777
by New Contributor
  • 3077 Views
  • 2 replies
  • 3 kudos

Resolved! Azure Databricks Power Platform Connector - Doubts

Hello, regarding the recently released Azure Databricks connector, I want to connect it to a Power App, but I have two main questions: 1 - If the Databricks URL is in a private network, how does it work and how can I achieve this conn...

Latest Reply
nayan_wylde
Esteemed Contributor II
  • 3 kudos

1. Private Network Connectivity: for Databricks workspaces in private networks, you have a couple of options. Option A: On-premises Data Gateway. Install the Microsoft On-premises Data Gateway in your private network; the gateway acts as a bridge between Po...

1 More Replies
DanielaHello
by New Contributor
  • 1698 Views
  • 1 reply
  • 0 kudos

Free edition and serverless edition are not loading and are really slow

Good morning, I have been trying since last week to access two workspaces that I have (one on the Free Edition, the other on the paid serverless edition). Both of the workspaces are not loading; if they load they are very, very slow, and I cannot see ...

Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 0 kudos

Hi @DanielaHello, that's weird. According to the status page, there is no outage in any region currently. Could you try a different browser? Or try to log in using incognito mode. For instance, Databricks Free Edition is currently available only in one regi...

devyani_k
by Databricks Partner
  • 2766 Views
  • 1 reply
  • 1 kudos

Resolved! Extracting cost by user (run_by) for All-purpose clusters and SQL warehouse usage

Hi, I'm trying to extract usage cost per user (run_by) for workloads that utilize all-purpose clusters and SQL warehouses. I've been exploring the system.billing.usage table but noticed some challenges: 1. For records related to all-purpose clusters an...

Latest Reply
Louis_Frolio
Databricks Employee
  • 1 kudos

Attribution of compute usage to individual users for all-purpose clusters and SQL warehouses is only partially supported. Job compute (including serverless jobs) and workflows are reliably attributable to the job owner/service principal. For interact...

varni
by New Contributor III
  • 2313 Views
  • 2 replies
  • 6 kudos

Resolved! Unity Catalog blocks DML (UPDATE, DELETE) on static Delta tables — unable to use spark.sql

Hello,We’ve started migrating from Azure Databricks (Hive Metastore) to AWS Databricks with Unity Catalog. Our entire codebase was deliberately designed around spark.sql('...') using DML operations (UPDATE, DELETE, MERGE) for two reasons:In many case...

Latest Reply
varni
New Contributor III
  • 6 kudos

[RESOLVED] The issue was caused by the source tables being in Parquet format. After rewriting them as Delta tables, everything worked fine — including DML operations like UPDATE via DataFrame logic. Thanks!
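For anyone hitting the same wall: besides rewriting the tables, Databricks also has a CONVERT TO DELTA command that upgrades Parquet data in place. The helper below only builds the SQL text as a sketch; the table and path names are made up:

```python
def convert_table_sql(table: str) -> str:
    # For a metastore table currently registered as Parquet
    return f"CONVERT TO DELTA {table}"

def convert_path_sql(path: str, partition_schema: str = "") -> str:
    # For a raw Parquet directory; a partition schema must be supplied
    # when the directory is partitioned
    stmt = f"CONVERT TO DELTA parquet.`{path}`"
    if partition_schema:
        stmt += f" PARTITIONED BY ({partition_schema})"
    return stmt

# In a notebook you would then run, e.g.:
# spark.sql(convert_table_sql("my_catalog.my_schema.events"))
```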

1 More Replies
alsetr
by Databricks Partner
  • 2550 Views
  • 4 replies
  • 0 kudos

Executor OOM Error with AQE enabled

We have a Databricks Spark job. After migration from Databricks Runtime 10.4 to 15.4, one of our Spark jobs which uses a broadcast hint started to fail with the error: ```ERROR Executor: Exception in task 2.0 in stage 371.0 (TID 16912) org.apache.spark.memory.S...

Latest Reply
alsetr
Databricks Partner
  • 0 kudos

I found a similar issue: https://kb.databricks.com/python/job-fails-with-not-enough-memory-to-build-the-hash-map-error Looks like the reason for the error is a bug in a new Databricks feature called executor-side broadcast (EBJ, executor broadcast join)...

3 More Replies
wilsmith
by New Contributor
  • 753 Views
  • 1 reply
  • 0 kudos

COPY INTO maintaining row order

I have a CSV file in S3 and loading the rows in the order they appear in the file is necessary for parsing it out later. When using COPY INTO will it maintain that order so the bronze layer is in exactly the same order as the source file?

Latest Reply
Isi
Honored Contributor III
  • 0 kudos

Hey @wilsmith, COPY INTO does not guarantee the order of rows because it processes files in parallel using Spark's distributed architecture. This means that the ingestion engine reads different parts of the file simultaneously, potentially splitting a...
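Given that, one workaround is to make the order explicit in the data itself before ingestion: prepend a line-number column to the CSV, then ORDER BY that column downstream. A pure-Python sketch (the column name is made up):

```python
import csv
import io

def add_row_index(csv_text: str, index_col: str = "source_line") -> str:
    # Prepend a 1-based line-number column so the original file order can be
    # recovered with ORDER BY after a parallel, order-agnostic load
    reader = csv.reader(io.StringIO(csv_text))
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    header = next(reader)
    writer.writerow([index_col] + header)
    for i, row in enumerate(reader, start=1):
        writer.writerow([i] + row)
    return out.getvalue()
```

After loading, `ORDER BY source_line` in the bronze query reproduces the source file order regardless of how the load parallelized.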

briancuster63
by New Contributor II
  • 3527 Views
  • 4 replies
  • 0 kudos

Asset Bundle .py files being converted to notebooks when deployed to Databricks

Hi everyone, I'm finding a particularly frustrating issue whenever I try to run some Python code in an asset bundle in my workspace. The code and notebooks deploy fine, but once deployed, the code files get converted to notebooks and I'm no longer abl...

Latest Reply
olivier-soucy
Contributor
  • 0 kudos

I came here looking for a solution to the opposite problem: I was hoping my .py files would be available as notebooks (without adding extra headers). Unfortunately, this does not seem to be possible with DABs. @facebiranhari, if you have not solved your ...

3 More Replies