Data Engineering

Forum Posts

Sorted by:

by s_agarwal • New Contributor

Tuesday

121 Views
1 replies
0 kudos

Queries from Serverless compute referring to older/deleted/vacuumed version of the delta tables.

Hi Team,I have a unity catalog based managed delta table which I am able to successfully query using the regular compute/cluster options.But when I try to query the same table using a Serverless/SQL Warehouse, they are referring to an older version /...

Data Engineering

121 Views
1 replies
0 kudos

Tuesday

View Replies

Latest Reply

Saritha_S
Databricks Employee

9 hours ago

0 kudos

Hi @s_agarwal Please find below my findinsg for your query. Serverless uses cached Unity Catalog metadata Your UC metadata points to an old Delta version Regular clusters bypass this cache Fix by refreshing or forcing UC metadata rewrite

0 kudos

9 hours ago

by seefoods • Valued Contributor

Wednesday

122 Views
1 replies
0 kudos

spark conf for serveless jobs

Hello Guys, I use serveless on databricks Azure, so i have build a decorator which instanciate a SparkSession. My job use autolaoder / kafka using mode availableNow. Someone Knows which spark conf is required beacause i want to add it ? Thanx import...

Data Engineering

122 Views
1 replies
0 kudos

Wednesday

View Replies

Latest Reply

Saritha_S
Databricks Employee

9 hours ago

0 kudos

Hi @seefoods Please find below my findings for your case. You don’t need (and can’t meaningfully add) any Spark conf to enable availableNow on Databricks Serverless. Let me explain clearly, and then show what is safe to do in your decorator. availa...

0 kudos

9 hours ago

by Joost1024 • New Contributor

Wednesday

261 Views
5 replies
0 kudos

Read Array of Arrays of Objects JSON file using Spark

Hi Databricks Community! This is my first post in this forum, so I hope you can forgive me if it's not according to the forum best practices After lots of searching, I decided to share the peculiar issue I'm running into in this community.I try to lo...

Data Engineering

261 Views
5 replies
0 kudos

Wednesday

View Replies

Latest Reply

Joost1024
New Contributor

Wednesday

0 kudos

I guess I was a bit over enthusiastic by accepting the answer.When I run the following on the single object array of arrays (as shown in the original post) I get a single row with column "value" and value null. from pyspark.sql import functions as F,...

0 kudos

Wednesday

4 More Replies

by ScottH • New Contributor III

yesterday

98 Views
0 replies
0 kudos

Can I create a serverless budget policy via Python SDK on Azure Databricks?

Hi, I am trying to use the Databricks Python SDK (v0.74.0) to automate the creation of budget policies in our Databricks account. See the Python code below where I am trying to create a serverless budget policy. Note the error.When I click the "Diagn...

Data Engineering

98 Views
0 replies
0 kudos

yesterday

by Maxrb • New Contributor

Thursday

196 Views
7 replies
2 kudos

pkgutils walk_packages stopped working in DBR 17.2

Hi,After moving from Databricks runtime 17.1 to 17.2 suddenly my pkgutils walk_packages doesn't identify any packages within my repository anymore.This is my example code:import pkgutil import os packages = pkgutil.walk_packages([os.getcwd()]) print...

Data Engineering

196 Views
7 replies
2 kudos

Thursday

View Replies

Latest Reply

Louis_Frolio
Databricks Employee

yesterday

2 kudos

Hey @Maxrb , Just thinking out loud here, but this might be worth experimenting with. You could try using a Unity Catalog Volume as a lightweight package repository. Volumes can act as a secure, governed home for Python wheels (and JARs), and Databri...

2 kudos

yesterday

6 More Replies

by jpassaro • New Contributor

Thursday

118 Views
1 replies
0 kudos

does databricks respect parallel vacuum setting?

I am trying to run VACUUM on a delta table that i know has millions of obselete files.out of the box, VACUUM runs the deletes in sequence on the driver. that is bad news for me!According to OSS delta docs, the setting spark.databricks.delta.vacuum.pa...

Data Engineering

118 Views
1 replies
0 kudos

Thursday

View Replies

Latest Reply

Louis_Frolio
Databricks Employee

yesterday

0 kudos

Greetings @jpassaro , Thanks for laying out the context and the links. Let me clarify what’s actually happening here and how I’d recommend moving forward. Short answer No. On Databricks Runtime, the spark.databricks.delta.vacuum.parallelDelete.enabl...

0 kudos

yesterday

by ismaelhenzel • Contributor III

yesterday

47 Views
0 replies
0 kudos

Declarative Pipelines - Dynamic Overwrite

Regarding the limitations of declarative pipelines—specifically the inability to use replaceWhere—I discovered through testing that materialized views actually support dynamic overwrites. This handles several scenarios where replaceWhere would typica...

Data Engineering

47 Views
0 replies
0 kudos

yesterday

by oye • New Contributor II

Thursday

120 Views
3 replies
0 kudos

Unavailable GPU compute

Hello,I would like to create a ML compute with GPU. I am on GCP europe-west1 and the only available options for me are the G2 family and one instance of the A3 family (a3-highgpu-8g [H100]). I have been trying multiple times at different times but I ...

Data Engineering

120 Views
3 replies
0 kudos

Thursday

View Replies

Latest Reply

SP_6721
Honored Contributor II

Thursday

0 kudos

Hi @oye ,You’re hitting a cloud capacity issue, not a Databricks configuration problem. The Databricks GCP GPU docs list A2 and G2 as the supported GPU instance families. A3/H100 is not in the supported list: https://docs.databricks.com/gcp/en/comput...

0 kudos

Thursday

2 More Replies

by Sunil_Patidar • New Contributor

Wednesday

110 Views
1 replies
1 kudos

Unable to read from or write to Snowflake Open Catalog via Databricks

I have Snowflake Iceberg tables whose metadata is stored in Snowflake Open Catalog. I am trying to read these tables from the Open Catalog and write back to the Open Catalog using Databricks.I have explored the available documentation but haven’t bee...

Data Engineering

110 Views
1 replies
1 kudos

Wednesday

View Replies

Latest Reply

Louis_Frolio
Databricks Employee

Thursday

1 kudos

Greetings @Sunil_Patidar , Databricks and Snowflake can interoperate cleanly around Iceberg today — but how you do it matters. At a high level, interoperability works because both platforms meet at Apache Iceberg and the Iceberg REST Catalog API. Wh...

1 kudos

Thursday

by 969091 • New Contributor II

03-29-2023 2:53:41 AM

37648 Views
11 replies
10 kudos

Send custom emails from databricks notebook without using third party SMTP server. Would like to utilize databricks existing smtp or databricks api.

We want to use existing databricks smtp server or if databricks api can used to send custom emails. Databricks Workflows sends email notifications on success, failure, etc. of jobs but cannot send custom emails. So we want to send custom emails to di...

Data Engineering

37648 Views
11 replies
10 kudos

03-29-2023 2:53:41 AM

View Replies

Latest Reply

Shivaprasad
Contributor

Thursday

10 kudos

Did you able to get the custom email working from databricks notebook. I was trying but was not successful. let me know

10 kudos

Thursday

10 More Replies

by alesventus • Contributor

Tuesday

207 Views
5 replies
1 kudos

Resolved! Power BI refresh job task

I have tried Databricks job task to refresh power bi dataset and I have found 2 issues.1. I set up tables in Power BI Desktop using Import mode. After deploying the model to Power BI Service, I was able to download it as an Import mode model. However...

Data Engineering

207 Views
5 replies
1 kudos

Tuesday

View Replies

Latest Reply

emma_s
Databricks Employee

Tuesday

1 kudos

Can you send a screenshot of the refresh power BI task in the jobs UI within Databricks please?

1 kudos

Tuesday

4 More Replies

by timstrath • New Contributor

Thursday

94 Views
1 replies
1 kudos

Failed to create ingestion gateway due to no 'serverless compute'

Failed to create ingestion gatewayPipelines targeting catalogs using Default Storage must use serverless compute. If you don't have access to serverless compute, please contact Databricks to enable this feature for your workspace.

Data Engineering

94 Views
1 replies
1 kudos

Thursday

View Replies

Latest Reply

szymon_dybczak
Esteemed Contributor III

Thursday

1 kudos

Hi @timstrath ,It seems that your catalog is backed up by default storage. In that case this error is pretty explicit. You need to use serverless compute to create lakeflow ingestion pipeline if you have catalog using default storage (IBTW think you...

1 kudos

Thursday

by seefoods • Valued Contributor

2 weeks ago

227 Views
2 replies
2 kudos

Resolved! setup databricks connect on VsCode and PyCharm

Hello Guyz,Someone Know what's is the best pratices to setup databricks connect for Pycharm and VsCode using Docker, Justfile and .env file Cordially, Seefoods

Data Engineering

227 Views
2 replies
2 kudos

2 weeks ago

View Replies

Latest Reply

Gecofer
Contributor II

2 weeks ago

2 kudos

Hi @seefoods!I’ve worked with Databricks Connect and VSCode in different projects, and although your question mentions Docker, Justfile and .env, the “best practices” really depend on what you’re trying to do. Here’s what has worked best for me:1.- D...

2 kudos

2 weeks ago

1 More Replies

by rc10000 • New Contributor

Wednesday

143 Views
2 replies
3 kudos

Resolved! Data Bricks Engineer - DEA Exam vs Training

Hi, I love the Databricks resources but I'm a little confused on what training to take. My focus is studying and practicing for the Databricks Engineer Associate exam, but when I hear of the 'training', I'm not sure which training people are referrin...

Data Engineering

143 Views
2 replies
3 kudos

Wednesday

View Replies

Latest Reply

Advika
Community Manager

Wednesday

3 kudos

Hello @rc10000!+1 to what @Louis_Frolio mentioned above.The Learning Plan is designed for users preparing for the Databricks Certified Data Engineer Associate and Professional exams. Also below are a few paths, depending on what you’re looking for: ...

3 kudos

Wednesday

1 More Replies

by rc10000 • New Contributor

Tuesday

121 Views
1 replies
1 kudos

Resolved! Lakeflow Connect - Databricks Data Engineer Associate Exam Post-July 2025

Hi, I'm asking another Databricks Data Engineer Associate Exam Dec 2025 question. For those who have taken the DEA exam, is Lakeflow Connect a relevant topic for the test? Been a little confused on what resource to rely on besides the official study ...

Data Engineering

121 Views
1 replies
1 kudos

Tuesday

View Replies

Latest Reply

SP_6721
Honored Contributor II

Wednesday

1 kudos

Hi @rc10000,Lakeflow Connect is mentioned in the exam guide under training, but it’s more about the ingestion concepts. These topics come under the Development & Ingestion section. I’d suggest following the official exam guide first and Databricks Ac...

1 kudos

Wednesday

Databricks Community

Forum Posts

Queries from Serverless compute referring to older/deleted/vacuumed version of the delta tables.

spark conf for serveless jobs

Read Array of Arrays of Objects JSON file using Spark

Can I create a serverless budget policy via Python SDK on Azure Databricks?

pkgutils walk_packages stopped working in DBR 17.2

does databricks respect parallel vacuum setting?

Declarative Pipelines - Dynamic Overwrite

Unavailable GPU compute

Unable to read from or write to Snowflake Open Catalog via Databricks

Send custom emails from databricks notebook without using third party SMTP server. Would like to utilize databricks existing smtp or databricks api.

Resolved! Power BI refresh job task

Failed to create ingestion gateway due to no 'serverless compute'

Resolved! setup databricks connect on VsCode and PyCharm

Resolved! Data Bricks Engineer - DEA Exam vs Training

Resolved! Lakeflow Connect - Databricks Data Engineer Associate Exam Post-July 2025

Join Us as a Local Community Builder!

Power BI refresh job task

setup databricks connect on VsCode and PyCharm

Data Bricks Engineer - DEA Exam vs Training

Lakeflow Connect - Databricks Data Engineer Associ...

AWS_INSUFFICIENT_INSTANCE_CAPACITY_FAILURE when st...