Data Engineering

Forum Posts

by drag7ter • Contributor

05-21-2025 3:28:52 AM

2382 Views
1 replies
0 kudos

Delta sharing view and cached data in DSFF

I've created a view with row-level access based on the CURRENT_RECIPIENT() function in the WHERE clause, and I have hundreds of clients as recipients that query this view. The problem is, when I modify this view using CREATE OR REPLACE with new SQL code, and reci...

Latest Reply

AbhaySingh
New Contributor

an hour ago

0 kudos

Have you tried something like this already?

Force Cache Invalidation (Recommended)

-- After CREATE OR REPLACE VIEW, execute:
ALTER SHARE <share_name> REMOVE TABLE <schema>.<view_name>;
ALTER SHARE <share_name> ADD TABLE <schema>.<view_name>;

Thi...

by abhijit007 • New Contributor

06-30-2025 9:15:30 AM

1929 Views
1 replies
0 kudos

Lakebridge code conversion | Permission issue

Hi, I’ve successfully installed the transpile module from Lakebridge and tried using the tool to convert Informatica mappings into PySpark code. However, I’m encountering a PermissionError during execution. I’ve provided the relevant environment details and...

Lakebridge

Warehouse Migration

Latest Reply

dkushari
Databricks Employee

2 hours ago

0 kudos

Hi @abhijit007 - I see that this has been resolved in the 0.10.5 release. Can you please retest and confirm?

by AbhishekNakka • New Contributor II

3 hours ago

15 Views
0 replies
0 kudos

Databricks professional data engineer

Hi, I wanted to know if anyone has taken the Databricks professional data engineering exam recently, after Oct 2025. Has the syllabus been updated or not?

by AlexSantiago • New Contributor II

09-10-2022 11:40:01 PM

6786 Views
20 replies
4 kudos

spotify API get token - raw_input was called, but this frontend does not support input requests.

Hello everyone, I'm trying to use Spotify's API to analyse my music data, but I'm receiving an error during authentication, specifically when I try to get the token. My code is below. Is it a Databricks bug?

pip install spotipy
from spotipy.oauth2 import SpotifyO...

Latest Reply

Alyceveum25
Visitor

7 hours ago

4 kudos

Thank you

by DatabricksEngi1 • New Contributor III

yesterday

58 Views
2 replies
0 kudos

Problem in VS Code Extension

Until a few days ago, I was working with Databricks Connect using the VS Code extension, and everything worked perfectly. In my .databrickscfg file, I had authentication configured like this:

[name]
host:
token:

When I ran my code, everything worked fi...

Latest Reply

dkushari
Databricks Employee

9 hours ago

0 kudos

Hi @DatabricksEngi1 - Please ensure you have a Python Venv set up for each Python version that you use with Databricks Connect. Also, I have given step-by-step ways to debug the issue, clear the cache, etc [Read the files and instructions carefully b...

by raghvendrarm1 • New Contributor

Friday

131 Views
2 replies
1 kudos

Results from the spark application to driver

I tried to read many articles but I'm still not clear on this: the executors complete the execution of tasks and hold the results. 1. Are the results (output data) from all executors transported to the driver in all cases, or do executors persist them if tha...

Latest Reply

K_Anudeep
Databricks Employee

10 hours ago

1 kudos

Hello @raghvendrarm1 , Below are the answers to your questions: Do executors always send “results” to the driver? No. Only actions that return values (e.g., collect, take, first, count) bring data back to the driver. collect explicitly “returns al...

by pranaav93 • New Contributor II

yesterday

31 Views
0 replies
0 kudos

TransformWithState is not emitting for live streams

Hi Team, for one of my custom logic steps I went with a transformWithState processor. However, it is not working for live stream inputs. I have a start date filter on my df_base, and when I give a start date that is not current, the processor computes df_loss ...

apachespark

pyspark

StatefulStreaming

StructuredStreaming

transformWithState

by Saf4Databricks • New Contributor III

Friday

169 Views
3 replies
0 kudos

Cannot import pyspark.pipelines module

Question: What could be causing the following error in my code in a Databricks notebook, and how can we fix it? I'm using the latest Free Edition of Databricks, which has runtime version 17.2 and PySpark version 4.0.0. Error: ImportError: cannot im...

Latest Reply

dkushari
Databricks Employee

yesterday

0 kudos

Hi @Saf4Databricks - Are you trying to use it from a standalone Databricks notebook? You should only use it from within a Lakeflow Declarative Pipeline (LDP). The link you shared is about LDP. Here is an example where I used it.

by TalessRocha • New Contributor II

08-08-2025 4:28:54 PM

1308 Views
10 replies
8 kudos

Resolved! Connect to azure data lake storage using databricks free edition

Hello guys, I'm using Databricks Free Edition (serverless) and I am trying to connect to an Azure Data Lake Storage. The problem I'm having is that in the Free Edition we can't configure the cluster, so I tried to make the connection via notebook using ...

Latest Reply

BS_THE_ANALYST
Esteemed Contributor II

08-17-2025 7:38:54 AM

8 kudos

@TalessRocha thanks for getting back to us! Glad to hear you got it working, that's awesome. Best of luck with your projects. All the best, BS

by rajg • New Contributor

yesterday

69 Views
0 replies
0 kudos

Cannot export embedded dashboard widget as CSV or other formats except PNG

I’ve integrated a Databricks dashboard into my web application for all my users, following the guidelines in this article: Embedding Databricks Dashboards. This integration worked perfectly initially. However, I’m now encountering an issue with exporti...

by maninegi05 • New Contributor

Friday

161 Views
2 replies
0 kudos

DLT Pipeline Stopped working

Hello, suddenly our DLT pipelines are getting failures saying that:

LookupError: Traceback (most recent call last):
result_df = result_df.withColumn("input_file_path", col("_metadata.file_path")).withColumn( ...

Latest Reply

Khaja_Zaffer
Contributor III

Friday

0 kudos

Maybe there have been some internal updates from Databricks. You can check and switch your pipeline channel: in the DLT pipeline settings (under Advanced > Channel), confirm whether it's set to "Preview". Switch to "Current" for a more stable engine version, then...

by Malthe • Contributor II

Friday

248 Views
4 replies
0 kudos

Resolved! Can't enable "variantType-preview" using DLTs

Using create_streaming_table and passing table properties as follows, I get an error running the pipeline for the first time:

> Your table schema requires manually enablement of the following table feature(s): variantType-preview.

I'm using this code: c...

Latest Reply

Malthe
Contributor II

Friday

0 kudos

There's a workaround available in most situations which is to first create the table without the VARIANT column, run the pipeline at least once, and then add the column in a subsequent refresh.

by Upendra_Dwivedi • Contributor

05-22-2025 4:02:39 AM

2487 Views
1 replies
1 kudos

Databricks APP OBO User Authorization

Hi All, we are using the on-behalf-of-user authorization method for our app, and the x-forwarded-access-token expires after some time, so we have to redeploy our app to rectify the issue. I am not sure what the issue is or how we can keep the token aliv...

Latest Reply

jamesl
Databricks Employee

Friday

1 kudos

Hi @Upendra_Dwivedi , are you still facing this issue? The x-forwarded-access-token your app receives is the current user’s access token that Databricks forwards in HTTP headers for on‑behalf‑of‑user access. You should read it from the request on eac...
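The advice in this reply (read the header from the request each time rather than holding on to an earlier value) could look roughly like the sketch below. Only the header name `x-forwarded-access-token` comes from the thread; `get_user_token`, `handle_request`, and the plain-mapping representation of request headers are hypothetical and stand in for whatever web framework the app actually uses.

```python
from typing import Mapping, Optional

# Header name as given in the reply above.
OBO_HEADER = "x-forwarded-access-token"


def get_user_token(headers: Mapping[str, str]) -> Optional[str]:
    """Return the forwarded user token from this request's headers, or None."""
    # HTTP header names are case-insensitive, so compare in lowercase.
    for name, value in headers.items():
        if name.lower() == OBO_HEADER:
            token = value.strip()
            return token or None
    return None


def handle_request(headers: Mapping[str, str]) -> str:
    """Hypothetical handler: looks the token up per request and never caches it."""
    token = get_user_token(headers)
    if token is None:
        return "401: no user token forwarded"
    # A real app would hand `token` to its Databricks client at this point.
    return f"200: acting on behalf of user (token length {len(token)})"
```

Because nothing is stored at module level or at startup, a token that was valid when the app was deployed is never reused for later requests.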

by Mous92i • New Contributor

Wednesday

207 Views
3 replies
2 kudos

Resolved! Liquid Clustering With Merge

Hello, I’m facing severe performance issues with a MERGE INTO on Databricks.

merge_condition = """
source.data_hierarchy = target.data_hierarchy AND
source.sensor_id = target.sensor_id AND
source.timestamp = target.timestamp
"""

The target Delt...

Latest Reply

Mous92i
New Contributor

Friday

2 kudos

Thanks for your response

by databricksero • New Contributor

Wednesday

378 Views
8 replies
3 kudos

DLT pipeline fails with “can not infer schema from empty dataset” — works fine when run manually

Hi everyone, I’m running into an issue with a Delta Live Tables (DLT) pipeline that processes a few transformation layers (raw → intermediate → primary → feature). When I trigger the entire pipeline, it fails with the following error: can not infer sche...

Latest Reply

ManojkMohan
Honored Contributor

Thursday

3 kudos

@databricksero Explicit Schema Definition: When calling spark.createDataFrame(pdf_cleaned), explicitly provide the schema even if the DataFrame is empty. This gives Spark the column types up front, so it does not have to infer them, and prevents the “cannot infer schema from empty dataset” erro...
