Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

Richard3
by New Contributor II
  • 317 Views
  • 6 replies
  • 5 kudos

IDENTIFIER in SQL Views not supported?

Dear community, we are phasing out the dollar parameter `${catalog_name}` because it has been deprecated since runtime 15.2. We use this parameter in many queries, and it should now be replaced by the IDENTIFIER clause. In the query below, where we retrieve data...

Latest Reply
Hubert-Dudek
Esteemed Contributor III
  • 5 kudos

I have good news: in runtime 18, IDENTIFIER and parameter markers are supported everywhere! We need to wait a month or two, as SQL warehouses and serverless are still on runtime 17.
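
For illustration, a minimal sketch of what that unlocks once runtime 18 lands, assuming a widget named catalog_name and a hypothetical reporting.orders_v view over sales.orders:

    # Minimal sketch (assumed names: widget 'catalog_name', table sales.orders).
    # Replaces the deprecated ${catalog_name} substitution with a named
    # parameter marker resolved through the IDENTIFIER clause.
    catalog = dbutils.widgets.get("catalog_name")

    spark.sql(
        """
        CREATE OR REPLACE VIEW reporting.orders_v AS
        SELECT * FROM IDENTIFIER(:cat || '.sales.orders')
        """,
        args={"cat": catalog},
    )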

5 More Replies
RobFer1985
by New Contributor
  • 139 Views
  • 2 replies
  • 0 kudos

Databricks pipeline fails expectation when executing Python script, throws error: Update FAILED

Hi Community, I'm new to Databricks and am trying to build and implement pipeline expectations. The pipelines work without errors and my job works. I've tried multiple ways to implement expectations, SQL and Python. I keep resolving the errors but end ...

Latest Reply
carlo968rojer
  • 0 kudos

Hello @RobFer1985, the primary cause of your error is a circular reference in your logic: you are defining a table named orders_2 while simultaneously trying to readStream from that same table. In Delta Live Tables (DLT), the function acts as the "wr...
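
In code, the fix is to point the streaming read at an upstream source rather than at the table being defined. A minimal sketch, assuming a hypothetical upstream table orders_raw and an order_id key:

    # Minimal sketch: orders_2 must read from an upstream table, not from itself.
    import dlt

    @dlt.table(name="orders_2")
    @dlt.expect_or_drop("valid_order_id", "order_id IS NOT NULL")
    def orders_2():
        # Reading orders_2 here would recreate the circular reference.
        return spark.readStream.table("orders_raw")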

1 More Replies
lindsey
by New Contributor II
  • 2660 Views
  • 1 reply
  • 1 kudos

"Error: cannot read mws credentials: invalid Databricks Account configuration" on TF Destroy

I have a terraform project that creates a workspace in Databricks, assigns it to an existing metastore, then creates external location/storage credential/catalog. The apply works and all expected resources are created. However, without touching any r...

Latest Reply
eduardo_287
  • 1 kudos

I have the same problem, were you able to solve it?

alesventus
by Contributor
  • 75 Views
  • 4 replies
  • 0 kudos

Power BI refresh job task

I have tried the Databricks job task to refresh a Power BI dataset and I have found 2 issues. 1. I set up tables in Power BI Desktop using Import mode. After deploying the model to Power BI Service, I was able to download it as an Import mode model. However...

Latest Reply
emma_s
Databricks Employee
  • 0 kudos

Can you send a screenshot of the refresh Power BI task in the Jobs UI within Databricks, please?

3 More Replies
ndw
by New Contributor II
  • 108 Views
  • 5 replies
  • 0 kudos

Extract Snowflake data based on environment

Hi all, in the development workspace, I need to extract data from a table/view in the Snowflake development environment. An example table is called VD_DWH.SALES.SALES_DETAIL. When we deploy the code into production, it needs to extract data from a table/vi...

Latest Reply
nayan_wylde
Esteemed Contributor
  • 0 kudos

Create a single job that runs your migration notebook. In the job settings, under Parameters, add a key like env with a default value (e.g., dev). When you create different job runs (or schedule them), override the parameter: for development runs, set e...
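
A minimal sketch of the notebook side of that pattern, assuming the parameter is named env and using a placeholder production database name:

    # Minimal sketch (assumed: job parameter 'env'; 'VP_DWH' is a placeholder
    # for the real production database name).
    env = dbutils.widgets.get("env")  # 'dev' by default, overridden to 'prod'
    database = "VD_DWH" if env == "dev" else "VP_DWH"

    df = (spark.read.format("snowflake")
          .option("sfDatabase", database)
          .option("sfSchema", "SALES")
          .option("dbtable", "SALES_DETAIL")
          # Connection options (sfUrl, credentials, ...) omitted; pull them
          # from a secret scope rather than hard-coding.
          .load())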

4 More Replies
angel_ba
by New Contributor II
  • 2222 Views
  • 3 replies
  • 0 kudos

unity catalog system.access.audit lag

Hello, we have a Unity Catalog enabled workspace. To get the completion time of a pipeline that runs multiple times a day, I am checking the system.access.audit table. Comparing the completion time of the pipeline to other pipelines' times, I am creat...

Latest Reply
Raman_Unifeye
Contributor III
  • 0 kudos

@angel_ba - This is expected/designed behaviour. Audit logs are ingested into the system tables asynchronously. Databricks batches these events before surfacing them in UC system tables. Alternatively, perhaps the best way is to use the Jobs API for start/compl...
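
For example, a minimal sketch of reading the latest completed run's end time straight from the Jobs 2.1 API (the host, job ID, and secret scope/key are placeholders):

    # Minimal sketch: read run end times from the Jobs API instead of waiting
    # for system.access.audit to catch up.
    import requests

    host = "https://<workspace-host>"                     # placeholder
    token = dbutils.secrets.get("ops", "databricks-pat")  # assumed scope/key

    resp = requests.get(
        f"{host}/api/2.1/jobs/runs/list",
        headers={"Authorization": f"Bearer {token}"},
        params={"job_id": 123456, "completed_only": "true", "limit": 1},
    )
    latest = resp.json()["runs"][0]
    print(latest["end_time"])  # epoch milliseconds of the latest completed run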

2 More Replies
Gaganmjain_012
by New Contributor
  • 42 Views
  • 1 reply
  • 0 kudos

AI/BI Genie

I was working with Genie and started using the Research agent, and now I want to manage Genie as shareable Infrastructure as Code, where I can manage all the changes through GitHub. Does anyone have any suggestions on how to do this in a best optimi...

Latest Reply
saurabh18cs
Honored Contributor II
  • 0 kudos

Hi @Gaganmjain_012, once it is in GitHub, if you want to deploy via asset bundles, then https://github.com/databricks/cli/issues/3008 looks like an open request.

hidden
by New Contributor II
  • 37 Views
  • 1 reply
  • 0 kudos

Integrating Linear app with Databricks

I want to integrate the Linear app with Databricks. The moment any job fails, I want to create a Linear task with the error in the description. Can you guide me on implementing this?

Latest Reply
ManojkMohan
Honored Contributor II
  • 0 kudos

@hidden Configure a Generic Webhook notification destination in Databricks that points to your own small service (e.g., an AWS Lambda). In that service, parse the payload, detect job failure events, extract the relevant error/stack trace, and call the Linear Gr...
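
A minimal sketch of that middle service, assuming environment variables LINEAR_API_KEY and LINEAR_TEAM_ID; the webhook payload field names below are assumptions and should be checked against a real event:

    # Minimal sketch of the Lambda glue between a Databricks job-failure
    # webhook and Linear's GraphQL issueCreate mutation.
    import json
    import os
    import urllib.request

    def handler(event, context):
        payload = json.loads(event["body"])          # assumed payload shape
        job_id = payload.get("job", {}).get("job_id", "unknown")
        run = payload.get("run", {})

        body = json.dumps({
            "query": """
                mutation($input: IssueCreateInput!) {
                  issueCreate(input: $input) { success }
                }""",
            "variables": {"input": {
                "teamId": os.environ["LINEAR_TEAM_ID"],
                "title": f"Databricks job {job_id} failed",
                "description": f"Run page: {run.get('run_page_url', 'n/a')}",
            }},
        }).encode()

        req = urllib.request.Request(
            "https://api.linear.app/graphql",
            data=body,
            headers={"Content-Type": "application/json",
                     "Authorization": os.environ["LINEAR_API_KEY"]},
        )
        urllib.request.urlopen(req)
        return {"statusCode": 200}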

rc10000
by New Contributor
  • 96 Views
  • 1 reply
  • 2 kudos

Resolved! Databricks Data Engineer Associate Exam Dec 2025

Hi, I am prepping for the Databricks DEA exam. I am seeing some conflicting information about DLT/DP (declarative pipeline) syntax. I am trying to see if the most up-to-date syntax is going to be on the exam, as opposed to the legacy DLT. For example, current DP sy...

Latest Reply
kiwi286dew
New Contributor
  • 2 kudos

Hello @rc10000, you are correct to prioritize the current syntax. The most up-to-date Databricks Certified Data Engineer Associate exam will focus on the declarative syntax within Lakeflow Spark Declarative Pipelines. You should prepare for CREATE OR...

Shimon
by New Contributor
  • 123 Views
  • 2 replies
  • 0 kudos

Jackson version conflict

Hi, I am trying to implement the Spark TableProvider API and I am experiencing a JAR conflict (I am using the 17.3 runtime): com.fasterxml.jackson.databind.JsonMappingException: Scala module 2.15.2 requires Jackson Databind version >= 2.15.0 and < 2.1...

Latest Reply
Shimon
New Contributor
  • 0 kudos

For now we are trying to contact Databricks; in the worst-case scenario, we were planning to shade the dependencies we need. Would love to hear what has worked for you. Best, Shimon

1 More Replies
dgahram
by New Contributor
  • 76 Views
  • 1 reply
  • 0 kudos

DLT File Level Deduplication

I want to create a DLT pipeline that incrementally processes CSV files arriving daily. However, some of those files are duplicates: they have the same names and data but are in different directories. What is the best way to handle this? I'm assuming ...

Latest Reply
K_Anudeep
Databricks Employee
  • 0 kudos

Hello @dgahram, Auto Loader tracks ingestion progress by persisting discovered file metadata in a RocksDB store within the checkpoint, which provides "exactly-once" processing for discovered files. Doc: https://docs.databricks.com/aws/en/ingestion...
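
Since that tracking is keyed on the full file path, same-named files in different directories will both be ingested, so one option is a row-level dedupe downstream. A minimal sketch, assuming a hypothetical order_id business key and placeholder paths:

    # Minimal sketch: ingest with Auto Loader, then drop row-level duplicates
    # on an assumed business key.
    import dlt

    @dlt.table(name="orders_bronze")
    def orders_bronze():
        df = (spark.readStream.format("cloudFiles")
              .option("cloudFiles.format", "csv")
              .option("header", "true")
              .load("/Volumes/main/raw/landing/"))  # placeholder path
        # Unbounded state; on large streams prefer a watermark-based dedupe.
        return df.dropDuplicates(["order_id"])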

j_unspeakable
by New Contributor III
  • 2150 Views
  • 4 replies
  • 5 kudos

Resolved! Permission Denied when Creating External Tables Using Workspace Default Credential

I’m building out schemas, volumes, and external Delta tables in Unity Catalog via Terraform. The schemas and volumes are created successfully, but all external tables are failing.The error message from Terraform doesn't highlight what the issue is bu...

Latest Reply
artopihlaja
New Contributor II
  • 5 kudos

Feature or bug, I discovered the same. I couldn't create tables with the default credential. To test, I assigned the default credential and a custom credential the same access rights to the storage container that is the target of the external locatio...

3 More Replies
Galih
by New Contributor
  • 146 Views
  • 3 replies
  • 3 kudos

Spark Structured Streaming - calculate signal, help required! 🙏

Hello everyone! I'm very, very new to Spark Structured Streaming, and not a data engineer. I would appreciate guidance on how to efficiently process streaming data and emit only changed aggregate results over multiple time windows. Input Stream: Source: A...

Latest Reply
Hubert-Dudek
Esteemed Contributor III
  • 3 kudos

I would implement stateful streaming by using transformWithStateInPandas to keep the state and implement the logic there. I would avoid doing stream-stream JOINs.
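
For reference, a minimal sketch of that emit-only-on-change pattern, shown here with the older applyInPandasWithState API (transformWithStateInPandas, on newer runtimes, expresses the same idea through a StatefulProcessor class); the input columns signal and value are assumptions:

    # Minimal sketch: keep a running aggregate per key in state and emit a row
    # only when it changes. Assumed input: streaming DataFrame 'events' with
    # columns 'signal' (string) and 'value' (double).
    import pandas as pd
    from pyspark.sql.streaming.state import GroupState, GroupStateTimeout

    def emit_if_changed(key, pdfs, state: GroupState):
        prev = state.get[0] if state.exists else None
        total = prev if prev is not None else 0.0
        for pdf in pdfs:
            total += float(pdf["value"].sum())
        state.update((total,))
        if total != prev:          # only emit changed aggregates
            yield pd.DataFrame({"signal": [key[0]], "total": [total]})

    out = (events.groupBy("signal")
           .applyInPandasWithState(
               emit_if_changed,
               outputStructType="signal STRING, total DOUBLE",
               stateStructType="total DOUBLE",
               outputMode="update",
               timeoutConf=GroupStateTimeout.NoTimeout))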

2 More Replies
