cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

SparkMan
by New Contributor
  • 0 Views
  • 0 replies
  • 0 kudos

Job Cluster Reuse

Hi, I have a job where a job cluster is reused twice for task A and task C. Between A and C, task B runs for 4 hours on a different interactive cluster. The issue here is that the job cluster doesn't terminate as soon as Task A is completed and sits ...

  • 0 Views
  • 0 replies
  • 0 kudos
nkrish
by New Contributor II
  • 5 Views
  • 1 replies
  • 0 kudos

Regarding Accelerators

Are there any databricks accelerators to convert the c# and qlikview code to pyspark ? We are using the Open source AI tools to convert now but wondering is there any better way to do the same?Thanks in advance 

  • 5 Views
  • 1 replies
  • 0 kudos
Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 0 kudos

Hi @nkrish ,Unfortunately, I don't think so. Available accelerators you can find here:Databricks Solution Accelerators for Data & AI | DatabricksBut I haven't heard anything about accelerator for c# and qlikview specifically.

  • 0 kudos
GergoBo
by Visitor
  • 19 Views
  • 1 replies
  • 0 kudos

How to Play or Stream MP4 Videos from Unity Catalog Volumes in Databricks (Flask/Dash)?

Hello Databricks Community,I am working on a Dash dashboard (Python/Flask backend) deployed on Databricks, and I need to play or stream MP4 video files stored in a Unity Catalog Volume. I have tried accessing these files both from a Databricks notebo...

  • 19 Views
  • 1 replies
  • 0 kudos
Latest Reply
Raman_Unifeye
Contributor III
  • 0 kudos

@GergoBo - Since notebooks cannot reach out to the file system to stream, you must embed the video as a Base64 encoded string. I tried below code and it works well in Notebook as it plays the video in the output. import base64from IPython.display imp...

  • 0 kudos
Malthe
by Contributor III
  • 26 Views
  • 1 replies
  • 0 kudos

Intermittent task execution issues

We're getting intermittent errors:[ISOLATION_STARTUP_FAILURE.SANDBOX_STARTUP] Failed to start isolated execution environment. Sandbox startup failed. Exception class: INTERNAL. Exception message: INTERNAL: LaunchSandboxRequest create failed - Error e...

  • 26 Views
  • 1 replies
  • 0 kudos
Latest Reply
sandy_123
Visitor
  • 0 kudos

Hi @Malthe ,This might be because of New DBR (18.0) GA release yesterday(January 2026 - Azure Databricks | Microsoft Learn). you might need to use a custom spark version by the time engineering team fixes this issue in DBR. Below is the response from...

  • 0 kudos
shadman140
by Visitor
  • 39 Views
  • 1 replies
  • 0 kudos

Unable to update view comment via SQL/API unless owner, but column comments work (Unity Catalog)

I’m working with Databricks Unity Catalog and observing an inconsistent permission behavior for views.ScenarioA view exists that was created by another userI have sufficient privileges on the catalog/schema/view (SELECT, MODIFY, ALL PRIVILEGES)I can:...

  • 39 Views
  • 1 replies
  • 0 kudos
Latest Reply
stbjelcevic
Databricks Employee
  • 0 kudos

Interesting, for UC, COMMENT ON COLUMN requires MODIFY on a table, and OWNER on a view. If multiple people need to maintain a view, the recommended pattern is to make a group the owner and grant that group the required access to source data. Why is i...

  • 0 kudos
deepu1
by New Contributor
  • 72 Views
  • 1 replies
  • 0 kudos

DLT Gold aggregation with apply_change

I am building a Gold table using Delta Live Tables (DLT). The Gold table contains aggregated data derived from a Silver table. Aggregation happens monthly. However, the requirement is Only the current (year, month) should be recalculated. Previous mo...

  • 72 Views
  • 1 replies
  • 0 kudos
Latest Reply
aleksandra_ch
Databricks Employee
  • 0 kudos

Hi @deepu1 , Assuming that @dlt.table refers to a Materialized View (MV), you are correct that this is the standard way to create aggregated tables in the Gold layer. A Materialized View is essentially a table that stores the results of a specific qu...

  • 0 kudos
PabloCSD
by Valued Contributor II
  • 159 Views
  • 5 replies
  • 3 kudos

How to use/install a driver in Spark Declarative Pipelines (ETL)?

Salutations,I'm using SDP for an ETL that extracts data from HANA and put it in the Unity Catalog. I defined a Policy with the needed driver:But I get this error:An error occurred while calling o1013.load. : java.lang.ClassNotFoundException: com.sap....

PabloCSD_0-1768228884826.png
  • 159 Views
  • 5 replies
  • 3 kudos
Latest Reply
anshu_roy
Databricks Employee
  • 3 kudos

At this time, Databricks does not offer native connectors for SAP HANA. You can find the complete list of managed connectors currently available in Databricks here. We generally recommend beginning with SAP’s own commercial tools, prioritizing SAP Bu...

  • 3 kudos
4 More Replies
Digvijay_11
by New Contributor
  • 49 Views
  • 1 replies
  • 1 kudos

Lakeflow Spark Declarative Pipeline

How we can run a SDP pipeline in parallel manner with dynamic parameter parsing on pipeline level. How we can consume job level parameter in Pipeline. If similar name parameters are defined in pipeline level then job level parameters are getting over...

  • 49 Views
  • 1 replies
  • 1 kudos
Latest Reply
osingh
Contributor
  • 1 kudos

To run an SDP (Spark Declarative Pipeline) in parallel with dynamic parameters, you need to understand that SDP is "smart"—it builds a dependency graph and runs everything it can at the same time by default.Here is a simple breakdown of how to handle...

  • 1 kudos
r0nald
by New Contributor II
  • 10712 Views
  • 5 replies
  • 1 kudos

UDF not working inside transform() & lambda (SQL)

Below is toy example of what I'm trying to achieve, but don't understand why it fails. Can anyone explain why, and suggest a fix or not overly bloated workaround?%sqlcreate or replace function status_map(status int)returns stringreturn map(10, "STATU...

  • 10712 Views
  • 5 replies
  • 1 kudos
Latest Reply
marcogrcr
New Contributor
  • 1 kudos

Scoped variables in a transform() are not accessible by UDFs. However, you can workaround this using explode():# equivalent of: select transform(arr, e -> status_map(e.v1)) from s1 select collect_list(status_map(status_id)) from explode((select trans...

  • 1 kudos
4 More Replies
dpc
by Contributor
  • 41 Views
  • 1 replies
  • 0 kudos

Case insensitive data

For all it's positives, one of the first general issues we had with databricks was case sensitivity.We have a lot of data specific filters in our codeProblem is, we land and view data from lots of different case insensitive source systems e.g. SQL Se...

  • 41 Views
  • 1 replies
  • 0 kudos
Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 0 kudos

Hi @dpc ,I think you can try to use a collation for that purpose. A collation is a set of rules that determines how string comparisons are performed. Collations are used to compare strings in a case-insensitive, accent-insensitive, or trailing space ...

  • 0 kudos
seefoods
by Valued Contributor
  • 76 Views
  • 3 replies
  • 3 kudos

Resolved! write both logging error Pyspark and Python exceptions

Hello guyz, Happy new year and best wishes for all of us. I am catching both Pyspark and Python exceptions but i want to write this logging error inside a delta table when i logging. Someone knows the best practise for this ? Thanks Cordially, 

  • 76 Views
  • 3 replies
  • 3 kudos
Latest Reply
seefoods
Valued Contributor
  • 3 kudos

Thanks a lot @szymon_dybczak 

  • 3 kudos
2 More Replies
DataGuy2
by Visitor
  • 40 Views
  • 0 replies
  • 0 kudos

Databricks notebook Issue

Hello Databricks Community,I’m facing multiple issues while working in Azure Databricks notebooks, and I’d appreciate guidance or troubleshooting suggestions.Issue 1: Failed to reconnectWhile running a notebook, I frequently see a “Failed to reconnec...

  • 40 Views
  • 0 replies
  • 0 kudos
Labels