Data Engineering

Forum Posts

Sorted by:

by zmsoft • Contributor

04-02-2025 3:48:01 PM

1030 Views
3 replies
0 kudos

How to set DLT pipeline warning alert?

Hi there,The example description of custom event hooks in the documentation is not clear enough, I do not know how to implement it inside python functions. event-hooks My Code: %python # Read the insertion of data raw_user_delta_streaming=spark.rea...

Data Engineering

1030 Views
3 replies
0 kudos

04-02-2025 3:48:01 PM

View Replies

Latest Reply

Priyanka_Biswas
Databricks Employee

04-03-2025 2:33:03 PM

0 kudos

Hi @zmsoft The event hook provided, user_event_hook, must be a Python callable that accepts exactly one parameter - a dictionary representation of the event that triggered the execution of this event hook. The return value of the event hook has no s...

0 kudos

04-03-2025 2:33:03 PM

2 More Replies

by NamNguyenCypher • New Contributor II

04-23-2025 9:51:26 AM

925 Views
2 replies
2 kudos

Resolved! Adding column masks to a column using the DLT Python create_streaming_table API

I'm having difficulty adding a mask function to columns while creating streaming tables using the DLT Python method create_streaming_table() like this but it does not work, the streaming table is created but no column is masked:def prepare_column_pro...

Data Engineering

925 Views
2 replies
2 kudos

04-23-2025 9:51:26 AM

View Replies

Latest Reply

lingareddy_Alva
Honored Contributor III

04-23-2025 10:18:23 AM

2 kudos

@NamNguyenCypher Delta Live Tables’ Python API does not currently honor column-mask metadata embedded in a PySpark StructType. Masking (and row filters) on DLT tables are only applied when you define your table with a DDL-style schema that includes a...

2 kudos

04-23-2025 10:18:23 AM

1 More Replies

by vziog • New Contributor II

04-22-2025 7:51:33 AM

1009 Views
5 replies
1 kudos

Unexpected SKU Names in Usage Table for Job Cost Calculation

I'm trying to calculate the cost of a job using the usage and list_prices system tables, but I'm encountering some unexpected behavior that I can't explain.When I run a job using a shared cluster, the sku_name in the usage table is PREMIUM_JOBS_SERVE...

Data Engineering

1009 Views
5 replies
1 kudos

04-22-2025 7:51:33 AM

View Replies

Latest Reply

vziog
New Contributor II

04-23-2025 3:27:25 AM

1 kudos

Thank you all for your replies. @lingareddy_Alva what about @Walter_C and @mnorland mentioned about enabling serverless tasks. Is this possible and how?

1 kudos

04-23-2025 3:27:25 AM

4 More Replies

by ankit001mittal • New Contributor III

04-22-2025 6:36:39 AM

659 Views
1 replies
0 kudos

DLT Publish event log to metastore

Hi Guys,I am trying to use DLT Publish event log to metastore feature.and I noticed it creates a table with the logs for each DLT pipelines separately. Does it mean it maintains the separate log table for all the DLT tables ( in our case, we have 100...

Data Engineering

dlt

659 Views
1 replies
0 kudos

04-22-2025 6:36:39 AM

View Replies

Latest Reply

SP_6721
Contributor III

04-23-2025 6:11:10 AM

0 kudos

Hi @ankit001mittal Yes, you're right, when you enable the "Publish Event Log to Metastore" option for DLT pipelines, Databricks creates a separate event log table for each pipeline. So, if you have thousands of pipelines, you'll see thousands of log ...

0 kudos

04-23-2025 6:11:10 AM

by holychs • New Contributor III

04-06-2025 11:07:00 AM

584 Views
2 replies
0 kudos

Repairing running workflow with few failed child jobs

I have a parent job that calls multiple child jobs in workflow, Out of 10 child jobs, one has failed and rest 9 are still running, I want to repair the failed child tasks. can I do that while the other child jobs are running?

Data Engineering

584 Views
2 replies
0 kudos

04-06-2025 11:07:00 AM

View Replies

Latest Reply

Brahmareddy
Esteemed Contributor

04-06-2025 7:39:22 PM

0 kudos

Hi holychs,How are you doing today?, As per my understanding, yes, in Databricks Workflows, if you're running a multi-task job (like your parent job triggering multiple child tasks), you can repair only the failed task without restarting the entire j...

0 kudos

04-06-2025 7:39:22 PM

1 More Replies

by vivi007 • New Contributor

04-22-2025 10:59:01 AM

870 Views
1 replies
0 kudos

Can we have a depend-on for jobs to run on two different dabs?

If there are two different DABs. Can we have a dependency for one job from one DAB to run after a job run from another DAB? Similar to how tasks can depend on each other to run one after the other in the same DAB. Can we have the same for two differ...

Data Engineering

DAB

870 Views
1 replies
0 kudos

04-22-2025 10:59:01 AM

View Replies

Latest Reply

lingareddy_Alva
Honored Contributor III

04-22-2025 9:14:17 PM

0 kudos

@vivi007 Yes, you can create dependencies between jobs in different DABs (Databricks Asset Bundles), but this requires a different approach than task dependencies within a single DAB.Since DABs are designed to be independently deployable units, direc...

0 kudos

04-22-2025 9:14:17 PM

by ShreevaniRao • New Contributor III

04-14-2025 1:22:58 PM

6767 Views
13 replies
4 kudos

Newbie learning DLT pipelines

Hello,I am learning to create DLT pipelines using different graphs using a 14 day trial version of the premium Databricks. I have currently one graph Mat view -> Streaming Table -> Mat view. When i ran this pipeline(serverless compute) 1st time, ran...

Data Engineering

6767 Views
13 replies
4 kudos

04-14-2025 1:22:58 PM

View Replies

Latest Reply

Aviral-Bhardwaj
Esteemed Contributor III

04-15-2025 9:29:26 PM

4 kudos

use this https://www.youtube.com/watch?v=iqf_QHC7tgQ&list=PL2IsFZBGM_IGpBGqxhkiNyEt4AuJXA0Gg it will help you a lot

4 kudos

04-15-2025 9:29:26 PM

12 More Replies

by ktagseth • New Contributor II

04-22-2025 9:41:38 AM

482 Views
3 replies
0 kudos

dbutils.fs.mv inefficient with ADLS

dbutils.fs.mv with ADLS currently copies the file and then deletes the old one. This incurs costs and has a lot of overhead vs using the rename functionality in ADLS which is instant and doesn't incur extra costs involved with writing the 'new' data....

Data Engineering

482 Views
3 replies
0 kudos

04-22-2025 9:41:38 AM

View Replies

Latest Reply

BigRoux
Databricks Employee

04-22-2025 12:16:19 PM

0 kudos

The tool is really meant for dbfs and is only accessible from within Databricks. If I had to guess the idea is that most folks will not be using dbfs for production or sensitive data (for a host of good reasons) and as such there has not been a big ...

0 kudos

04-22-2025 12:16:19 PM

2 More Replies

by fedemgp • New Contributor

03-27-2025 6:49:21 AM

807 Views
1 replies
0 kudos

Configure verbose audit logs through terraform

Hi everyone,I was looking into the databricks_workspace_conf Terraform resource to configure Verbose Audit Logs (and avoid changing it through the UI). However, I attempted to apply this configuration and encountered the following error:Error: cannot...

Data Engineering

807 Views
1 replies
0 kudos

03-27-2025 6:49:21 AM

View Replies

Latest Reply

TheRealOliver
Contributor

04-22-2025 9:54:06 AM

0 kudos

@fedemgp I was able to turn the desired setting on and off with Terraform with this code: GitHub Gist I'm using Databricks Terraform provider version 1.74.0 and my Databricks runs on Google Cloud.

0 kudos

04-22-2025 9:54:06 AM

by gm_co • New Contributor

02-12-2025 10:15:33 AM

1064 Views
1 replies
0 kudos

Bar chart data labels in percent

Hello, I am currently working with bar visualizations in a new workbook editor. When I use labels, I can see the count of rows returned, and hovering over them shows the percentage of the two values returned. How can I make the percentage display on ...

Data Engineering

1064 Views
1 replies
0 kudos

02-12-2025 10:15:33 AM

View Replies

Latest Reply

Advika
Databricks Employee

04-22-2025 8:05:16 AM

0 kudos

Hello @gm_co! Were you able to sort this out?You can display % in two ways: In the General settings, check the box for Normalize values to percentage.As you have enabled Labels, just set the Data labels to {{ @@yPercent }}. This will show the percent...

0 kudos

04-22-2025 8:05:16 AM

by MrJava • New Contributor III

02-07-2023 8:59:29 AM

16040 Views
17 replies
13 kudos

How to know, who started a job run?

Hi there!We have different jobs/workflows configured in our Databricks workspace running on AWS and would like to know who actually started the job run? Are they started by a user or a service principle using curl?Currently one can only see, who is t...

Data Engineering

16040 Views
17 replies
13 kudos

02-07-2023 8:59:29 AM

View Replies

Latest Reply

jeremy98
Honored Contributor

04-22-2025 7:08:03 AM

13 kudos

News on this feature?

13 kudos

04-22-2025 7:08:03 AM

16 More Replies

by VicS • Contributor

04-17-2025 2:01:26 AM

1590 Views
6 replies
3 kudos

Resolved! How can I use Terraform to assign an external location to multiple workspaces?

How can I use Terraform to assign an external location to multiple workspaces?When I create an external location with Terraform, I do not see any option to directly link workspaces. it also only links to the workspace of the databricks profile that I...

Data Engineering

1590 Views
6 replies
3 kudos

04-17-2025 2:01:26 AM

View Replies

Latest Reply

TheRealOliver
Contributor

04-21-2025 6:20:18 PM

3 kudos

@Walter_C I think you need to use databricks_workspace_binding resource for that multi-workspace binding. I was able to achieve it in Terraform. The resource docs seem to agree with result that I have. My Databricks runs on Google Cloud.My Terraform...

3 kudos

04-21-2025 6:20:18 PM

5 More Replies

by oakhill • New Contributor III

01-28-2025 1:31:04 AM

1577 Views
4 replies
0 kudos

Unable to read Delta Table using external tools

I am using the new credential vending API to get tokens and url for my tables in Unity Catalog.I get the token, url and I am able to scan the folder using read_parquet, but NOT any Delta Lake functions. Not TableExists, scan_delta or delta_scan from ...

Data Engineering

1577 Views
4 replies
0 kudos

01-28-2025 1:31:04 AM

View Replies

Latest Reply

matovitch
New Contributor II

04-22-2025 6:42:53 AM

0 kudos

When copying a problematic delta table and reading the copy the issue disappear, it seems to be related to the new delta checkpointPolicy (v2) not supported by the rust implementation of delta but fine with the scala/java one (deltalake vs delta-spar...

0 kudos

04-22-2025 6:42:53 AM

3 More Replies

by carlos_tasayco • Contributor

04-11-2025 8:04:52 AM

542 Views
1 replies
1 kudos

Power bi connection

https://databrickster.medium.com/databricks-will-refresh-your-powerbi-semantic-model-both-dataset-metadata-and-data-4e8279e10b8eAbove is what I am trying to do, I already created the connection apparently all looks good, I added the task to my workfl...

Data Engineering

542 Views
1 replies
1 kudos

04-11-2025 8:04:52 AM

View Replies

Latest Reply

Renu_
Valued Contributor II

04-22-2025 6:30:44 AM

1 kudos

Hi @carlos_tasayco, it looks like the issue is due to missing permissions in Power BI. If the role is set to Viewer, try updating it to Contributor or higher. You may need help from the Power BI admin to adjust the access. This should resolve the err...

1 kudos

04-22-2025 6:30:44 AM

by anand010210 • New Contributor

04-10-2025 10:39:33 AM

588 Views
1 replies
0 kudos

AWS account databricks account / subscriptions disable before 14days trial over

I had registered trial version for 14days thru AWS, its been deactivated now before 14 days over. when i am going thru manage subscription and it is asking to register product and redirecting me to create and link account which was already there and...

Data Engineering

588 Views
1 replies
0 kudos

04-10-2025 10:39:33 AM

View Replies

Latest Reply

Advika
Databricks Employee

04-22-2025 4:36:28 AM

0 kudos

Hello @anand010210! Were you able to resolve the issue? It seems that your AWS account is still linked to an active Databricks subscription, even though it's been disabled. I recommend checking your AWS Marketplace subscription to see if Databricks i...

0 kudos

04-22-2025 4:36:28 AM

Databricks Community

Forum Posts

How to set DLT pipeline warning alert?

Resolved! Adding column masks to a column using the DLT Python create_streaming_table API

Unexpected SKU Names in Usage Table for Job Cost Calculation

DLT Publish event log to metastore

Repairing running workflow with few failed child jobs

Can we have a depend-on for jobs to run on two different dabs?

Newbie learning DLT pipelines

dbutils.fs.mv inefficient with ADLS

Configure verbose audit logs through terraform

Bar chart data labels in percent

How to know, who started a job run?

Resolved! How can I use Terraform to assign an external location to multiple workspaces?

Unable to read Delta Table using external tools

Power bi connection

AWS account databricks account / subscriptions disable before 14days trial over

Join Us as a Local Community Builder!

Issue with Databricks Jobs: SQLSTATE: XXKST

Unable to use Auto Loader for External Location in...

Error [UNAUTHORIZED_ACCESS]: Merging into a table ...

Issue in renaming the table

mongodb spark