Data Engineering

Forum Posts

Sorted by:

by RabahO • New Contributor III

4 hours ago

26 Views
2 replies
0 kudos

Dashboard always display truncated data

Hello, we're working with a serverless SQL cluster to query Delta tables and display some analytics in dashboards. We have some basic group by queries that generate around 36k lines, and they are executed without the "limit" key word. So in the data ...

Data Engineering

26 Views
2 replies
0 kudos

4 hours ago

View Replies

Latest Reply

mhiltner
New Contributor II

54m ago

0 kudos

Hey @RabahO This is likely a memory issue. The current behavior is that Databricks will only attempt to display the first 64000 rows of data. If the first 64000 rows of data are larger than 2187 MB, then it will fail to display anything. In your cas...

0 kudos

54m ago

1 More Replies

by pragarwal • New Contributor II

5 hours ago

28 Views
2 replies
0 kudos

Adding Member to group using account databricks rest api

Hi All,I want to add a member to a group in databricks account level using rest api (https://docs.databricks.com/api/azure/account/accountgroups/patch) as mentioned in this link I could able to authenticate but not able to add member while using belo...

Data Engineering

28 Views
2 replies
0 kudos

5 hours ago

View Replies

Latest Reply

pragarwal
New Contributor II

15m ago

0 kudos

Hi @Kaniz I have tried suggest body also but still member is not added to group. is there any other method that i can use add member to the group at account levelThanks,Phani.

0 kudos

15m ago

1 More Replies

by smedegaard • New Contributor III

2 weeks ago

603 Views
3 replies
0 kudos

DLT run filas with "com.databricks.cdc.spark.DebeziumJDBCMicroBatchProvider not found"

I've created a streaming live table from a foreign catalog. When I run the DLT pipeline it fils with "com.databricks.cdc.spark.DebeziumJDBCMicroBatchProvider not found".I haven't seen any documentation that suggests I need to install Debezium manuall...

Data Engineering

603 Views
3 replies
0 kudos

2 weeks ago

View Replies

Latest Reply

Kaniz
Community Manager

42m ago

0 kudos

Hi @smedegaard, The error message you’re encountering, “com.databricks.cdc.spark.DebeziumJDBCMicroBatchProvider not found,” indicates that the specified class is not available in your classpath. To address this issue, follow these steps: Verif...

0 kudos

42m ago

2 More Replies

by Chengzhu • New Contributor

3 weeks ago

124 Views
1 replies
0 kudos

Databricks Model Registry Notification

Hi community,Currently, I am training models on databricks cluster and use mlflow to log and register models. My goal is to send notification to me when a new version of registered model happens (if the new run achieves some model performance baselin...

Data Engineering

124 Views
1 replies
0 kudos

3 weeks ago

View Replies

Latest Reply

Kaniz
Community Manager

37m ago

0 kudos

Hi @Chengzhu, It seems like you’re using MLflow’s Model Registry to manage the lifecycle of your machine learning models. Let’s explore this further. The MLflow Model Registry provides a centralized model store, APIs, and a UI to collaboratively m...

0 kudos

37m ago

by EWhitley • New Contributor II

2 weeks ago

254 Views
1 replies
0 kudos

Custom ENUM input as parameter for SQL UDF?

Hello - We're migrating from T-SQL to Spark SQL. We're migrating a significant number of queries."datediff(unit, start,end)" is different between these two implementations (in a good way). For the purpose of migration, we'd like to stay as consiste...

Data Engineering

sql

udf

254 Views
1 replies
0 kudos

2 weeks ago

View Replies

Latest Reply

Kaniz
Community Manager

45m ago

0 kudos

Hi @EWhitley, You’re on the right track with creating a custom UDF in Python for your migration. To achieve similar behaviour to the T-SQL DATEDIFF function with an enum-like unit parameter, you can follow these steps: Create a Custom UDF: Define...

0 kudos

45m ago

by YannLevavasseur • New Contributor

2 weeks ago

341 Views
1 replies
0 kudos

SQL function refactoring into Databricks environment

Hello all,I'm currently working on importing some SQL functions from Informix Database into Databricks using Asset Bundle deploying Delta Live Table to Unity Catalog. I'm struggling importing a recursive one, there is the code :CREATE FUNCTION "info...

Data Engineering

341 Views
1 replies
0 kudos

2 weeks ago

View Replies

Latest Reply

Kaniz
Community Manager

51m ago

0 kudos

Hi @YannLevavasseur, It looks like you’re dealing with a recursive SQL function for calculating the weight of articles in a Databricks environment. Handling recursion in SQL can be tricky, especially when translating existing Informix code to Data...

0 kudos

51m ago

by Sambit_S • New Contributor II

2 weeks ago

270 Views
1 replies
0 kudos

Error during deserializing protobuf data

I am receiving protobuf data in a json attribute and along with it I receive a descriptor file.I am using from_protobuf to deserialize the data as below,It works most of the time but giving error when there are some recursive fields within the protob...

Data Engineering

270 Views
1 replies
0 kudos

2 weeks ago

View Replies

Latest Reply

Kaniz
Community Manager

an hour ago

0 kudos

Hi @Sambit_S, Handling recursive fields in Protobuf can indeed be tricky, especially when deserializing data. Let’s explore some potential solutions to address this issue: Casting Issue with Recursive Fields: The error you’re encountering might b...

0 kudos

an hour ago

by Skr7 • New Contributor II

3 hours ago

33 Views
1 replies
0 kudos

Databricks Asset Bundles

Hi, I'm implementing Databricks Asset bundles, my scripts are in GitHub and my /resource has all the .yml of my Databricks workflow which are pointing to the main branch git_source: git_url: https://github.com/xxxx git_provider: ...

Data Engineering

Databricks

33 Views
1 replies
0 kudos

3 hours ago

View Replies

Latest Reply

Kaniz
Community Manager

an hour ago

0 kudos

Hi @Skr7 , Let’s break down your requirements: Dynamically Changing Git Branch for Databricks Asset Bundles (DABs): When deploying and running your DAB, you want the Databricks workflows to point to your feature branch instead of the main branch....

0 kudos

an hour ago

by jainshasha • New Contributor

Monday

91 Views
5 replies
0 kudos

Job Cluster in Databricks workflow

Hi,I have configured 20 different workflows in Databricks. All of them configured with job cluster with different name. All 20 workfldows scheduled to run at same time. But even configuring different job cluster in all of them they run sequentially w...

Data Engineering

91 Views
5 replies
0 kudos

Monday

View Replies

Latest Reply

Wojciech_BUK
Contributor III

an hour ago

0 kudos

HI @jainshasha i tried to replicate your problem but in my case i was able to run jobs in parallel(the only difference is that i am running notebook from workspace, not from repo)As you can see jobs did not started exactly same time but it run in par...

0 kudos

an hour ago

4 More Replies

by madhumitha • Visitor

yesterday

57 Views
4 replies
0 kudos

Connect power bi desktop semantic model output to databricks

Hello, I am trying to connect the power bi semantic model output (basically the data that has already been pre processed) to databricks. Does anybody know how to do this? I would like it to be an automated process so I would like to know any way to p...

Data Engineering

57 Views
4 replies
0 kudos

yesterday

View Replies

Latest Reply

Kaniz
Community Manager

4 hours ago

0 kudos

Hi @madhumitha, Connecting Power BI semantic model output to Databricks can be done in a few steps. Here are a couple of options: Databricks Power Query Connector: The new Databricks connector is natively integrated into Power BI. You can configu...

0 kudos

4 hours ago

3 More Replies

by dbdude • New Contributor II

08-17-2023 4:01:48 PM

4579 Views
7 replies
0 kudos

AWS Secrets Works In One Cluster But Not Another

Why can I use boto3 to go to secrets manager to retrieve a secret with a personal cluster but I get an error with a shared cluster?NoCredentialsError: Unable to locate credentials

Data Engineering

4579 Views
7 replies
0 kudos

08-17-2023 4:01:48 PM

View Replies

Latest Reply

Kaniz
Community Manager

2 hours ago

0 kudos

Hi @dbdude and @drii_cavalcanti , The NoCredentialsError you’re encountering when using Boto3 to retrieve a secret from AWS Secrets Manager typically indicates that the AWS SDK is unable to find valid credentials for your API request. Let’s explor...

0 kudos

2 hours ago

6 More Replies

by Skr7 • New Contributor II

09-21-2023 7:27:06 AM

1145 Views
2 replies
1 kudos

Resolved! Scheduled job output export

Hi ,I have a Databricks job that results in a dashboard post run , I'm able to download the dashboard as HTML from the view job runs page , but I want to automate the process , so I tried using the Databricks API , but it says {"error_code":"INVALID_...

Data Engineering

data engineering

1145 Views
2 replies
1 kudos

09-21-2023 7:27:06 AM

View Replies

Latest Reply

Kaniz
Community Manager

09-22-2023 12:13:20 AM

1 kudos

Hi @Skr7, You cannot automate exporting the dashboard as HTML using the Databricks API. The Databricks API only supports exporting results for notebook task runs, not for job run dashboards. Here's the relevant excerpt from the provided sources: Exp...

1 kudos

09-22-2023 12:13:20 AM

1 More Replies

by Anske • New Contributor II

2 weeks ago

96 Views
1 replies
0 kudos

DLT apply_changes applies only deletes and inserts not updates

Hi,I have a DLT pipeline that applies changes from a source table (cdctest_cdc_enriched) to a target table (cdctest), by the following code:dlt.apply_changes( target = "cdctest", source = "cdctest_cdc_enriched", keys = ["ID"], sequence_by...

Data Engineering

Delta Live Tables

96 Views
1 replies
0 kudos

2 weeks ago

View Replies

Latest Reply

Kaniz
Community Manager

2 hours ago

0 kudos

Hi @Anske, It seems you’re encountering an issue with your Delta Live Tables (DLT) pipeline where updates from the source table are not being correctly applied to the target table. Let’s troubleshoot this together! Pipeline Update Process: Whe...

0 kudos

2 hours ago

by niruban • New Contributor II

2 weeks ago

80 Views
1 replies
0 kudos

Migrate a notebook that reside in workspace using Databricks Asset Bundle

Hello Community Folks -Did anyone implemented migration of notebooks that is in workspace to production databricks workspace using Databricks Asset Bundle? If so can you please help me with any documentation which I can refer? Thanks!!RegardsNiruban ...

Data Engineering

80 Views
1 replies
0 kudos

2 weeks ago

View Replies

Latest Reply

Kaniz
Community Manager

2 hours ago

0 kudos

Hi @niruban, Migrating notebooks from one Databricks workspace to another using Databricks Asset Bundles is a useful approach. Let me guide you through the process and provide relevant documentation. Databricks Asset Bundles Overview: Databricks ...

0 kudos

2 hours ago

by Oliver_Angelil • Valued Contributor II

2 weeks ago

105 Views
1 replies
0 kudos

Append-only table from non-streaming source in Delta Live Tables

I have a DLT pipeline, where all tables are non-streaming (materialized views), except for the last one, which needs to be append-only, and is therefore defined as a streaming table.The pipeline runs successfully on the first run. However on the seco...

Data Engineering

105 Views
1 replies
0 kudos

2 weeks ago

View Replies

Latest Reply

Kaniz
Community Manager

2 hours ago

0 kudos

Hi @Oliver_Angelil, It appears that you’re encountering an issue with your DLT (Databricks Delta Live Tables) pipeline, specifically related to having an append-only table at the end of the pipeline. Let’s explore some potential solutions: Stream...

0 kudos

2 hours ago

User

Count

1603

736

344

284

247

Databricks

Forum Posts

Dashboard always display truncated data

Adding Member to group using account databricks rest api

DLT run filas with "com.databricks.cdc.spark.DebeziumJDBCMicroBatchProvider not found"

Databricks Model Registry Notification

Custom ENUM input as parameter for SQL UDF?

SQL function refactoring into Databricks environment

Error during deserializing protobuf data

Databricks Asset Bundles

Job Cluster in Databricks workflow

Connect power bi desktop semantic model output to databricks

AWS Secrets Works In One Cluster But Not Another

Resolved! Scheduled job output export

DLT apply_changes applies only deletes and inserts not updates

Migrate a notebook that reside in workspace using Databricks Asset Bundle

Append-only table from non-streaming source in Delta Live Tables

Azure Data Factory and Photon

Scheduled job output export

Upload file from local file system to Unity Catalo...

Best way to parse Google Analytics data in Databri...

DELTA_EXCEED_CHAR_VARCHAR_LIMIT