I need to execute a DLT pipeline from a Job, and I would like to know if there is any way of passing a parameter. I know you can have settings in the pipeline that you use in the DLT notebook, but it seems you can only assign values to them when crea...
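One common pattern (not per-run job parameters, which DLT pipeline tasks have historically not supported) is to put key/value pairs in the pipeline's `configuration` settings and read them in the notebook with `spark.conf.get`. A sketch of the settings JSON, with a hypothetical key name:

```json
{
  "name": "my_dlt_pipeline",
  "configuration": {
    "mypipeline.start_date": "2024-01-01"
  }
}
```

Inside the DLT notebook the value would then be read with `spark.conf.get("mypipeline.start_date")`. Changing the value per run would still require updating the pipeline settings (e.g. via the Pipelines API) before triggering it.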
I've created a streaming live table from a foreign catalog. When I run the DLT pipeline it fails with "com.databricks.cdc.spark.DebeziumJDBCMicroBatchProvider not found". I haven't seen any documentation that suggests I need to install Debezium manuall...
Hi friends -
To confirm: with the new Lakeview dashboards, you can share dashboards with users and groups in your organization without having to provide any workspace and/or compute access.
https://docs.databricks.com/en/dashboards/index.html#what-is-shar...
I am reaching out to bring attention to a performance issue we are encountering while processing XML files using Spark-XML, particularly with the configuration spark.read().format("com.databricks.spark.xml"). Currently, we are experiencing significant...
@amar1995 - Can you try this streaming approach and see if it works for your use case (using autoloader) - https://kb.databricks.com/streaming/stream-xml-auto-loader
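A minimal sketch of the Auto Loader approach from that KB article, assuming a Databricks Runtime with native XML support; the paths, `rowTag`, and function name are placeholders to adapt:

```python
def read_xml_stream(spark, source_path, schema_location, row_tag="record"):
    """Incrementally ingest XML files with Auto Loader instead of a one-shot
    spark-xml batch read. Assumes a Databricks Runtime with XML support;
    `source_path` and `schema_location` are cloud storage paths."""
    return (
        spark.readStream.format("cloudFiles")
        .option("cloudFiles.format", "xml")                    # Auto Loader file format
        .option("rowTag", row_tag)                             # XML element mapped to one row
        .option("cloudFiles.schemaLocation", schema_location)  # schema inference/tracking dir
        .load(source_path)
    )
```

The resulting stream would typically be written out with `.writeStream` to a Delta table, so only new files are processed on each run instead of re-reading the whole directory.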
Following the instructions on Job Parameter dynamic values, I am able to use {{job.id}}, {{job.name}}, {{job.run_id}}, {{job.repair_count}}, and {{job.start_time.[argument]}}. However, when I set trigger_type as trigger_type: {{job.trigger.type}} and hit SAVE, ...
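For reference, dynamic value references are usually attached as defaults on job-level parameters. A sketch of what that block looks like in the job settings (parameter names are examples; `iso_date` is one of the documented `start_time` arguments):

```json
"parameters": [
  { "name": "run_id",  "default": "{{job.run_id}}" },
  { "name": "started", "default": "{{job.start_time.iso_date}}" }
]
```

If a particular reference such as `{{job.trigger.type}}` is rejected on save, it may not be supported in that field in your workspace version; checking the dynamic value references documentation for the current list is the safest bet.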
Good morning. I have a DLT process with CDC incremental load, and I need to ingest the history, as the CDC transactions only cover the recent period. To do this I need to ingest data into the __databricks_internal catalog. In my case, as I am a full admin, I can do it, how...
Hi all, I have recently enabled Unity Catalog in my DBX workspace. I have created a new catalog with an external location on Azure data storage. I can create new schemas (databases) in the new catalog, but I can't create a table. I get the below error wh...
@Snoonan First of all, check the Networking tab on the storage account to see if it's behind a firewall. If it is, make sure that Databricks/storage networking is properly configured (https://learn.microsoft.com/en-us/azure/databricks/security/network/...
I have a situation where source files in .json.gz sometimes arrive with invalid syntax containing multiple roots separated by empty brackets []. How can I detect this and throw an exception? Currently the code runs and picks up only record set 1, and ...
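One option is a pre-flight validation step before Spark ever reads the file: decompress it, parse one JSON root, and raise if anything follows. This is a standalone sketch using only the standard library; adapt the error handling to your pipeline:

```python
import gzip
import json

def validate_json_gz(path):
    """Fail fast if a .json.gz file contains anything beyond a single JSON root.

    Spark's PERMISSIVE reader can silently keep only the first record set,
    so this pre-flight check raises instead of ingesting partial data.
    Sketch only - for very large files, a streaming check would be preferable
    to reading the whole payload into memory.
    """
    with gzip.open(path, "rt", encoding="utf-8") as f:
        text = f.read().lstrip()
    # raw_decode parses one JSON value and reports where it ended
    obj, end = json.JSONDecoder().raw_decode(text)
    trailing = text[end:].strip()
    if trailing:
        raise ValueError(
            f"{path}: extra content after first JSON root "
            f"(starts with {trailing[:20]!r}) - likely multiple roots"
        )
    return obj
```

Calling `validate_json_gz` on each arriving file (and routing failures to a quarantine location) lets the pipeline reject the malformed multi-root files instead of silently loading only the first record set.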
Hi all! Recently we've been getting lots of these errors when running Databricks notebooks: At that time we observed a DRIVER_NOT_RESPONDING ("Driver is up but is not responsive, likely due to GC.") log on the single-user cluster we use. Previously when thi...
Hi, I am trying to use Java SQL. I can see that the query on Databricks is executed properly. However, on my client I get an exception (see below). Versions: JDK 20.0.1 (also tried version 16, same results). https://www.oracle.com/il-en/java/technologies/...
I'm facing an issue while trying to run my job in Databricks with my notebooks located in GitLab. When I run the job under my personal user ID it works fine, because I added a GitLab token to my user profile, so the job is able to pull the branch from the repository. But whe...
Hi @drag7ter, There might be a missing piece in the setup.
Ensure that you’ve correctly entered the Git provider credentials (username and personal access token) for your Service Principal. Confirm that you’ve selected the correct Git provider (GitLab...
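Since Git credentials are per-identity, the service principal needs its own GitLab credential registered, which can be done via the Git Credentials REST API (`POST /api/2.0/git-credentials`) while authenticating as the service principal. A hedged sketch; the host, tokens, and the exact `git_provider` value should be checked against the API docs:

```python
import json
import urllib.request

def set_git_credentials(host, sp_token, gitlab_user, gitlab_pat):
    """Register GitLab credentials on behalf of a service principal.

    `sp_token` must be a Databricks token minted FOR the service principal,
    since Git credentials belong to the calling identity. Sketch only -
    endpoint and field names per the Git Credentials REST API.
    """
    body = json.dumps({
        "git_provider": "gitLab",           # provider enum value (assumption on casing)
        "git_username": gitlab_user,
        "personal_access_token": gitlab_pat,
    }).encode()
    req = urllib.request.Request(
        f"{host}/api/2.0/git-credentials",
        data=body,
        headers={
            "Authorization": f"Bearer {sp_token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

After the credential is registered for the service principal, jobs running as that principal should be able to check out the GitLab repo the same way your personal user does.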
Hi, I'm looking for information on how to create/mount ephemeral storage on a Databricks driver node in Azure. Does anyone have experience working with ephemeral storage? Thanks,
Hi @cszczotka,
Azure Databricks allows you to mount cloud object storage to the Databricks File System (DBFS) to simplify data access patterns for users who are unfamiliar with cloud concepts.
Mounted data does not work with Unity Catalog, and Dat...
Hello all. We are a new team implementing DLT and have set up a number of tables in a pipeline loading from S3 with UC as the target. I'm noticing that if any of the 20 or so tables fail to load, the entire pipeline fails, even when there are no depende...
Hi @dashawn,
When data processing fails, manual investigation of logs to understand the failures, data cleanup, and determining the restart point can be time-consuming and costly. DLT provides features to handle errors more intelligently. By default,...
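For row-level problems, DLT expectations let a table drop or flag bad records instead of failing the run. A sketch, with example table and column names; `dlt` is passed in here only because the module exists solely inside a Delta Live Tables pipeline runtime (in a real notebook you would just `import dlt` at the top):

```python
def register_clean_table(dlt, spark):
    """Sketch of a DLT table that drops invalid rows rather than failing.

    `dlt` is injected because the module is only available inside a DLT
    pipeline; `orders_raw`, `order_id`, and `event_ts` are example names.
    """

    @dlt.table(name="orders_clean", comment="Orders with basic quality gates")
    @dlt.expect_or_drop("valid_order_id", "order_id IS NOT NULL")  # drop bad rows
    @dlt.expect("recent_ts", "event_ts > '2020-01-01'")           # warn only, keep row
    def orders_clean():
        return spark.readStream.table("orders_raw")

    return orders_clean
```

Note that expectations address bad *records*; a table that fails to load entirely (e.g. a source error) is a different failure mode, where retry settings and pipeline-level error handling apply instead.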