Data Engineering

Forum Posts

Sorted by:

by tomph • New Contributor II

02-02-2024 1:41:47 AM

3498 Views
2 replies
0 kudos

Resolved! Databricks Asset Bundles - Manage existing jobs

Hello,we are starting to experiment with Databricks Asset Bundles, especially to keep jobs aligned between workspaces. Is there a way to start managing existing jobs, to avoid erasing previous runs history?Thank you,Tommaso

Data Engineering

3498 Views
2 replies
0 kudos

02-02-2024 1:41:47 AM

View Replies

Latest Reply

tomph
New Contributor II

02-05-2024 12:37:06 AM

0 kudos

Great news, thanks!

0 kudos

02-05-2024 12:37:06 AM

1 More Replies

by matt_stanford • New Contributor III

02-04-2024 4:35:52 AM

4215 Views
1 replies
0 kudos

Resolved! Type 2 SCD when using Auto Loader

Hi there! I'm pretty new to using Auto Loader, so this may be a really obvious fix, but it's stumped me for a few weeks, so I'm hoping someone can help! I have a small csv file saved in ADLS with a list of pizzas for an imaginary pizza restaurant. I'...

Data Engineering

4215 Views
1 replies
0 kudos

02-04-2024 4:35:52 AM

View Replies

Latest Reply

matt_stanford
New Contributor III

02-05-2024 12:16:00 AM

0 kudos

So, I figured out what the issue was. I needed to delete checkpoint folder. After I did this and re-ran the notebook, everything worked fine!

0 kudos

02-05-2024 12:16:00 AM

by AlexPedurand • New Contributor

02-02-2024 7:36:32 AM

2612 Views
1 replies
0 kudos

SOAP API - Connection

HelloWe have a workflow in our team to perform usual monthly tasks to be ran on the first working day of the month.Each of the ~20 users will run a clone of this workflow most likely all around the same time but with different options. Because we don...

Data Engineering

2612 Views
1 replies
0 kudos

02-02-2024 7:36:32 AM

View Replies

Latest Reply

feiyun0112
Honored Contributor

02-02-2024 6:59:13 PM

0 kudos

maybe you can set a lock before call SOAP APIpython - Using a Lock with redis-py - Stack Overflow

0 kudos

02-02-2024 6:59:13 PM

by data1233 • New Contributor

02-02-2024 12:11:47 PM

2733 Views
1 replies
0 kudos

create an array sorted by a field

How do i create an array from a field while applying sorting?how do I do this in data brick since databricks does not support order by in array_agg? same is possible in Snowflake and(Array agg) or Redshift(listagg). SELECT ARRAY_AGG(O_ORDERKEY) WITH...

Data Engineering

2733 Views
1 replies
0 kudos

02-02-2024 12:11:47 PM

View Replies

Latest Reply

feiyun0112
Honored Contributor

02-02-2024 6:50:28 PM

0 kudos

%sql SELECT array_sort(array_agg(col) ,(left, right) -> CASE WHEN left < right THEN -1 WHEN left > right THEN 1 ELSE 0 END) arr_col FROM VALUES (3), (2), (1) AS tab(col); https://docs.databricks.com/en/sql/language-manual/functions/array_sort.h...

0 kudos

02-02-2024 6:50:28 PM

by jerryrard • New Contributor

02-02-2024 11:04:34 AM

2879 Views
2 replies
0 kudos

Python Databricks how to run all cells in another notebook except the last cell

I have a Python Databricks notebook which I want to call/run another Databricks notebook using dbutils.notebook.run()... but I want to run all the cells in the "called" notebook except the last one.Is there a way to do a count of cells in the called ...

Data Engineering

2879 Views
2 replies
0 kudos

02-02-2024 11:04:34 AM

View Replies

Latest Reply

feiyun0112
Honored Contributor

02-02-2024 5:53:26 PM

0 kudos

In the alternative way, you can use dbutils.notebook.run to pass the parameters, and use dbutils.widgets.get in another notebook to get the parameter values,and determine the parameter values to decide whether to execute codes in the specified cellh...

0 kudos

02-02-2024 5:53:26 PM

1 More Replies

by Kai • New Contributor II

02-02-2024 2:00:40 AM

5285 Views
1 replies
0 kudos

Resolved! Differences Between "TEMPORARY STREAMING TABLE" and "TEMPORARY STREAMING LIVE VIEW" in DLT

Hello Databricks community,I'm seeking clarification on the distinctions between the following two syntaxes:CREATE OR REFRESH TEMPORARY STREAMING TABLECREATE TEMPORARY STREAMING LIVE VIEWAs of my understanding, both of these methods do not store data...

Data Engineering

5285 Views
1 replies
0 kudos

02-02-2024 2:00:40 AM

View Replies

Latest Reply

gabsylvain
Databricks Employee

02-02-2024 7:38:58 AM

0 kudos

Hi @Kai, The two syntaxes you're asking about, CREATE OR REFRESH TEMPORARY STREAMING TABLE and CREATE TEMPORARY STREAMING LIVE VIEW, are used in Delta Live Tables and have distinct purposes. CREATE OR REFRESH TEMPORARY STREAMING TABLE: This syntax i...

0 kudos

02-02-2024 7:38:58 AM

by leaw • Databricks Partner

01-12-2024 8:34:29 AM

9628 Views
7 replies
0 kudos

Resolved! How to load xml files with spark-xml ?

Hello,I cannot load xml files.First, I tried to install Maven library com.databricks:spark-xml_2.12:0.14.0 as it is told in documentation, but I could not find it. I only have HyukjinKwon:spark-xml:0.1.1-s_2.10, with this one I have this error: DRIVE...

Data Engineering

9628 Views
7 replies
0 kudos

01-12-2024 8:34:29 AM

View Replies

Latest Reply

Frustrated_DE
New Contributor III

02-01-2024 1:29:39 AM

0 kudos

Mismatch on Scala version, my bad! Sorted

0 kudos

02-01-2024 1:29:39 AM

6 More Replies

by rsamant07 • New Contributor III

01-26-2024 7:01:40 AM

4970 Views
2 replies
1 kudos

DBT JOBS FAILING

HI ,we have dbt workflow jobs and its been failing randomly from last few days with below error. is there any known issue for this , any help on the root cause will be helpful.Encountered an error: Runtime Error Database Error __init__() got an une...

Data Engineering

dbt

4970 Views
2 replies
1 kudos

01-26-2024 7:01:40 AM

View Replies

Latest Reply

rsamant07
New Contributor III

02-01-2024 1:10:55 AM

1 kudos

setting dbt-databricks==1.7.3 solved this issue but now we randomly get the below error . it gets fixed after restrating the cluster sometimes. but is there any permanent solution for this ? from dbt.events import types_pb2 File "/databricks/python3/...

1 kudos

02-01-2024 1:10:55 AM

1 More Replies

by Andyt • New Contributor

06-29-2023 3:57:22 PM

2268 Views
1 replies
0 kudos

Restore sql editor

any Options restore sql editors query after workspace was accidentally deleted and restored

Data Engineering

2268 Views
1 replies
0 kudos

06-29-2023 3:57:22 PM

View Replies

Latest Reply

arpit
Databricks Employee

01-31-2024 11:34:07 PM

0 kudos

@Andyt If the workspace is accidentally deleted, there is not way to retrieve content from SQL editor.

0 kudos

01-31-2024 11:34:07 PM

by WhistlePodu • New Contributor

08-13-2023 8:17:58 AM

3568 Views
1 replies
0 kudos

How to get Workflow status and error description programmatically ?

Hi,I want to take some basic info by running workflow and populate a table with those data. I want to add logic programmatically in a notebook and will run it by attaching it in a task of workflow.Information required to be populated in table:Job idJ...

Data Engineering

3568 Views
1 replies
0 kudos

08-13-2023 8:17:58 AM

View Replies

Latest Reply

arpit
Databricks Employee

01-31-2024 11:21:19 PM

0 kudos

@WhistlePodu You can review the jobs API for getting the other fields like jobs status etc

0 kudos

01-31-2024 11:21:19 PM

by Sangram • New Contributor III

01-21-2024 6:10:41 AM

1687 Views
1 replies
1 kudos

data engineer course materials are throwing error

Your course material for data engineering associate program is throwing error.Please correct the below error: -This is from section 2.2 "ETL with Spark".

Data Engineering

data engineering

pyspark

spark

1687 Views
1 replies
1 kudos

01-21-2024 6:10:41 AM

View Replies

Latest Reply

arpit
Databricks Employee

01-31-2024 10:35:10 PM

1 kudos

@Sangram Can you please confirm if that DBFS exists in the specified location?

1 kudos

01-31-2024 10:35:10 PM

by Fnazar • New Contributor II

01-31-2024 3:15:45 AM

1731 Views
1 replies
0 kudos

Streaming delta table - Performance with incremental refresh

Hi Team,We are hitting performance issues with Streaming live delta table specifically when evaluating large tables of more than 10million rows. What are the workarounds to handle these streaming live tables in an attempt to load these large tables. ...

Data Engineering

1731 Views
1 replies
0 kudos

01-31-2024 3:15:45 AM

View Replies

Latest Reply

Priyanka_Biswas
Databricks Employee

01-31-2024 5:24:09 PM

0 kudos

Hi @Fnazar When dealing with streaming data, you might end up with many small files, which can be inefficient. Use Delta Lake's OPTIMIZE command to compact files into larger ones and ZORDER to colocate related information in the same set of files. T...

0 kudos

01-31-2024 5:24:09 PM

by TCorr15 • Databricks Partner

01-25-2024 2:15:48 AM

7915 Views
1 replies
0 kudos

Databricks Connect V2 - OPENSSL_internal: CERTIFICATE_VERIFY_FAILED

I am getting an error when using Databricks V2 in when running anything relating to databricks-sql-connector/databricks.sql.connect(). Would anyone know how to resolve this issue?Sample Error Message Additional DetailsPython Version 3.11.4Sample Code...

Data Engineering

7915 Views
1 replies
0 kudos

01-25-2024 2:15:48 AM

View Replies

Latest Reply

arpit
Databricks Employee

01-31-2024 5:18:55 PM

0 kudos

Can you directly use Databricks connect and validate if it works from CLI?Also, confirm the databrics-connect version please

0 kudos

01-31-2024 5:18:55 PM

by nitinsingh1 • Databricks Partner

06-22-2022 8:19:30 AM

5692 Views
5 replies
2 kudos

Databricks Runtime compatibility error with latest version while reading from (ADLS) Dynamic 365 .

We are trying to establish ingestion from dynamic 365 >> ADLS >> Databricks, While reading information we need to use databricks runtime 6.4 to read the raw data from ADLS into Databricks. Latest databricks runtime couldn’t be used, Need your help to...

Data Engineering

5692 Views
5 replies
2 kudos

06-22-2022 8:19:30 AM

View Replies

Latest Reply

BobBubble2000
New Contributor II

08-09-2023 1:20:19 AM

2 kudos

Hi @nitinsingh1 Thank you for bringing up this topic, I'm also currently looking into how to ingest exported Dynamics 365 FO data (csv files with CDM) from ADLS into Databricks. Could you share how you achieved this? I'd be very curious to see your a...

2 kudos

08-09-2023 1:20:19 AM

4 More Replies

by Manjusha • New Contributor II

01-29-2024 9:51:43 AM

2670 Views
3 replies
0 kudos

Failed to create notebook on community edition

Hi,I am unable to create new notebook on databricks community edition. getting error 'failed to create notebook' when I click on create-> notebookIs anyone else facing the same issue? if so, any tips on how to resolve it?

Data Engineering

2670 Views
3 replies
0 kudos

01-29-2024 9:51:43 AM

View Replies

Latest Reply

jose_gonzalez
Databricks Employee

01-31-2024 11:19:15 AM

0 kudos

Thank you for the update. Please select the best response as a solution, so other community members will be able to get unblock if they have this issue

0 kudos

01-31-2024 11:19:15 AM

2 More Replies

Databricks Community

Forum Posts

Resolved! Databricks Asset Bundles - Manage existing jobs

Resolved! Type 2 SCD when using Auto Loader

SOAP API - Connection

create an array sorted by a field

Python Databricks how to run all cells in another notebook except the last cell

Resolved! Differences Between "TEMPORARY STREAMING TABLE" and "TEMPORARY STREAMING LIVE VIEW" in DLT

Resolved! How to load xml files with spark-xml ?

DBT JOBS FAILING

Restore sql editor

How to get Workflow status and error description programmatically ?

data engineer course materials are throwing error

Streaming delta table - Performance with incremental refresh

Databricks Connect V2 - OPENSSL_internal: CERTIFICATE_VERIFY_FAILED

Databricks Runtime compatibility error with latest version while reading from (ADLS) Dynamic 365 .

Failed to create notebook on community edition

File Arrival Trigger - Multiple tables

Issue while handling Deletes and Inserts in Struct...

DLT with CDC and schema changes in streaming pipel...

how to update not tracked column only in new row v...

Databricks Cost Estimation Template