Data Engineering

Forum Posts

by 332588 • New Contributor II

01-11-2023 4:14:11 AM

747 Views
3 replies
3 kudos

We have been using Databricks managed MLflow to log experiment runs for quite some time and have never experienced issues. However, we now seem to have encountered a bug in the associated Databricks UI.

We observe the following behavior when we keep adding new runs to an experiment:
- In the beginning, the runs are still displayed correctly in the UI.
- After a certain number of total runs, the following bug occurs in the UI:
  - In the UI, there are ...

Latest Reply

Debayan
Esteemed Contributor III

02-28-2023 10:25:33 PM

3 kudos

Hi @Timo Burmeister, apologies for the delay! I went through the video. Does it happen all the time? I see that the list appears after sorting with a different filter.

2 More Replies

by prasadvaze • Valued Contributor

01-31-2023 2:49:24 PM

5149 Views
3 replies
0 kudos

Error loading a MANAGED table in Unity Catalog Delta Lake on Azure. Has anyone seen this issue? "ErrorClass=INVALID_PARAMETER_VALUE] Input path <file system name>.dfs.core.windows.net overlaps with other external tables"

00007160: 2023-01-30T14:22:06 [TARGET_LOAD ]E: Failed (retcode -1) to execute statement: 'COPY INTO `e2underwriting_dbo`.`product` FROM(SELECT cast(_c0 as INT) as `ProductID`, _c1 as `ShortName`, cast(_c2 as INT) as `Status`, cast(_c3 as TIMESTA...

Latest Reply

prasadvaze
Valued Contributor

02-28-2023 1:53:50 PM

0 kudos

We have solved this issue, which was related to Qlik Replicate copying data into a Delta table.

2 More Replies

by youssefmrini • Honored Contributor III

02-28-2023 3:17:57 AM

631 Views
1 reply
1 kudos

Resolved! Does Databricks Workflows support continuous jobs?

Latest Reply

youssefmrini
Honored Contributor III

02-28-2023 3:18:02 AM

1 kudos

You can ensure there is always an active run of your Databricks job with the new continuous trigger type. https://docs.databricks.com/workflows/jobs/jobs.html#continuous-jobs
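As a rough illustration of the reply above, the continuous trigger can be expressed as a job-settings payload for the Jobs API. This is a minimal sketch, assuming the Jobs API 2.1 field names `continuous` and `pause_status`; the job name, notebook path, and cluster ID are hypothetical placeholders, and the linked documentation remains the authority on the exact schema.

```python
import json


def continuous_job_settings(name, notebook_path, cluster_id, paused=False):
    """Build a job-settings dict that uses the continuous trigger.

    Field names follow the Jobs API 2.1 as understood here (an assumption);
    check them against the documentation linked in the reply.
    """
    return {
        "name": name,
        "tasks": [
            {
                "task_key": "main",
                "existing_cluster_id": cluster_id,
                "notebook_task": {"notebook_path": notebook_path},
            }
        ],
        # A continuous job has no cron schedule; it is either paused or unpaused.
        "continuous": {"pause_status": "PAUSED" if paused else "UNPAUSED"},
    }


settings = continuous_job_settings(
    name="example-continuous-job",             # hypothetical job name
    notebook_path="/Shared/example_notebook",  # hypothetical notebook path
    cluster_id="0000-000000-example",          # hypothetical cluster ID
)
print(json.dumps(settings, indent=2))
```

The resulting JSON could be sent as the body of a job create request; nothing in the sketch contacts a workspace.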

by tw1 • New Contributor III

02-15-2023 3:09:21 AM

4828 Views
9 replies
3 kudos

Resolved! Can't write / overwrite delta table with error: oxxxx.saveAsTable. (Driver Error: OutOfMemory)

Current cluster config:
- Workers: Standard_DS3_v2 (14 GB, 4 cores), 2-6 workers
- Driver: Standard_DS3_v2 (14 GB, 4 cores)
- Runtime: 10.4x-scala2.12

We want to overwrite a temporary Delta table with new records. The records will be loaded from another Delta table and tran...

Latest Reply

tw1
New Contributor III

02-26-2023 11:39:12 PM

3 kudos

Hi, thank you for your help! We tested the configuration settings and it runs without any errors. Could you give us some more information on where we can find documentation about such settings? We searched for hours to fix our problem. So we contacted th...

8 More Replies

by Lulka • New Contributor II

02-20-2023 11:55:17 PM

2364 Views
4 replies
2 kudos

Resolved! How to limit the input rate when reading a Delta table as a stream?

Hello everyone! I am trying to read a Delta table as a streaming source using Spark, but my micro-batches are unbalanced: one is very small and the others are huge. How can I limit this? I used different configurations with maxBytesPerTrigger and m...

Latest Reply

Kaniz
Community Manager

02-22-2023 2:50:28 AM

2 kudos

Hi @Yuliya Valava, if you are setting the maxBytesPerTrigger and maxFilesPerTrigger options when reading a Delta table as a stream, but the batch size is not changing, there could be a few reasons for this:
- The input data rate is not exceeding the li...
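For reference, the two options named in this thread are set on the streaming reader. The following is a minimal sketch, not the configuration from the thread: `rate_limit_options` and `read_delta_stream` are hypothetical helpers, the limit values are arbitrary, and `spark` is assumed to be an existing SparkSession such as the one a Databricks notebook provides.

```python
def rate_limit_options(max_files=None, max_bytes=None):
    """Return Delta streaming-source rate-limit options, skipping unset ones."""
    options = {}
    if max_files is not None:
        options["maxFilesPerTrigger"] = str(max_files)
    if max_bytes is not None:
        options["maxBytesPerTrigger"] = str(max_bytes)
    return options


def read_delta_stream(spark, table_name, **limits):
    """Open a Delta table as a stream with the limits applied.

    `spark` is an existing SparkSession and `table_name` is a placeholder;
    this function is not called here, so the sketch runs without a cluster.
    """
    return spark.readStream.options(**rate_limit_options(**limits)).table(table_name)


# Arbitrary example values: at most 100 files or roughly 512 MiB per micro-batch.
print(rate_limit_options(max_files=100, max_bytes=512 * 1024 * 1024))
```

When both options are set, a micro-batch is cut off at whichever limit is reached first, and the byte limit is a soft maximum, so batches can still vary in size.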

3 More Replies

by Erik • Valued Contributor II

01-30-2022 8:01:05 AM

9628 Views
22 replies
15 kudos

How to enable/verify cloud fetch from PowerBI

I tried to benchmark the Power BI Databricks connector against the Power BI Delta Lake reader on a dataset of 2.15 million rows. I found that the Delta Lake reader took 20 seconds, while importing through the SQL compute endpoint took ~75 seconds. When I loo...

Latest Reply

pulkitm
New Contributor III

02-27-2023 7:24:33 AM

15 kudos

Guys, is there any way to switch off CloudFetch and fall back to ArrowResultSet by default, irrespective of size, using the latest version of the Spark Simba ODBC driver?

21 More Replies

by RyanHager • Contributor

04-28-2022 12:15:29 PM

1597 Views
6 replies
2 kudos

Are there any plans to support functions such as day() on the partition-by fields of a Delta table definition? A similar capability exists in Iceberg.

Benefit: This would simplify the WHERE clauses for consumers of the tables. They could query on the main date field to get all the data for a day, instead of on an extra day field that we had to create.

Latest Reply

Kaniz
Community Manager

05-13-2022 6:05:52 AM

2 kudos

Hi @Ryan Hager, just a friendly follow-up. Do you still need help, or did @Hubert Dudek (Customer)'s response help you find the solution? Please let us know.

5 More Replies

by akihiko • New Contributor III

02-23-2023 11:47:37 PM

1673 Views
3 replies
1 kudos

Resolved! Attach notebook to cluster via REST API

Is it possible to attach a notebook to a cluster and run it via the REST API? The closest approach I have found is to run a notebook, export the results (HTML!) and import them into the workspace again, but this does not allow us to retain the original ex...

Latest Reply

Vivian_Wilfred
Honored Contributor

02-24-2023 6:33:24 AM

1 kudos

Hi @Akihiko Nagata, have you checked the Jobs API? You can run a job on an existing cluster that uses the notebook in question. I believe this is the only way. https://docs.databricks.com/dev-tools/api/latest/jobs.html#operation/JobsRunsSubmit
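A one-time run submission along the lines of this reply might look like the following. This is a minimal sketch, assuming the `/api/2.1/jobs/runs/submit` endpoint and its `existing_cluster_id` and `notebook_task` fields; the workspace URL, token, cluster ID, and notebook path are hypothetical placeholders. The request is only built, not sent.

```python
import json
import urllib.request


def build_runs_submit_request(host, token, cluster_id, notebook_path):
    """Build (but do not send) a one-time run request for a notebook."""
    payload = {
        "run_name": "one-time notebook run",
        "tasks": [
            {
                "task_key": "notebook",
                # Run on an already existing cluster instead of a new job cluster.
                "existing_cluster_id": cluster_id,
                "notebook_task": {"notebook_path": notebook_path},
            }
        ],
    }
    return urllib.request.Request(
        url=f"{host}/api/2.1/jobs/runs/submit",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_runs_submit_request(
    host="https://example.cloud.databricks.com",  # hypothetical workspace URL
    token="dapi-EXAMPLE",                         # hypothetical access token
    cluster_id="0000-000000-example",             # hypothetical cluster ID
    notebook_path="/Shared/example_notebook",     # hypothetical notebook path
)
print(request.get_method(), request.full_url)
```

Passing the request to `urllib.request.urlopen` would submit the run against a real workspace; the response is expected to include a run ID that can be polled for status.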

2 More Replies

by hare • New Contributor III

02-19-2023 10:56:59 PM

1666 Views
4 replies
3 kudos

Implementation of late-arriving dimensions in Databricks

Hi Team, can you please suggest how to implement a late-arriving dimension or early-arriving fact, with examples or a sample script for reference? I have to implement this using PySpark. Thanks.

Latest Reply

Anonymous
Not applicable

02-21-2023 10:54:38 PM

3 kudos

Hi @Hare Krishnan, hope all is well! Just wanted to check in: were you able to resolve your issue, and would you be happy to share the solution or mark an answer as best? Otherwise, please let us know if you need more help. We'd love to hear from you. Than...

3 More Replies

by none_ranjeet • New Contributor III

01-09-2023 6:09:11 AM

1683 Views
4 replies
2 kudos

Resolved! Passed the Fundamentals of the Databricks Lakehouse Platform Accreditation, but no badge received. Tried "https://v2.accounts.accredible.com/retrieve-credentials?" showing no badge.

Passed the Fundamentals of the Databricks Lakehouse Platform Accreditation, but no badge received. Tried "https://v2.accounts.accredible.com/retrieve-credentials?" showing no badge.

Latest Reply

Chaitanya_Raju
Honored Contributor

01-10-2023 5:57:40 AM

2 kudos

Hi @Ranjeet Ahlawat, congratulations on the certification. For any Databricks certification you take, you will receive the certificate and the badge within 24-48 hours, and sometimes sooner. All the best for your future certifi...

3 More Replies

by asami34 • New Contributor II

01-23-2023 2:47:28 PM

2088 Views
7 replies
0 kudos

Cannot reset password, no support

I cannot log in to my Databricks community account. I have already tried to receive support and no real support has been given. I attempt to reset my password, the link gets sent, but once I enter the new password it gets stuck permanently loading. I...

Latest Reply

Anonymous
Not applicable

02-24-2023 8:12:52 PM

0 kudos

Hi @Ahmet Korkmaz, hope all is well! Just wanted to check in: were you able to resolve your issue, and would you be happy to share the solution or mark an answer as best? Otherwise, please let us know if you need more help. We'd love to hear from you. Than...

6 More Replies

by sujai_sparks • New Contributor III

02-24-2023 8:42:54 AM

8411 Views
14 replies
15 kudos

Resolved! How to convert records in Azure Databricks delta table to a nested JSON structure?

Let's say I have a Delta table in Azure Databricks that stores staff details (denormalized). I want to export the data in JSON format and save it as a single file in a storage location. I need help with the Databricks SQL query to group/co...

Latest Reply

NateAnth
Valued Contributor

02-24-2023 6:14:40 PM

15 kudos

Glad it worked for you!!

13 More Replies

by Shanthala • New Contributor III

01-23-2023 12:47:02 PM

857 Views
3 replies
3 kudos

Where is the learning material to get Fundamentals of the Databricks Lakehouse Platform Accreditation?

Please provide some information about how to get the material needed to pass the Fundamentals of the Databricks Lakehouse Platform Accreditation.

Latest Reply

jose_gonzalez
Moderator

02-24-2023 3:45:06 PM

3 kudos

Hi @Shanthala Baleer, just a friendly follow-up. Are you still looking for help? Adding @Vidula Khanna for visibility.

2 More Replies

by DavidMayer-Foul • New Contributor II

02-20-2023 6:51:16 PM

523 Views
2 replies
0 kudos

How to restart the Snowflake connector?

After using spark.read.format("snowflake").options(**options).option("dbtable", "table_name").load() to read a table from Snowflake, when I then change the table in Snowflake and read it again, it gives me the first version of the table. I have wor...

Latest Reply

DavidMayer-Foul
New Contributor II

02-24-2023 4:51:40 PM

0 kudos

Yes, that would work. However, it is a longish Snowflake query producing a number of tables that are all called by the Databricks notebook, so it requires quite a few changes. I'll use this alternative if I automate the process. However, I think this...

1 More Replies

by EmilioGC • New Contributor III

02-01-2023 9:56:52 PM

2626 Views
5 replies
7 kudos

Resolved! Why was SQL formatting removed inside spark.sql functions? Now it looks like a plain string.

Previously, SQL queries inside spark.sql() were syntax-highlighted. Now they just look like a plain string. I know it's not a big issue, but it's still annoying to write SQL that is all blue; it makes debugging more cumber...

Latest Reply

jose_gonzalez
Moderator

02-24-2023 3:52:00 PM

7 kudos

Hi @Emilio Garza, just a friendly follow-up. Did any of the responses help you resolve your question? If so, please mark it as best. Otherwise, please let us know if you still need help.

4 More Replies
