Data Engineering

Forum Posts

by Anonymous • Not applicable

06-23-2022 10:38:14 AM

12487 Views
7 replies
11 kudos

Resolved! MetadataChangedException

A Delta Lake table was created with an identity column, and I'm not able to load data into it in parallel from four processes. I'm getting the metadata exception error. I don't want to load the data into a temp table. I need to load directly and in parallel into delta...

Latest Reply

seans
New Contributor III

08-06-2024 9:09:21 PM

11 kudos

I recently ran into this MetadataChangedException. Watching the video @Hubert Dudek posted, it's pretty clear what is going on: it was built by object storage folks who weren't thinking like someone who builds relational database engines. That's to be expected. Dat...

6 More Replies

by ranged_coop • Valued Contributor II

06-20-2022 1:51:42 AM

17101 Views
22 replies
28 kudos

How to install Chromium Browser and Chrome Driver on DBX runtime 10.4 and above?

Hi Team, We are wondering if there is a recommended way to install the Chromium browser and Chrome driver on Databricks Runtime 10.4 and above? I have been through the site and have come across several links to this effect, but they all seem to be ins...

Latest Reply

Kaizen
Valued Contributor

02-13-2024 9:44:55 AM

28 kudos

Look into Playwright instead of Selenium. I went through the same process y'all went through here (ended up writing an init script to install the drivers, etc.). This is all done for you in Playwright. Refer to this post - I hope it helps!! https://communit...

21 More Replies

by ros • New Contributor III

05-31-2023 12:47:59 AM

1613 Views
2 replies
2 kudos

merge vs MERGE INTO

From the 10.4 LTS version we have low shuffle merge, so merge is faster. But what about the MERGE INTO statement that we run in a SQL notebook in Databricks? Is there any performance difference when we use the Databricks PySpark ".merge" function vs databricks...

Latest Reply

Anonymous
Not applicable

06-01-2023 12:10:35 AM

2 kudos

Hi @Roshan RC, Thank you for posting your question in our community! We are happy to assist you. To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answers you...

1 More Replies

by pepe • New Contributor II

03-09-2023 1:09:32 PM

8478 Views
2 replies
1 kudos

Why can't I install Python libraries when I update cluster runtime from 10.1 to 12.1?

This same question was asked here 9 months ago without any answer: https://community.databricks.com/s/question/0D58Y000096VjKrSAK/managedlibraryinstallfailed-when-changing-databricks-runtime-version-from-91-to-110 I was using runtime 9.1, and then upgr...

Latest Reply

Anonymous
Not applicable

03-31-2023 5:54:18 PM

1 kudos

Hi @JOSE RODRIGUEZ, Hope everything is going great. Just wanted to check in if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please tell us s...

1 More Replies

by powerus • New Contributor III

01-24-2023 1:12:04 AM

4876 Views
1 reply
0 kudos

Resolved! "Failure to initialize configurationInvalid configuration value detected for fs.azure.account.key" using com.databricks:spark-xml_2.12:0.12.0

Hi community, I'm trying to read XML data from Azure Datalake Gen 2 using com.databricks:spark-xml_2.12:0.12.0:
spark.read.format('XML').load('abfss://[CONTAINER]@[storageaccount].dfs.core.windows.net/PATH/TO/FILE.xml')
The code above gives the followin...

Latest Reply

powerus
New Contributor III

01-24-2023 4:43:25 AM

0 kudos

The issue was also raised here: https://github.com/databricks/spark-xml/issues/591
A fix is to use the "spark.hadoop" prefix in front of the fs.azure spark config keys:
spark.hadoop.fs.azure.account.oauth2.client.id.nubulosdpdlsdev01.dfs.core.windows.n...
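
The prefixing idea in this reply can be sketched in plain Python. This is only an illustration, not the poster's code: the helper name with_hadoop_prefix and the storage account "examplestorage" are hypothetical, and the values are placeholders. It just shows how a set of fs.azure.* keys might be rewritten before being pasted into a cluster's Spark config.

```python
def with_hadoop_prefix(conf):
    """Return a copy of conf with 'spark.hadoop.' prepended to fs.azure.* keys.

    Keys that do not start with 'fs.azure.' are copied unchanged.
    """
    prefix = "spark.hadoop."
    result = {}
    for key, value in conf.items():
        if key.startswith("fs.azure."):
            result[prefix + key] = value
        else:
            result[key] = value
    return result


# Hypothetical storage account name; values are placeholders, not real settings.
example_conf = {
    "fs.azure.account.auth.type.examplestorage.dfs.core.windows.net": "OAuth",
    "spark.sql.shuffle.partitions": "200",
}

prefixed = with_hadoop_prefix(example_conf)

# Print in the "key value" form used by a cluster's Spark config box.
for key, value in sorted(prefixed.items()):
    print(key, value)
```

Whether the prefixed keys resolve the error in a given workspace still depends on the runtime and library versions in use, so treat this as a starting point.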

by ranged_coop • Valued Contributor II

12-09-2022 4:18:29 AM

1612 Views
2 replies
3 kudos

Equivalent Machine Types between Databricks on Azure and GCP

Hi All, Hope everyone is doing well. We are currently validating Databricks on GCP and Azure. We have a Python notebook that does some ETL (copy, extract zip files, and process files within the zip files). Our cluster config on Azure: DBX Runtime - 10.4 - Dr...

Latest Reply

ranged_coop
Valued Contributor II

12-09-2022 5:26:04 AM

3 kudos

Hi @Tunde Abib, I have gone through the links while updating, but did not see any major documented slowdowns mentioned in them.

1 More Replies

by Anonymous • Not applicable

05-20-2022 2:31:45 AM

1651 Views
1 reply
1 kudos

Query silently failed

Hello all, I'm using the older 6.4 runtime and noticed that a query returns no result, whereas the same query on 10.4 provides the expected result. This is bad, because I got no error, simply no result at all. Are there some Spark settings on the clus...

Latest Reply

Anonymous
Not applicable

06-06-2022 5:58:51 AM

1 kudos

Hi @Alessio Palma, following up: did you get a chance to check @Kaniz Fatma's previous comments?

by Emiel_Smeenk • New Contributor III

04-12-2022 10:15:02 AM

12824 Views
5 replies
8 kudos

Resolved! Databricks Runtime 10.4 LTS - AnalysisException: No such struct field id in 0, 1 after upgrading

Hello, We are working to migrate to Databricks Runtime 10.4 LTS from 9.1 LTS, but we're running into weird behavioral issues. Our existing code works up until runtime 10.3, and in 10.4 it stopped working. Problem: We have a nested JSON file that we are fl...

Latest Reply

Emiel_Smeenk
New Contributor III

04-20-2022 8:59:22 AM

8 kudos

It seems like the issue was miraculously resolved. I did not make any code changes but everything is now running as expected. Maybe the latest runtime 10.4 fix released on April 19th also resolved this issue unintentionally.

4 More Replies
