Data Engineering

Forum Posts

Sorted by:

by najmead • Contributor

03-10-2023 3:30:27 AM

7960 Views
2 replies
1 kudos

Spark Settings in SQL Warehouse

I'm running a query, trying to parse a string into a map, and I get the following error;org.apache.spark.SparkRuntimeException: Duplicate map key was found, please check the input data. If you want to remove the duplicated keys, you can set "spark.s...

Data Engineering

7960 Views
2 replies
1 kudos

03-10-2023 3:30:27 AM

View Replies

Latest Reply

Anonymous
Not applicable

03-31-2023 5:15:38 PM

1 kudos

Hi @Nicholas Mead Thank you for your question! To assist you better, please take a moment to review the answer and let me know if it best fits your needs.Please help us select the best solution by clicking on "Select As Best" if it does.Your feedbac...

1 kudos

03-31-2023 5:15:38 PM

1 More Replies

by farooqurrehman • New Contributor

01-23-2023 8:59:17 AM

3232 Views
3 replies
2 kudos

Unable to connect/read files from ADLS Gen2 using account key

It gives error[RequestId=5e57b66f-b69f-4e8b-8706-3fe5baeb77a0 ErrorClass=METASTORE_DOES_NOT_EXIST] No metastore assigned for the current workspace.using the following codespark.conf.set( "fs.azure.account.key.mystorageaccount.dfs.core.windows.net", ...

Data Engineering

3232 Views
3 replies
2 kudos

01-23-2023 8:59:17 AM

View Replies

Latest Reply

jose_gonzalez
Databricks Employee

02-24-2023 3:36:46 PM

2 kudos

Hi @Farooq ur rehman,Just a friendly follow-up. Did any of the responses help you to resolve your question? if it did, please mark it as best. Otherwise, please let us know if you still need help.

2 kudos

02-24-2023 3:36:46 PM

2 More Replies

by KVNARK • Honored Contributor II

01-19-2023 10:05:39 PM

1414 Views
1 replies
5 kudos

accessing secret from spark cluster.

passing spark configuration to access blob, adls from data factory while creating job clusterit's working fine, but when in the property we are accessing secret it's not workingspark.hadoop.fs.azure.account.auth.type.{{secrets/scope/key}}.dfs.core.wi...

Data Engineering

1414 Views
1 replies
5 kudos

01-19-2023 10:05:39 PM

View Replies

Latest Reply

sher
Valued Contributor II

01-22-2023 4:43:00 AM

5 kudos

check here : https://docs.databricks.com/security/secrets/secrets.html

5 kudos

01-22-2023 4:43:00 AM

by hitesh1 • New Contributor III

08-17-2022 3:08:40 PM

10007 Views
1 replies
5 kudos

java.util.NoSuchElementException: key not found

Hello,We are using a Azure Databricks with Standard DS14_V2 Cluster with Runtime 9.1 LTS, Spark 3.1.2 and Scala 2.12 and facing the below issue frequently when running our ETL pipeline. As part of the operation that is failing there are several joins...

Data Engineering

10007 Views
1 replies
5 kudos

08-17-2022 3:08:40 PM

View Replies

Latest Reply

Aviral-Bhardwaj
Esteemed Contributor III

12-02-2022 1:53:14 AM

5 kudos

Hey man,Please use these configuration in your cluster and it will work,spark.sql.storeAssignmentPolicy LEGACYspark.sql.parquet.binaryAsString truespark.speculation falsespark.sql.legacy.timeParserPolicy LEGACYif it wont work let me know what problem...

5 kudos

12-02-2022 1:53:14 AM

by ramankr48 • Contributor II

11-09-2022 11:28:32 PM

4022 Views
2 replies
3 kudos

Issue with identity key column in databricks?

For the identity key I've used both GENERATED ALWAYS AS IDENTITY(start with 1 increment by 1) andGENERATED BY DEFAULT AS IDENTITY(start with 1 increment by 1)but in both cases, if I'm running my script once then it is fine (identity key is working as...

Data Engineering

4022 Views
2 replies
3 kudos

11-09-2022 11:28:32 PM

View Replies

Latest Reply

lizou
Contributor III

11-16-2022 1:25:29 PM

3 kudos

yes, by default option allow duplicated values per design.I will avoid this option and use only use GENERATED ALWAYS AS IDENTITY Using BY DEFAULT option is worse than not using it at all in BY Default option, If I forget to set starting value, the ID...

3 kudos

11-16-2022 1:25:29 PM

1 More Replies

by alejandrofm • Valued Contributor

03-07-2022 6:24:01 AM

5933 Views
3 replies
3 kudos

Resolved! Delta, the specified key does not exist error

Hi, I'm having this error too frequently on a few tables, I check on S3 and the partition exists and the file is there on the partition.error: Spectrum Scan Error: DeltaManifestcode: 15005context: Error fetching Delta Lake manifest delta/product/sub_...

Data Engineering

5933 Views
3 replies
3 kudos

03-07-2022 6:24:01 AM

View Replies

Latest Reply

alejandrofm
Valued Contributor

03-08-2022 5:07:55 AM

3 kudos

@Hubert Dudek , I'll add that sometimes, just running:GENERATE symlink_format_manifest FOR TABLE schema.tablesolves it, but, how can the symlink get broken?Thanks!

3 kudos

03-08-2022 5:07:55 AM

2 More Replies

by MoJaMa • Databricks Employee

06-28-2021 2:00:07 PM

1845 Views
0 replies
0 kudos

I'm a customer and enabled CMK for notebooks. How can I check that the encryption key is actually used?

Data Engineering

1845 Views
0 replies
0 kudos

06-28-2021 2:00:07 PM

Databricks Community

Spark Settings in SQL Warehouse

Unable to connect/read files from ADLS Gen2 using account key

accessing secret from spark cluster.

java.util.NoSuchElementException: key not found

Issue with identity key column in databricks?

Resolved! Delta, the specified key does not exist error

I'm a customer and enabled CMK for notebooks. How can I check that the encryption key is actually used?