Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

Anand13
by New Contributor II
  • 1352 Views
  • 2 replies
  • 0 kudos

Getting a concurrency issue on a Delta table using liquid clustering

In our project, we are testing liquid clustering using a test table called status_update, where we need to update the status for different market IDs. We are attempting to update the status_update table in parallel using the UPDATE command. ALTER TABL...

Latest Reply
Anand13
New Contributor II
  • 0 kudos

@Walter_C We are using Liquid Clustering as our first strategy. Our Databricks Runtime is 13.3, and we have a table named status_update containing approximately 30 market IDs, each with a single record. In our pipeline, if any market fails, we need t...

1 More Replies
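The concurrency problem described in this thread is usually handled by retrying the UPDATE when Delta reports a write conflict. Below is a minimal sketch of that retry pattern in plain Python; the exception class is a stand-in (on Databricks the real `ConcurrentAppendException` is raised by the Delta layer), and `run_update` would wrap something like a `spark.sql("UPDATE status_update ...")` call:

```python
import time
import random

class ConcurrentAppendException(Exception):
    """Stand-in for Delta's concurrency error; on Databricks this is
    raised when two UPDATEs touch overlapping files."""

def update_with_retry(run_update, max_attempts=5, base_delay=0.01):
    # Retry the update with exponential backoff plus jitter, a common
    # pattern for surviving concurrent-write conflicts on Delta tables.
    for attempt in range(1, max_attempts + 1):
        try:
            return run_update()
        except ConcurrentAppendException:
            if attempt == max_attempts:
                raise
            time.sleep(base_delay * (2 ** attempt) + random.uniform(0, base_delay))

# Simulated update that fails twice before succeeding.
attempts = {"n": 0}
def fake_update():
    attempts["n"] += 1
    if attempts["n"] < 3:
        raise ConcurrentAppendException()
    return "updated"

result = update_with_retry(fake_update)
```

Liquid clustering narrows conflicts compared to unpartitioned tables, but it does not eliminate them, so a retry wrapper like this is still worth having around parallel market-ID updates.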
amarnadh-gadde
by New Contributor II
  • 2101 Views
  • 6 replies
  • 0 kudos

Default catalog created incorrectly on my workspace

We have provisioned a new Databricks account and workspace on the premium plan. When we built out the workspace using Terraform, we expected to see a default catalog matching the workspace name as per this documentation. However, I don't see it. All I see are the 3 c...

Latest Reply
loic
Contributor
  • 0 kudos

Hello, while trying to get help on Databricks default catalog behavior, I found this topic. If I can give my advice here, one reason I see for the behavior @amarnadh-gadde describes is that you deployed your new workspace in a region where there is already...

5 More Replies
seefoods
by Valued Contributor
  • 2949 Views
  • 6 replies
  • 7 kudos

Resolved! autoloader strategy write ( APPEND, MERGE, UPDATE, COMPLETE, OVERWRITE)

Hello guys, I want to know if operations like overwrite, merge, and update in static writes behave the same when we use Auto Loader. I'm confused about the behavior of modes like complete, update, and append. After that, I want to know what the co...

Latest Reply
chanukya-pekala
Contributor III
  • 7 kudos

Thanks for the discussion. I have a tiny suggestion. Based on my experience working with streaming loads, I often find the checkpoint location hard to work with when I need to inspect the offset information or delete that directory for a fresh load of data. Hence I h...

5 More Replies
SatyaKoduri
by New Contributor II
  • 1555 Views
  • 1 replies
  • 1 kudos

Resolved! Yaml file to Dataframe

Hi, I'm trying to read YAML files using pyyaml and convert them into a Spark DataFrame with createDataFrame, without specifying a schema—allowing flexibility for potential YAML schema changes over time. This approach worked as expected on Databricks ...

Latest Reply
lingareddy_Alva
Honored Contributor III
  • 1 kudos

Hi @SatyaKoduri This is a known issue with newer Spark versions (3.5+) that came with Databricks Runtime 15.4. The schema inference has become more strict and struggles with deeply nested structures like your YAML's nested maps. Here are a few solution...

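A common workaround for strict schema inference on nested YAML, in the spirit of the reply above, is to serialize the nested parts to JSON strings before calling `createDataFrame`, so Spark only sees flat string columns. A stdlib-only sketch of that flattening step, with a literal dict standing in for what `yaml.safe_load` would return:

```python
import json

def stringify_nested(record):
    # Convert nested dicts/lists to JSON strings so Spark's schema
    # inference sees flat string columns instead of deep nested maps.
    return {
        k: (json.dumps(v, sort_keys=True) if isinstance(v, (dict, list)) else v)
        for k, v in record.items()
    }

# A dict like yaml.safe_load would return for a nested YAML document
# (names here are illustrative).
record = {"name": "pipeline_a", "config": {"retries": 3, "targets": ["bronze", "silver"]}}
flat = stringify_nested(record)
```

On Databricks you would then call `spark.createDataFrame([flat])` and, where needed, parse the JSON columns back out with `from_json` once you know the shape you want.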
tuckera
by New Contributor
  • 477 Views
  • 1 replies
  • 0 kudos

Governance in pipelines

How does everyone track and deploy their pipelines and generated data assets? DABs? Terraform? Manual? Something else entirely?

Latest Reply
lingareddy_Alva
Honored Contributor III
  • 0 kudos

Hi @tuckera The data engineering landscape shows a pretty diverse mix of approaches for tracking and deploying pipelines and data assets, often varying by company size, maturity, and specific needs. Infrastructure as Code (IaC) tools like Terraform an...

Edoa
by New Contributor
  • 1348 Views
  • 1 replies
  • 0 kudos

SFTP Connection Timeout on Job Cluster but Works on Serverless Compute

Hi all, I'm experiencing inconsistent behavior when connecting to an SFTP server using Paramiko in Databricks. When I run the code on Serverless Compute, the connection to xxx.yyy.com via SFTP works correctly. When I run the same code on a Job Cluster, ...

Latest Reply
lingareddy_Alva
Honored Contributor III
  • 0 kudos

Hi @Edoa This is a common networking issue in Databricks related to the different network configurations between Serverless Compute and Job Clusters. Here are the key differences and potential solutions. Root cause: Serverless Compute runs in Databricks'...

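When a Job Cluster times out where Serverless succeeds, a quick TCP reachability probe from the cluster usually confirms that the problem is network egress (VNet/NSG/firewall rules) rather than Paramiko itself. A small stdlib sketch; the host and port below are placeholders:

```python
import socket

def can_reach(host, port, timeout=2.0):
    # Quick TCP reachability check to run from a notebook on the cluster
    # before attempting a Paramiko SFTP connection. A False here points
    # at networking (egress rules, firewall, NAT), not at Paramiko.
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False

# Probe a port where nothing should be listening, so the check
# demonstrably returns False quickly instead of hanging.
reachable = can_reach("127.0.0.1", 1, timeout=1.0)
```

Running `can_reach("xxx.yyy.com", 22)` on both compute types makes the difference in network paths visible in seconds, before any SFTP-level debugging.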
oeztuerk82
by New Contributor II
  • 1218 Views
  • 2 replies
  • 3 kudos

Deletion of Resource Group on Azure and Impact on Databricks Workspace

Hello everyone, I would like to confirm the data retention and deletion behavior associated with an Azure Databricks workspace, particularly in the context of deleting the Azure resource group that a Databricks workspace resides in. Recently, I deleted an...

Latest Reply
SAKBAR
New Contributor II
  • 3 kudos

Once a resource group is deleted it cannot be recovered, just like ADLS, so it is not possible to restore the workspace or any resource under the resource group. Maybe Microsoft support can recover it if you are under a premium plan with them. For the future, it is always bet...

1 More Replies
DarioB
by New Contributor III
  • 1627 Views
  • 1 replies
  • 1 kudos

Resolved! DAB for_each_task - Passing task values

I am trying to deploy a job with a for_each_task using DAB and Terraform, and I am unable to properly pass the task value into the subsequent task. These are my job task definitions in the YAML: tasks: - task_key: FS_batching job_c...

Latest Reply
DarioB
New Contributor III
  • 1 kudos

We have been testing and found the issue (I just realized that my anonymization of the names removed the source of the error). We have tracked it down to the inputs parameter of the for_each_task. It seems that it is unable to reference task names with das...

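For reference, a sketch of the bundle shape involved, with the task keys renamed to use underscores, which is the fix the reply above points at. The notebook paths and value names here are illustrative, and the `{{tasks.<task_key>.values.<key>}}` reference reflects my understanding of how task values are consumed by a for_each_task:

```yaml
tasks:
  - task_key: fs_batching          # underscores avoid the dash problem
    notebook_task:
      notebook_path: ./notebooks/fs_batching.py
  - task_key: fan_out
    depends_on:
      - task_key: fs_batching
    for_each_task:
      # Consumes the value set upstream via dbutils.jobs.taskValues.set(...)
      inputs: "{{tasks.fs_batching.values.batches}}"
      task:
        task_key: fan_out_iteration
        notebook_task:
          notebook_path: ./notebooks/process_batch.py
```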
alonisser
by Contributor II
  • 1524 Views
  • 4 replies
  • 0 kudos

Controlling the name of the downloaded csv file from a notebook

I have a notebook with multiple display() commands in various cells; the users are currently downloading the result CSV from each cell. I want the downloads to be named after the name of the cell (or any other method I can use to make each download have a dif...

Latest Reply
Isi
Honored Contributor III
  • 0 kudos

Hey @alonisser Once the file is stored in the volume, whether in S3, GCS, or ADLS, you'll be able to see it with a custom name defined by the customer or project. Additionally, the files may be saved in different folders, making it easier to identify...

3 More Replies
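The approach in the reply, saving each cell's result to a volume under a chosen filename instead of relying on the display() download button, can be sketched with the stdlib csv module. The function and its arguments are illustrative, and a temp directory stands in for a /Volumes path so the sketch is self-contained:

```python
import csv
import os
import tempfile

def export_csv(rows, header, directory, name):
    # Write rows to <directory>/<name>.csv. On Databricks, point
    # `directory` at a Unity Catalog volume path so each cell's result
    # gets a predictable, distinct filename instead of a browser default.
    path = os.path.join(directory, f"{name}.csv")
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(header)
        writer.writerows(rows)
    return path

# Stand-in for a volume path; one named export per logical "cell".
tmp = tempfile.mkdtemp()
path = export_csv([("a", 1), ("b", 2)], ("key", "value"), tmp, "market_status")
```

For large results you would write with Spark instead of the csv module, but the naming idea is the same: the code, not the download widget, decides the filename.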
korasino
by New Contributor II
  • 1185 Views
  • 2 replies
  • 0 kudos

Photon and Predictive I/O vs. Liquid Clustering

Hi, quick question about optimizing our Delta tables: Photon and Predictive I/O vs. Liquid Clustering (LC). We have UUIDv4 columns (random, high-cardinality) used in both WHERE uuid = … filters and joins. From what I understand, Photon (on Serverless wa...

Latest Reply
korasino
New Contributor II
  • 0 kudos

Hey, thanks for the reply. Could you share some documentation links around those bullet points in your answer? thanks!

1 More Replies
seefoods
by Valued Contributor
  • 1581 Views
  • 3 replies
  • 1 kudos

Resolved! build autoloader pyspark job

Hello guys, I have built an ETL in PySpark which uses Auto Loader. I want to know what the best way to use Auto Loader on Databricks is. What is the best way to vacuum checkpoint files on /Volumes? Hope to hear your ideas about that. Cordially,

Latest Reply
seefoods
Valued Contributor
  • 1 kudos

Hello @intuz, thanks for your reply. Cordially

2 More Replies
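On the checkpoint-cleanup part of the question: deleting an Auto Loader checkpoint directory resets the stream's progress, so it should only be done deliberately for a full reload, never as routine housekeeping. A stdlib sketch of that reset, with a temp directory standing in for a /Volumes checkpoint path:

```python
import os
import shutil
import tempfile

def reset_checkpoint(checkpoint_dir):
    # Removing the checkpoint directory forces Auto Loader to re-discover
    # and reprocess all input files on the next run. Only do this when a
    # full reload is intended, and only after the stream is stopped.
    if os.path.isdir(checkpoint_dir):
        shutil.rmtree(checkpoint_dir)

# Throwaway directory standing in for a /Volumes/.../checkpoints path.
cp = os.path.join(tempfile.mkdtemp(), "checkpoints", "orders_stream")
os.makedirs(os.path.join(cp, "offsets"))
reset_checkpoint(cp)
```

For normal operation, leave the checkpoint alone and let the stream manage it; a fresh checkpoint path per logical pipeline (one stream, one checkpoint) keeps resets safe and scoped.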
yathish
by New Contributor II
  • 3877 Views
  • 6 replies
  • 0 kudos

upstream request timeout in databricks apps when using databricks sql connector

Hi, I am building an application in Databricks Apps. Sometimes when I try to fetch data using the Databricks SQL connector in an API, it takes time to hit the SQL warehouse, and if the time exceeds 60 seconds it gives an upstream timeout error. I h...

Latest Reply
epistoteles
New Contributor II
  • 0 kudos

@Alberto_Umana Any news on this? I am having similar issues and am also using a (running) serverless SQL warehouse.

5 More Replies
EndreM
by New Contributor III
  • 2691 Views
  • 8 replies
  • 1 kudos

Replaying a stream after converting to liquid clustering fails

I have a problem replaying a stream. I need to replay it because the conversion from liquid clustering to partitioning doesn't work. I see a lot of garbage collection, and memory maxes out immediately. Then the driver restarts. To debug the problem I try to force only ...

Latest Reply
EndreM
New Contributor III
  • 1 kudos

After increasing the compute to one with 500 GB of memory, the job was able to transfer ca. 300 GB of data, but it produced a large number of files, 26,000. The old table with partitioning and no liquid clustering had 4,000 files with a total of 1.2 TB of ...

7 More Replies
anilsampson
by New Contributor III
  • 771 Views
  • 1 replies
  • 0 kudos

Resolved! databricks dashboard via workflow task.

Hello, I am trying to trigger a Databricks dashboard via a workflow task. 1. When I deploy the job triggering the dashboard task via a local "Deploy bundle" command, deployment is successful. 2. When I try to deploy to a different environment via CI/CD, while ...

Latest Reply
anilsampson
New Contributor III
  • 0 kudos

I think I figured out the issue; it had to do with the version of the CLI. I updated the CI/CD to use the latest version of the CLI: curl -fsSL https://raw.githubusercontent.com/databricks/setup-cli/main/install.sh

arendon
by New Contributor II
  • 1361 Views
  • 2 replies
  • 1 kudos

Resolved! Asset Bundles: How to mute job failure notifications until final retry?

I'm trying to configure a job to only send failure notifications on the final retry failure (not on intermediate retry failures). This feature is available in the Databricks UI as "Mute notifications until the last retry", but I can't get this to wor...

Latest Reply
arendon
New Contributor II
  • 1 kudos

Thank you for the response, @lingareddy_Alva! I'll take a look at the workarounds you shared.

1 More Replies