Data Engineering

Forum Posts

Sorted by:

by Ameshj • Visitor

4m ago

0 Views
0 replies
0 kudos

Dbfs init script migration

I need help with migrating from dbfs on databricks to workspace. I am new to databricks and am struggling with what is on the links provided.My workspace.yml also has dbfs hard-codedThis was done by an external vendor.

Data Engineering

Azure Databricks

dbfs

0 Views
0 replies
0 kudos

4m ago

by mrtellerz • Visitor

yesterday

26 Views
1 replies
0 kudos

Parallel execution of SQL cell in Databricks Notebooks

Hi Team,Please provide guidance on enabling SQL cells parallel execution in a notebook containing multiple SQL cells. Currently, when we execute notebook and all the SQL cells they run sequentially. I would appreciate assistance on how to execute th...

Data Engineering

26 Views
1 replies
0 kudos

yesterday

View Replies

Latest Reply

Yeshwanth
Valued Contributor

yesterday

0 kudos

Hi @mrtellerz good day! I believe this document might help you: https://learn.microsoft.com/en-us/azure/databricks/notebooks/notebooks-code#--execute-sql-cells-in-parallel Best regards,

0 kudos

yesterday

by Sikki • New Contributor

Friday

215 Views
8 replies
0 kudos

Databricks Asset Bundle Workflow Redeployment Issue

Hello All,In my Databricks workflows, I have three tasks configured, with the final task set to run only if the condition "ALL_DONE" is met. During the first deployment, I observed that the dependency "ALL_DONE" was correctly assigned to the last tas...

Data Engineering

215 Views
8 replies
0 kudos

Friday

View Replies

Latest Reply

Yeshwanth
Valued Contributor

yesterday

0 kudos

@Sikki thanks for confirming. For Azure DevOps please check the version of Databricks CLI installed.

0 kudos

yesterday

7 More Replies

by Madalian • New Contributor III

yesterday

91 Views
3 replies
1 kudos

DownLoad CSV files from Delta Lake

We have around 1800 tables in Parq format (Delta Lake). These 1800 tables are very big, we have all these 1800 tables are converted into tables. But we have a requirement that, we need to download in CSV. (from PowerBI / any other reporting tool). Cu...

Data Engineering

91 Views
3 replies
1 kudos

yesterday

View Replies

Latest Reply

Madalian
New Contributor III

yesterday

1 kudos

Dear Kaniz,Thank you. one doubt 1) converting tables data into CSV and saving again one more of storage layer.IS there any way on fly we can convert these tables into CSV's. and export into PowerBI? and again i see in powerBI has limitations around >...

1 kudos

yesterday

2 More Replies

by Phani1 • Valued Contributor

yesterday

109 Views
3 replies
0 kudos

Parallel execution of SQL cell in Databricks Notebooks

Data Engineering

delta

109 Views
3 replies
0 kudos

yesterday

View Replies

Latest Reply

Ajay-Pandey
Esteemed Contributor III

yesterday

0 kudos

Hi @Phani1 ,Can you please explain your usecase as databricks notebook support the sequential executions we have to look for workaround so it will great if you can explain it more.For now you can manually run multiple cell for sql but it's not possib...

0 kudos

yesterday

2 More Replies

by dashawn • New Contributor

2 weeks ago

136 Views
2 replies
0 kudos

DLT Pipeline Error Handling

Hello all.We are a new team implementing DLT and have setup a number of tables in a pipeline loading from s3 with UC as the target. I'm noticing that if any of the 20 or so tables fail to load, the entire pipeline fails even when there are no depende...

Data Engineering

Delta Live Tables

136 Views
2 replies
0 kudos

2 weeks ago

View Replies

Latest Reply

jose_gonzalez
Moderator

yesterday

0 kudos

Thank you for sharing this @Kaniz. @dashawn did you were able to check Kaniz's docs? do you still need help or shall you accept Kaniz's solution?

0 kudos

yesterday

1 More Replies

by Hubert-Dudek • Esteemed Contributor III

a week ago

263 Views
2 replies
1 kudos

The star inside WHERE

The star (*) can be used inside the WHERE clause in #Databricks as of runtime version 15.

Data Engineering

263 Views
2 replies
1 kudos

a week ago

View Replies

Latest Reply

jose_gonzalez
Moderator

yesterday

1 kudos

Thanks a lot for sharing this information @Hubert-Dudek

1 kudos

yesterday

1 More Replies

by jenshumrich • New Contributor III

3 weeks ago

334 Views
3 replies
0 kudos

Filter not using partition

I have the following code:spark.sparkContext.setCheckpointDir("dbfs:/mnt/lifestrategy-blob/checkpoints") result_df.repartitionByRange(200, "IdStation") result_df_checked = result_df.checkpoint(eager=True) unique_stations = result_df.select("IdStation...

Data Engineering

334 Views
3 replies
0 kudos

3 weeks ago

View Replies

Latest Reply

jose_gonzalez
Moderator

yesterday

0 kudos

it seems like there is a filter being apply according to this. Filter (isnotnull(IdStation#2678) AND (IdStation#2678 = 1119844)) I would like to share the following notebook that covers in detail this topic, in case you would like to check it out h...

0 kudos

yesterday

2 More Replies

by MarcusC • Visitor

yesterday

188 Views
4 replies
0 kudos

Temporary views no longer working for Share Compute

If I do this%sqlcreate or replace temporary view myviewasselect * from silver.<schema>.<table>;SHOW VIEWS;select * from myview;It works. But if I do the same on a Shared Compute it fails with[TABLE_OR_VIEW_NOT_FOUND] The table or view `myview` cannot...

Data Engineering

188 Views
4 replies
0 kudos

yesterday

View Replies

Latest Reply

jose_gonzalez
Moderator

yesterday

0 kudos

which DBR version are you using?

0 kudos

yesterday

3 More Replies

by EhsanSaba • New Contributor

Wednesday

337 Views
1 replies
0 kudos

RocksDB results in empty stream/stream joins dataframe

Since we enable RocksDB in our spark.conf the stream to stream joins/unions results in empty dataframe, does anyone else have the same experience? it is on AWSspark.conf.set("spark.sql.streaming.stateStore.providerClass","com.databricks.sql.streaming...

Data Engineering

337 Views
1 replies
0 kudos

Wednesday

View Replies

Latest Reply

jose_gonzalez
Moderator

yesterday

0 kudos

Did you also update the checkpoint? You might need to use a new checkpoint after you enable the RocksDB state store.

0 kudos

yesterday

by Brammer88 • New Contributor III

2 weeks ago

446 Views
6 replies
2 kudos

Trying to run databricks academy labs, but execution fails due to method to clearcache not whilelist

Hi there,Im trying to run DE 2.1 - Querying Files Directly on my workspace with a default cluster configuration for found below,but I cannot seem to run this file (or any other labs) as it gives me this error message Resetting the learning environme...

Data Engineering

446 Views
6 replies
2 kudos

2 weeks ago

View Replies

Latest Reply

Brammer88
New Contributor III

yesterday

2 kudos

Hi @Kaniz and databricks team,Did you already found some other solution for this? Thanks,Bram

2 kudos

yesterday

5 More Replies

by AdityaM • Visitor

yesterday

44 Views
0 replies
0 kudos

Creating external tables using gzipped CSV file - S3 URI without extensions

Hi Databricks community,Hope you are doing well.I am trying to create an external table using a Gzipped CSV file uploaded to an S3 bucket.The S3 URI of the resource doesn't have any file extensions, but the content of the file is a Gzipped comma sepa...

Data Engineering

44 Views
0 replies
0 kudos

yesterday

by Mailendiran • New Contributor II

Saturday

138 Views
2 replies
0 kudos

Unity Catalog - Storage Account Data Access

I was exploring on unity catalog option on Databricks premium workspace.I understood that i need to create storage account credentials and external connection in workspace.Later, i can access the cloud data using 'abfss://storage_account_details' .I ...

Data Engineering

138 Views
2 replies
0 kudos

Saturday

View Replies

Latest Reply

DouglasMoore
New Contributor II

yesterday

0 kudos

Databricks strategic direction is to deprecate mount points in favor of Unity Catalog Volumes.Setup an STORAGE CREDENTIAL and EXTERNAL LOCATION to access and define how to get to your cloud storage account. To access data on the account, define a Tab...

0 kudos

yesterday

1 More Replies

by kazinahian • New Contributor III

yesterday

46 Views
0 replies
0 kudos

Lowcode ETL in Databricks

Hello everyone,I work as a Business Intelligence practitioner, employing tools like Alteryx or various low-code solutions to construct ETL processes and develop data pipelines for my Dashboards and reports. Currently, I'm delving into Azure Databrick...

Data Engineering

46 Views
0 replies
0 kudos

yesterday

by chloeh • New Contributor II

yesterday

36 Views
0 replies
0 kudos

Chaining window aggregations in SQL

In my SQL data transformation pipeline, I'm doing chained/cascading window aggregations: for example, I want to do average over the last 5 minutes, then compute average over the past day on top of the 5 minute average, so that my aggregations are mor...

Data Engineering

36 Views
0 replies
0 kudos

yesterday

User

Count

1602

736

343

284

247

Databricks

Forum Posts

Dbfs init script migration

Parallel execution of SQL cell in Databricks Notebooks

Databricks Asset Bundle Workflow Redeployment Issue

DownLoad CSV files from Delta Lake

Parallel execution of SQL cell in Databricks Notebooks

DLT Pipeline Error Handling

The star inside WHERE

Filter not using partition

Temporary views no longer working for Share Compute

RocksDB results in empty stream/stream joins dataframe

Trying to run databricks academy labs, but execution fails due to method to clearcache not whilelist

Creating external tables using gzipped CSV file - S3 URI without extensions

Unity Catalog - Storage Account Data Access

Lowcode ETL in Databricks

Chaining window aggregations in SQL

Best way to parse Google Analytics data in Databri...

DELTA_EXCEED_CHAR_VARCHAR_LIMIT

Not able to set run_as service_principal_name

Pyspark operations slowness in CLuster 14.3LTS as ...

[Databricks Assets Bundles] Workflow trigger on fi...