If I do this

%sql
create or replace temporary view myview as
select * from silver.<schema>.<table>;
SHOW VIEWS;
select * from myview;

it works. But if I do the same on a Shared Compute it fails with [TABLE_OR_VIEW_NOT_FOUND] The table or view `myview` cannot...
Hi Team, Is it feasible to run PySpark cells concurrently in Databricks notebooks? If so, kindly provide instructions on how to accomplish this. We aim to execute the intermediate steps simultaneously. The given scenario entails the simultaneou...
Hi @Phani1, You can run PySpark cells concurrently in Databricks Notebooks.
To achieve this, consider the following approaches:
Using dbutils.notebook.run():
The simplest way is to utilize the dbutils.notebook.run() utility. You can call it from ...
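For reference, a minimal sketch of that pattern: each thread triggers one child notebook via dbutils.notebook.run(), so the steps run in parallel on the same cluster. The notebook paths, parameters, and timeout below are hypothetical placeholders.

from concurrent.futures import ThreadPoolExecutor

# Hypothetical child notebooks that perform the intermediate steps.
notebooks = [
    ("/Workspace/etl/step_a", {"run_date": "2024-01-01"}),
    ("/Workspace/etl/step_b", {"run_date": "2024-01-01"}),
    ("/Workspace/etl/step_c", {"run_date": "2024-01-01"}),
]

def run_notebook(path, params):
    # dbutils.notebook.run(path, timeout_seconds, arguments)
    return dbutils.notebook.run(path, 3600, params)

# Each thread kicks off one child notebook; they execute concurrently.
with ThreadPoolExecutor(max_workers=len(notebooks)) as pool:
    results = list(pool.map(lambda nb: run_notebook(*nb), notebooks))

print(results)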
Hi @DLL, It seems like there might be some confusion or an issue with how the dataset is being loaded or processed. Could you please provide more details about which columns are being dropped and how you are moving the dataset to a pandas DataFrame?
...
Just want to post this issue we're experiencing here in case other people are facing something similar. Below is the wording of the support ticket request I've raised: SQL code that has been working is suddenly failing due to syntax errors today. Ther...
We have around 1800 tables in Parquet format (Delta Lake). These tables are very big, and all 1800 of them have been converted into Delta tables. But we have a requirement to download them as CSV (from Power BI / any other reporting tool). Cu...
Hi @Madalian, In Power BI, you can directly export data from a visualization to a CSV file. Here’s how:
1. Select the visual you want to export data from.
2. Click the three dots (More options) and choose “Export data.”
3. Specify a location for the CSV file an...
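If the Power BI export route doesn't scale to 1800 large tables, one hedged alternative is to write CSV directly from Spark. This is only a sketch; the table name and output path are placeholder assumptions, not values from the thread.

# Sketch: export Delta tables to CSV with Spark.
# Table names and the output location are hypothetical placeholders.
tables = ["silver.sales.orders"]  # extend to the full list of 1800 tables

for t in tables:
    (spark.table(t)
        .coalesce(1)                      # single output file; avoid for very large tables
        .write.mode("overwrite")
        .option("header", "true")
        .csv(f"/mnt/exports/{t.replace('.', '_')}"))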
I am trying to schedule some jobs using workflows and leveraging dynamic variables. One caveat is that when I try to use {{job.start_time.[iso_date]}} it seems to be defaulted to UTC, is there a way to change it?
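The dynamic value references are rendered in UTC, so one hedged workaround is to pass the UTC value into the task as a job parameter and convert it there. In this sketch the parameter name start_time and the target timezone are example assumptions.

from datetime import datetime
from zoneinfo import ZoneInfo

# Hypothetical widget receiving the job's {{job.start_time.iso_datetime}} value.
utc_start = dbutils.widgets.get("start_time")  # e.g. "2024-01-01T05:00:00Z"

local_start = (datetime.fromisoformat(utc_start.replace("Z", "+00:00"))
               .astimezone(ZoneInfo("Australia/Sydney")))
local_date = local_start.date().isoformat()
print(local_date)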
Hi community, I am using a PySpark UDF. The function is being imported from a repo (in the Repos section) and registered as a UDF in the notebook. I am getting a PythonException error when the transformation is run. This is coming from the databric...
I was getting a similar error (full traceback below), and determined that it's related to this issue. Setting the env variables DATABRICKS_HOST and DATABRICKS_TOKEN as suggested in that Github issue resolved the problem for me (albeit it's not a grea...
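For reference, a minimal sketch of that workaround: set the two environment variables before the UDF runs. The workspace URL and secret scope/key names below are placeholders, not values from the thread.

import os

# Workaround from the linked GitHub issue: make the host and token
# available to the library that raised the error.
os.environ["DATABRICKS_HOST"] = "https://<workspace>.cloud.databricks.com"
os.environ["DATABRICKS_TOKEN"] = dbutils.secrets.get(scope="my-scope", key="my-token")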
I am utilizing Databricks via Azure, and I've been consistently experiencing an issue with the SQL Editor. The tab button, instead of indenting, redirects my cursor to seemingly random parts of the page. This problem has persisted since I began using...
Hi Team, My Databricks Certified Data Engineer Associate exam got suspended on 17th December and it is in an "in progress" state. I was continuously in front of the camera when suddenly the alert appeared, and the support person asked me to show the desk and ...
I have a Databricks workspace in GCP and I am using a cluster with Runtime 14.3 LTS (includes Apache Spark 3.5.0, Scala 2.12). I am trying to set the checkpoint directory location using the following command in a notebook:

spark.sparkContext.set...
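The truncated call is presumably setCheckpointDir; a minimal sketch, with the GCS bucket path as a placeholder assumption:

# Sketch: point RDD checkpointing at a cloud path the cluster can write to.
# The bucket name is a hypothetical placeholder.
spark.sparkContext.setCheckpointDir("gs://my-bucket/checkpoints")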
Join two system tables and get exactly how much USD you are spending. The short version of the query:

SELECT
  u.usage_date,
  u.sku_name,
  SUM(u.usage_quantity * p.pricing.default) AS total_spent,
  p.currency_code
FROM
  system.billing....
Hi, I am using Delta Live Tables in continuous mode for a real-time streaming data pipeline. After running the pipeline for 2-3 days I am getting this garbage collection error: Driver/10.15.0.73 paused the JVM process 68 seconds during the past 120 se...
I have two clusters: cluster A (a Spark cluster) and cluster B (a SQL warehouse). Whenever I run a particular query using cluster B, it works fine, but whenever I run the same query using cluster A, it takes a long time and never shows the output.
I have a notebook in Azure Databricks that does some transformations on a bronze tier table and inserts the transformed data into a silver tier table. This notebook is used to do an initial load of the data from our existing system into our new datal...
Please review your Spark UI from the old job execution versus the new job execution. You might want to check whether the data volume has increased; that could be the reason for the OOM.