Hello All,
In my Databricks workflows, I have three tasks configured, with the final task set to run only if the "ALL_DONE" condition is met. During the first deployment, I observed that the "ALL_DONE" dependency was correctly assigned to the last tas...
After updating my CLI, I successfully deployed the job from the Databricks CLI and it is functioning correctly. However, when attempting to deploy the same job using Azure DevOps, I still encounter the same issue.
In JupyterLab notebooks, we can:
- In edit mode, press Ctrl+Shift+Minus to split the current cell into two at the cursor position.
- In command mode, press A or B to add a cell Above or Below the current cell.
Are there equivalent shortcuts...
What's the status of the ctrl-alt-minus shortcut for splitting a cell? That keyboard combination does absolutely nothing in my interface (running Databricks via Chrome on GCP).
Hello,I am attempting to configure Autoloader in File Notification mode with Delta Live Tables. I configured an instance profile, but it is not working because I immediately get AWS access denied errors. This is the same issue that is referenced here...
Hi All,
Currently I am trying to connect to Databricks Unity Catalog from a Power Apps Dataflow using the Spark connector, specifying the HTTP URL and a Databricks personal access token as shown in the screenshot below. I am able to connect, but the issue is when...
I installed the newest version, databricks-connect==13.0.0. Now I get the issue: Command C:\Users\Y\AppData\Local\pypoetry\Cache\virtualenvs\X-py3.9\Lib\site-packages\pyspark\bin\spark-class2.cmd"" not found ("konnte nicht gefunden werden", i.e. could not be found). Traceback...
I am trying to schedule some jobs using workflows and leveraging dynamic variables. One caveat is that when I try to use {{job.start_time.[iso_date]}}, it seems to default to UTC; is there a way to change it?
Hi, all the dynamic values are in UTC (documentation).
Maybe you can use the code like the one presented below + pass the variables between tasks (see Share information between tasks in a Databricks job) ?
%python
from datetime import datetime, timedelta
from pyspark.sql import functions as F
from pyspark.sql import types as T
from pyspark.sql import DataFrame, Column
from pyspark.sql.types import Row
import dlt
S3_PATH = 's3://datalake-lab/XXXXX/'
S3_SCHEMA = 's3://datalake-lab/XXXXX/schemas/'
...
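Building on the suggestion above: since all dynamic value references are rendered in UTC, one option is to convert the timestamp inside the task itself. A minimal sketch, assuming the start time arrives as an ISO-format string task parameter and that Europe/Warsaw is the desired local zone (both are illustrative assumptions, not from the original job):

```python
from datetime import datetime
from zoneinfo import ZoneInfo  # standard library, Python 3.9+

# Hypothetical value of {{job.start_time.[iso_datetime]}}, passed in as a task parameter.
raw_start = "2024-05-01T06:30:00"

# Dynamic values are UTC, so attach the UTC zone explicitly before converting.
utc_start = datetime.fromisoformat(raw_start).replace(tzinfo=ZoneInfo("UTC"))
local_start = utc_start.astimezone(ZoneInfo("Europe/Warsaw"))

print(local_start.isoformat())  # 2024-05-01T08:30:00+02:00
```

The converted value can then be passed downstream with task values, as described in "Share information between tasks in a Databricks job".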
Just want to post this issue we're experiencing here in case other people are facing something similar. Below is the wording of the support ticket request I've raised: SQL code that has been working is suddenly failing due to syntax errors today. Ther...
Hi Team,
Please provide guidance on enabling parallel execution of SQL cells in a notebook containing multiple SQL cells. Currently, when we execute the notebook, all the SQL cells run sequentially. I would appreciate assistance on how to execute th...
Hi @Phani1,
Can you please explain your use case? Since Databricks notebooks support only sequential execution, we would have to look for a workaround, so it would be great if you could explain it in more detail. For now you can manually run multiple SQL cells, but it's not possib...
Hi Team,
We are currently planning to implement parallel execution of Databricks cell-level code through the Python threading library. We are interested in understanding the resource consumption and allocation process on the cluster. Are there any pot...
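To illustrate the threading approach described above: independent steps submitted from separate driver threads let the cluster scheduler run their Spark jobs concurrently. A minimal sketch, where the step function and step names are placeholders for the actual transformations (each would normally call `spark.sql(...)` or write a DataFrame):

```python
from concurrent.futures import ThreadPoolExecutor

def step(name: str) -> str:
    # Placeholder for real Spark work, e.g. spark.sql(f"INSERT INTO ... {name}")
    return f"{name} done"

steps = ["load_customers", "load_orders", "load_products"]

# Each step runs in its own driver thread; pool.map returns results in input order.
with ThreadPoolExecutor(max_workers=3) as pool:
    results = list(pool.map(step, steps))

print(results)  # ['load_customers done', 'load_orders done', 'load_products done']
```

Note the usual caveats: the threads share one SparkSession, so total resource usage is still bounded by the cluster, and a failure in one thread does not stop the others unless you check each future's result.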
If I do this:
%sql
create or replace temporary view myview as
select * from silver.<schema>.<table>;
SHOW VIEWS;
select * from myview;
It works. But if I do the same on a Shared Compute it fails with [TABLE_OR_VIEW_NOT_FOUND] The table or view `myview` cannot...
How, and how many, Databricks notebooks should be created to populate multiple silver Delta tables, all having different and complex transformations? What's the best practice:
1. Create a single reusable notebook for each silver table?
2. Push SQL trans...
You can find more information on that topic here.
"With Databricks, your serverless workloads are protected by multiple layers of security. These security layers form the foundation of Databricks’ commitment to providing a secure and reliable environ...
Hi Team,
Is it feasible to run PySpark cells concurrently in Databricks notebooks? If so, kindly provide instructions on how to accomplish this. We aim to execute the intermediate steps simultaneously. The given scenario entails the simultaneou...
Hi @Phani1, You can run PySpark cells concurrently in Databricks Notebooks.
To achieve this, consider the following approaches:
Using dbutils.notebook.run():
The simplest way is to utilize the dbutils.notebook.run() utility. You can call it from ...
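A common shape for this pattern is to call `dbutils.notebook.run()` from a thread pool on the driver. A hedged sketch: the notebook paths are illustrative assumptions, and the runner function is injected as a parameter so that inside Databricks you pass `dbutils.notebook.run` itself (the helper name `run_notebooks_in_parallel` is mine, not a Databricks API):

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List

def run_notebooks_in_parallel(
    run_fn: Callable[[str, int], str],  # signature of dbutils.notebook.run(path, timeout)
    paths: List[str],
    timeout_seconds: int = 3600,
    max_workers: int = 4,
) -> Dict[str, str]:
    """Run each notebook path in its own driver thread and collect exit values."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {path: pool.submit(run_fn, path, timeout_seconds) for path in paths}
        # .result() re-raises any exception from the child notebook run.
        return {path: fut.result() for path, fut in futures.items()}

# Inside a Databricks notebook (paths are hypothetical):
# results = run_notebooks_in_parallel(dbutils.notebook.run, ["/Shared/etl_a", "/Shared/etl_b"])
```

Each child notebook gets its own ephemeral run, so they do not share variables with the caller; results come back via each notebook's `dbutils.notebook.exit(...)` value.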
Hi @DLL, It seems like there might be some confusion or an issue with how the dataset is being loaded or processed. Could you please provide more details about which columns are being dropped and how you are moving the dataset to a pandas DataFrame?
...