Data Engineering

Forum Posts

Sorted by:

by kazinahian • New Contributor III

29 seconds ago

1 Views
0 replies
0 kudos

Lowcode ETL in Databricks

Hello everyone,I work as a Business Intelligence practitioner, employing tools like Alteryx or various low-code solutions to construct ETL processes and develop data pipelines for my Dashboards and reports. Currently, I'm delving into Azure Databrick...

Data Engineering

1 Views
0 replies
0 kudos

29 seconds ago

by chloeh • New Contributor II

28m ago

5 Views
0 replies
0 kudos

Chaining window aggregations in SQL

In my SQL data transformation pipeline, I'm doing chained/cascading window aggregations: for example, I want to do average over the last 5 minutes, then compute average over the past day on top of the 5 minute average, so that my aggregations are mor...

Data Engineering

5 Views
0 replies
0 kudos

28m ago

by Willianms98 • New Contributor

an hour ago

7 Views
0 replies
0 kudos

Semenax Review (May 2024): semenax does it work

Semenax has truly transformed my life. With its natural ingredients, I've experienced a boost in energy and confidence like never before. Incorporating it into my routine has made a noticeable difference in how I feel every day. Not only do I have mo...

Data Engineering

7 Views
0 replies
0 kudos

an hour ago

by Fresher • New Contributor II

2 hours ago

17 Views
0 replies
0 kudos

users are deleted/ unsynced from azure AD to databricks

In azure AD, it's shows users are synced to Databricks. But in Databricks, it's showing users is not a part of the group. The user is not part of only one group , he is part of remaining groups. All the syncing works fine till yesterday. I don't now ...

Data Engineering

17 Views
0 replies
0 kudos

2 hours ago

by Darian • Visitor

yesterday

61 Views
2 replies
0 kudos

Delta Live table getting error of garbage collection after running few days

Hi, i am using delta live table in continuous mode for a real time streaming data pipeline. After running the pipeline like 2-3 days i am getting this garbage collection error:Driver/10.15.0.73 paused the JVM process 68 seconds during the past 120 se...

Data Engineering

61 Views
2 replies
0 kudos

yesterday

View Replies

Latest Reply

Darian
Visitor

3 hours ago

0 kudos

Here are the metrics:The size/type:Thanks!

0 kudos

3 hours ago

1 More Replies

by al_joe • Contributor

02-05-2022 12:13:34 AM

5363 Views
5 replies
3 kudos

Resolved! Split a code cell at cursor position? Add a cell above/below?

In JupyterLab notebooks, we can --In edit mode, you can press Ctrl+Shift+Minus to split the current cell into two at the cursor position In command mode, you can click A or B to add a cell Above or Below the current cellare there equivalent shortcuts...

Data Engineering

5363 Views
5 replies
3 kudos

02-05-2022 12:13:34 AM

View Replies

Latest Reply

DavidKxx
New Contributor III

5 hours ago

3 kudos

What's the status of the ctrl-alt-minus shortcut for splitting a cell? That keyboard combination does absolutely nothing in my interface (running Databricks via Chrome on GCP).

3 kudos

5 hours ago

4 More Replies

by Sikki • New Contributor

Friday

196 Views
7 replies
0 kudos

Databricks Asset Bundle Workflow Redeployment Issue

Hello All,In my Databricks workflows, I have three tasks configured, with the final task set to run only if the condition "ALL_DONE" is met. During the first deployment, I observed that the dependency "ALL_DONE" was correctly assigned to the last tas...

Data Engineering

196 Views
7 replies
0 kudos

Friday

View Replies

Latest Reply

Sikki
New Contributor

4 hours ago

0 kudos

After updating my CLI, I successfully deployed the job from Databricks CLI and it is functioning correctly. However, when attempting to deploy the same job using Azure DevOps, I encounter the same issue.

0 kudos

4 hours ago

6 More Replies

by jaredrohe • New Contributor II

10-26-2023 7:35:09 PM

1353 Views
3 replies
1 kudos

Instance Profiles Do Not Work with Delta Live Tables Default Cluster Policy Access Mode "Shared"

Hello,I am attempting to configure Autoloader in File Notification mode with Delta Live Tables. I configured an instance profile, but it is not working because I immediately get AWS access denied errors. This is the same issue that is referenced here...

Data Engineering

Access Mode

Delta Live Tables

Instance Profiles

No Isolation Shared

1353 Views
3 replies
1 kudos

10-26-2023 7:35:09 PM

View Replies

Latest Reply

djhs
New Contributor III

5 hours ago

1 kudos

Hi, I'm running into the same issue. Was this solved?

1 kudos

5 hours ago

2 More Replies

by mickniz • Contributor

5 hours ago

25 Views
0 replies
0 kudos

Connect to Databricks from PowerApps

Hi All,Currently I trying to connect databricks Unity Catalog from Powerapps Dataflow by using spark connector specifying http url and using databricks personal access token as specified in below screenshot: I am able to connect but the issue is when...

Data Engineering

25 Views
0 replies
0 kudos

5 hours ago

by Lazloo • New Contributor III

05-08-2023 2:00:59 AM

5232 Views
6 replies
4 kudos

databricks-connect version 13: spark-class2.cmd not found

I install the newest version "databricks-connect==13.0.0". Now get the issue Command C:\Users\Y\AppData\Local\pypoetry\Cache\virtualenvs\X-py3.9\Lib\site-packages\pyspark\bin\spark-class2.cmd"" not found konnte nicht gefunden werden. Traceback...

Data Engineering

5232 Views
6 replies
4 kudos

05-08-2023 2:00:59 AM

View Replies

Latest Reply

Susumu_Asaga
New Contributor II

5 hours ago

4 kudos

Use this code:from databricks.connect import DatabricksSession spark = DatabricksSession.builder.getOrCreate()

4 kudos

5 hours ago

5 More Replies

by QuantumFries • Visitor

yesterday

55 Views
2 replies
1 kudos

Change {{job.start_time.[iso_date]}} Timezone

I am trying to schedule some jobs using workflows and leveraging dynamic variables. One caveat is that when I try to use {{job.start_time.[iso_date]}} it seems to be defaulted to UTC, is there a way to change it?

Data Engineering

55 Views
2 replies
1 kudos

yesterday

View Replies

Latest Reply

artsheiko
Valued Contributor III

5 hours ago

1 kudos

Hi, all the dynamic values are in UTC (documentation). Maybe you can use the code like the one presented below + pass the variables between tasks (see Share information between tasks in a Databricks job) ? %python from datetime import datetime, timed...

1 kudos

5 hours ago

1 More Replies

by rt-slowth • Contributor

01-10-2024 6:33:50 PM

749 Views
3 replies
2 kudos

AutoLoader File notification mode Configuration with AWS

from pyspark.sql import functions as F from pyspark.sql import types as T from pyspark.sql import DataFrame, Column from pyspark.sql.types import Row import dlt S3_PATH = 's3://datalake-lab/XXXXX/' S3_SCHEMA = 's3://datalake-lab/XXXXX/schemas/' ...

Data Engineering

749 Views
3 replies
2 kudos

01-10-2024 6:33:50 PM

View Replies

Latest Reply

djhs
New Contributor III

5 hours ago

2 kudos

Was this resolved? I run into the same issue

2 kudos

5 hours ago

2 More Replies

by Kayl669 • New Contributor III

9 hours ago

78 Views
4 replies
0 kudos

SQL code against tables with '>' in headers suddenly failing?

Just want to post this issue we're experiencing here in case other people are facing something similar. Below is the wording of the support ticket request I've raised:SQL code that has been working is suddenly failing due to syntax errors today. Ther...

Data Engineering

78 Views
4 replies
0 kudos

9 hours ago

View Replies

Latest Reply

Kayl669
New Contributor III

6 hours ago

0 kudos

This was code that was working yesterday? You have to use backticks because the field names of the table contain characters akin to spaces?

0 kudos

6 hours ago

3 More Replies

by Phani1 • Valued Contributor

7 hours ago

68 Views
2 replies
0 kudos

Parallel execution of SQL cell in Databricks Notebooks

Hi Team,Please provide guidance on enabling SQL cells parallel execution in a notebook containing multiple SQL cells. Currently, when we execute notebook and all the SQL cells they run sequentially. I would appreciate assistance on how to execute th...

Data Engineering

delta

68 Views
2 replies
0 kudos

7 hours ago

View Replies

Latest Reply

Ajay-Pandey
Esteemed Contributor III

6 hours ago

0 kudos

Hi @Phani1 ,Can you please explain your usecase as databricks notebook support the sequential executions we have to look for workaround so it will great if you can explain it more.For now you can manually run multiple cell for sql but it's not possib...

0 kudos

6 hours ago

1 More Replies

by Phani1 • Valued Contributor

6 hours ago

32 Views
0 replies
0 kudos

Databricks cell-level code parallel execution through the Python threading library

Hi Team,We are currently planning to implement Databricks cell-level code parallel execution through the Python threading library. We are interested in comprehending the resource consumption and allocation process from the cluster. Are there any pot...

Data Engineering

delta

32 Views
0 replies
0 kudos

6 hours ago

User

Count

1601

736

343

284

247

Databricks

Forum Posts

Lowcode ETL in Databricks

Chaining window aggregations in SQL

Semenax Review (May 2024): semenax does it work

users are deleted/ unsynced from azure AD to databricks

Delta Live table getting error of garbage collection after running few days

Resolved! Split a code cell at cursor position? Add a cell above/below?

Databricks Asset Bundle Workflow Redeployment Issue

Instance Profiles Do Not Work with Delta Live Tables Default Cluster Policy Access Mode "Shared"

Connect to Databricks from PowerApps

databricks-connect version 13: spark-class2.cmd not found

Change {{job.start_time.[iso_date]}} Timezone

AutoLoader File notification mode Configuration with AWS

SQL code against tables with '>' in headers suddenly failing?

Parallel execution of SQL cell in Databricks Notebooks

Databricks cell-level code parallel execution through the Python threading library

Best way to parse Google Analytics data in Databri...

DELTA_EXCEED_CHAR_VARCHAR_LIMIT

Not able to set run_as service_principal_name

Pyspark operations slowness in CLuster 14.3LTS as ...

[Databricks Assets Bundles] Workflow trigger on fi...