Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

Mohit_m
by Valued Contributor II
  • 4010 Views
  • 2 replies
  • 3 kudos

Resolved! Could not initialize class error

User is running a job triggered from ADF in Databricks. In this job they need to use custom libraries that are in jars. Most of the time the jobs run fine, however sometimes a job fails with: java.lang.NoClassDefFoundError: Could not initialize... Any s...

Latest Reply
Mohit_m
Valued Contributor II
  • 3 kudos

Can you please check if there is more than one jar containing this class? If multiple jars of the same type are available on the cluster, there is no guarantee of the JVM picking the proper classes for processing, which results in the intermittent...

1 More Reply
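
As a way to check for the duplicate-jar condition described in the reply above, here is a minimal sketch, assuming the cluster's jars live under /databricks/jars (the class name is a placeholder):

import glob
import zipfile

# Placeholder class to look for; use the class from the NoClassDefFoundError.
class_file = "com/example/MyClass.class"

# Typical cluster jar location on Databricks; adjust if your jars live elsewhere.
hits = [jar for jar in glob.glob("/databricks/jars/*.jar")
        if class_file in zipfile.ZipFile(jar).namelist()]

print(f"{len(hits)} jar(s) contain {class_file}:")
for jar in hits:
    print(" ", jar)

More than one hit would explain the intermittent failures, since which copy the JVM loads is not deterministic.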
Jorge3
by New Contributor III
  • 2158 Views
  • 3 replies
  • 2 kudos

Resolved! [Databricks Assets Bundles] Workflow trigger on file arrival

Hi everyone! I'm setting up a workflow using Databricks Asset Bundles (DABs), and I want to configure my workflow to be triggered on file arrival. However, all the examples I've found in the documentation use schedule triggers. Does anyone know if it is...

Latest Reply
Ajay-Pandey
Esteemed Contributor III
  • 2 kudos

Hi @Jorge3, yes, you can also use continuous mode. Please find the syntax below:

resources:
  jobs:
    dbx_job:
      name: continuous_job_name
      continuous:
        pause_status: UNPAUSED
      queue:
        enabled: true

2 More Replies
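
For the file-arrival case asked about above, job definitions in DABs also accept a file_arrival trigger; a minimal sketch, assuming the job should watch a Unity Catalog volume path (the URL and names are placeholders):

resources:
  jobs:
    dbx_job:
      name: file_arrival_job_name
      trigger:
        pause_status: UNPAUSED
        file_arrival:
          # Placeholder path; point this at the storage location to monitor.
          url: /Volumes/my_catalog/my_schema/my_volume/landing/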
ismaelhenzel
by Contributor
  • 2145 Views
  • 1 reply
  • 1 kudos

Addressing Pipeline Error Handling in Databricks bundle run with CI/CD when SUCCESS WITH FAILURES

I'm using Databricks Asset Bundles and I have pipelines that contain "if all done" rules. When running in CI/CD, if a task fails, the pipeline returns a message like "the job xxxx SUCCESS_WITH_FAILURES" and it passes, potentially deploying a broken p...

Data Engineering
bundle
CICD
Databricks
Latest Reply
ismaelhenzel
Contributor
  • 1 kudos

Awesome answer, I will try the first approach. I think it is a less intrusive solution than changing the rules of my pipeline in development scenarios. This way, I can maintain a general pipeline for deployment across all environments. We plan to imp...

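
A minimal sketch of the kind of CI-side check discussed in this thread, using the Databricks Python SDK to fail the build when any task in the run did not succeed (the run id is a placeholder, and DATABRICKS_HOST/DATABRICKS_TOKEN are assumed to be set):

import sys

from databricks.sdk import WorkspaceClient
from databricks.sdk.service import jobs

w = WorkspaceClient()  # reads host/token from the environment
run = w.jobs.get_run(run_id=123456789)  # placeholder; take this from the bundle run output

# A run can end as SUCCESS_WITH_FAILURES; inspect task-level states instead.
failed = [t.task_key for t in (run.tasks or [])
          if t.state and t.state.result_state != jobs.RunResultState.SUCCESS]
if failed:
    sys.exit(f"Run completed, but these tasks failed: {failed}")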
smedegaard
by New Contributor III
  • 1397 Views
  • 1 reply
  • 1 kudos

[delta live table] exception: getPrimaryKeys not implemented for debezium

I've defined a streaming delta live table in a notebook using Python, running on the "preview" channel with delta cache accelerated (Standard_D4ads_v5) compute. It fails with org.apache.spark.sql.streaming.StreamingQueryException: [STREAM_FAILED] Query [id = xxx, ru...

ETLdeveloper
by New Contributor II
  • 3051 Views
  • 1 reply
  • 0 kudos

Resolved! I have to run notebooks concurrently using a process pool executor in Python

Hello All, my scenario requires me to create code that reads tables from the source catalog and writes them to the destination catalog using Spark. Doing them one by one is not a good option when there are 300 tables in the catalog. So I am trying the pr...

Latest Reply
Ajay-Pandey
Esteemed Contributor III
  • 0 kudos

Hi @ETLdeveloper, you can use multithreading to help you run notebooks in parallel. Attaching code for your reference:

from concurrent.futures import ThreadPoolExecutor

class NotebookData:
    def __init__(self, path, timeout, parameters = Non...

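
The snippet above is truncated; here is a minimal, self-contained sketch of the same pattern, assuming it runs in a Databricks notebook where dbutils is available (paths, the worker count, and parameters are placeholders):

from concurrent.futures import ThreadPoolExecutor

def run_notebook(path, timeout_seconds, parameters):
    # dbutils.notebook.run blocks until the child notebook finishes.
    return dbutils.notebook.run(path, timeout_seconds, parameters)

tables = ["table_a", "table_b", "table_c"]  # in practice, the ~300 tables

with ThreadPoolExecutor(max_workers=8) as executor:
    futures = {t: executor.submit(run_notebook, "/path/to/copy_table_notebook",
                                  3600, {"table_name": t})
               for t in tables}

for table, future in futures.items():
    print(table, future.result())  # .result() re-raises if the notebook run failed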
Anske
by New Contributor III
  • 1253 Views
  • 4 replies
  • 0 kudos

How to stop a dataframe with a federated table source from being re-evaluated when referenced (cache?)

Hi, would anyone happen to know whether it's possible to cache a dataframe in memory that is the result of a query on a federated table? I have a notebook that queries a federated table, does some transformations on the dataframe, and then writes this data...

Latest Reply
Anske
New Contributor III
  • 0 kudos

@daniel_sahal, this is the code snippet:

lsn_incr_batch = spark.sql(f"""
    select start_lsn, tran_begin_time, tran_end_time, tran_id, tran_begin_lsn,
           cast('{current_run_ts}' as timestamp) as appended
    from externaldb.cdc.lsn_time_mapping
    where tran_end_time > '...

3 More Replies
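
For the caching question itself, a minimal sketch of two common options, assuming the federated query above (table and column names are placeholders):

# Option 1: persist the dataframe and force materialization with an action,
# so later references read from the cache instead of the federated source.
lsn_incr_batch.cache()
lsn_incr_batch.count()  # triggers the one federated read

# Option 2: stage the result to a local Delta table and read it back,
# which survives restarts and guarantees no re-query of the source.
lsn_incr_batch.write.mode("overwrite").saveAsTable("staging.lsn_incr_batch")
lsn_incr_batch = spark.table("staging.lsn_incr_batch")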
amar1995
by New Contributor II
  • 2482 Views
  • 4 replies
  • 0 kudos

Performance Issue with XML Processing in Spark Databricks

I am reaching out to bring attention to a performance issue we are encountering while processing XML files using Spark-XML, particularly with the configuration spark.read().format("com.databricks.spark.xml"). Currently, we are experiencing significant...

Latest Reply
shan_chandra
Databricks Employee
  • 0 kudos

@amar1995 - Can you try this streaming approach and see if it works for your use case (using autoloader) - https://kb.databricks.com/streaming/stream-xml-auto-loader

3 More Replies
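
A minimal sketch of the Auto Loader approach from the linked KB article, assuming a recent runtime where Auto Loader supports the XML format natively (paths, rowTag, and table names are placeholders):

df = (spark.readStream
      .format("cloudFiles")
      .option("cloudFiles.format", "xml")
      .option("rowTag", "record")  # XML element to treat as one row
      .option("cloudFiles.schemaLocation", "/tmp/xml_schema")
      .load("/mnt/landing/xml/"))

(df.writeStream
   .option("checkpointLocation", "/tmp/xml_checkpoint")
   .trigger(availableNow=True)
   .toTable("bronze.xml_records"))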
johnp
by New Contributor III
  • 1561 Views
  • 1 reply
  • 0 kudos

Call databricks notebook from azure flask app

I have an Azure web app running a Flask web server. From the Flask server, I want to run some queries on the data stored in ADLS Gen2 storage. I already created Databricks notebooks running these queries. The Flask server will pass some parameters in ...

Latest Reply
feiyun0112
Honored Contributor
  • 0 kudos

You can use the Databricks SDK: https://docs.databricks.com/en/dev-tools/sdk-python.html#create-a-job

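
A minimal sketch of that SDK suggestion from a Flask route, assuming databricks-sdk is installed and DATABRICKS_HOST/DATABRICKS_TOKEN are set (the notebook path, cluster id, and parameter names are placeholders):

from databricks.sdk import WorkspaceClient
from databricks.sdk.service import jobs
from flask import Flask, request

app = Flask(__name__)
w = WorkspaceClient()  # reads host/token from the environment

@app.route("/run-query")
def run_query():
    run = w.jobs.submit(
        run_name="flask-triggered-query",
        tasks=[jobs.SubmitTask(
            task_key="query",
            existing_cluster_id="1234-567890-abcde123",  # placeholder cluster id
            notebook_task=jobs.NotebookTask(
                notebook_path="/Users/me/my_query_notebook",
                base_parameters={"param": request.args.get("param", "")},
            ),
        )],
    ).result()  # blocks until the run finishes
    return {"result_state": str(run.state.result_state)}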
Kanti1989
by New Contributor II
  • 1482 Views
  • 4 replies
  • 0 kudos

Pyspark execution error

I am getting an error message when executing simple PySpark code. Can anyone help me with this?

(screenshot of the error attached)
Latest Reply
AmanSehgal
Honored Contributor III
  • 0 kudos

Could you please share the entire error message? Are you running the code locally or on Databricks?

3 More Replies
data-grassroots
by New Contributor III
  • 3440 Views
  • 6 replies
  • 1 kudos

Resolved! Ingesting Files - Same file name, modified content

We have a data feed with files whose filenames stay the same but whose contents change over time (brand_a.csv, brand_b.csv, brand_c.csv, ...). COPY INTO seems to ignore the files when they change. If we set the force flag to true and run it, we end up w...

Latest Reply
data-grassroots
New Contributor III
  • 1 kudos

Thanks for the validation, Werners! That's the path we've been heading down (copy + merge). I still have some DLT experiments planned but - at least for this situation - copy + merge works just fine.

5 More Replies
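
A minimal sketch of the copy + merge pattern the thread settles on, assuming a staging Delta table and a business key column named id (all table, path, and column names are placeholders):

# Re-ingest the feed even when filenames are unchanged, into a staging table.
spark.sql("""
    COPY INTO staging.brands
    FROM '/mnt/feed/'
    FILEFORMAT = CSV
    FORMAT_OPTIONS ('header' = 'true')
    COPY_OPTIONS ('force' = 'true')
""")

# Upsert staged rows into the target so changed contents replace old rows.
spark.sql("""
    MERGE INTO main.brands AS t
    USING staging.brands AS s
    ON t.id = s.id
    WHEN MATCHED THEN UPDATE SET *
    WHEN NOT MATCHED THEN INSERT *
""")

Between runs the staging table would typically be truncated so only the latest file contents get merged.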
miaomia123
by New Contributor
  • 752 Views
  • 1 reply
  • 0 kudos

LLM using Databricks

Is there any coding example for how to use an LLM?

Latest Reply
jose_gonzalez
Databricks Employee
  • 0 kudos

I would like to share the following links: https://www.databricks.com/product/machine-learning/large-language-models and https://docs.databricks.com/en/large-language-models/index.html

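
As a starting point beyond those links, a minimal sketch of querying a Databricks foundation-model serving endpoint over its REST API, assuming such an endpoint is enabled in the workspace (the endpoint name, host, and token are placeholders):

import os
import requests

host = os.environ["DATABRICKS_HOST"]    # e.g. https://adb-xxxx.azuredatabricks.net
token = os.environ["DATABRICKS_TOKEN"]
endpoint = "databricks-meta-llama-3-70b-instruct"  # placeholder endpoint name

resp = requests.post(
    f"{host}/serving-endpoints/{endpoint}/invocations",
    headers={"Authorization": f"Bearer {token}"},
    json={
        "messages": [{"role": "user", "content": "Summarize Delta Lake in one sentence."}],
        "max_tokens": 100,
    },
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])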
BrianJ
by New Contributor II
  • 2476 Views
  • 5 replies
  • 4 kudos

{{job.trigger.type}} not working and throws error on Edit Parameter from Job page

Following the instructions on job parameter dynamic values, I am able to use {{job.id}}, {{job.name}}, {{job.run_id}}, {{job.repair_count}}, and {{job.start_time.[argument]}}. However, when I set trigger_type as trigger_type: {{job.trigger.type}} and hit SAVE, ...

(screenshots of the error attached)
Latest Reply
BrianJ
New Contributor II
  • 4 kudos

Thanks everyone, I decided to use the SparkContext instead: dbutils.notebook.entry_point.getDbutils().notebook().getContext().toJson()

4 More Replies
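
A minimal sketch of reading trigger/run metadata from that context JSON, assuming it runs in a Databricks notebook; the key names inside the JSON (e.g. tags.jobId) vary by runtime version and are shown here as assumptions:

import json

ctx = json.loads(
    dbutils.notebook.entry_point.getDbutils().notebook().getContext().toJson()
)
# Job and run details typically sit under "tags" (assumed key names).
tags = ctx.get("tags", {})
print(tags.get("jobId"), tags.get("runId"))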
Phani1
by Valued Contributor II
  • 1593 Views
  • 0 replies
  • 0 kudos

Boomi integrating with Databricks

Hi Team, is there any impact when integrating Databricks with Boomi as opposed to Azure Event Hub? Could you offer some insights on the integration of Boomi with Databricks? https://boomi.com/blog/introducing-boomi-event-streams/ Regards, Janga

niruban
by New Contributor II
  • 1635 Views
  • 2 replies
  • 0 kudos

Databricks Asset Bundle to deploy only one workflow

Hello Community - I am trying to deploy only one workflow from my CI/CD. But whenever I try to deploy one workflow using "databricks bundle deploy -t prod", it deletes all the existing workflows in the target environment. Is there any option av...

Data Engineering
CICD
DAB
Databricks Asset Bundle
DevOps
Latest Reply
niruban
New Contributor II
  • 0 kudos

@Rajani: This is what I am doing. I have GitHub Actions kick off the following step:

- name: bundle-deploy
  run: |
    cd ${{ vars.HOME }}/dev-ops/databricks_cicd_deployment
    databricks bundle deploy --debug

Before running this step, I am creatin...

1 More Reply
