Data Engineering

Forum Posts

Sorted by:

by brian999 • Contributor

07-10-2024 10:03:19 AM

2008 Views
4 replies
2 kudos

Resolved! Managing libraries in workflows with multiple tasks - need to configure a list of libs for all tasks

I have workflows with multiple tasks, each of which need 5 different libraries to run. When I have to update those libraries, I have to go in and make the update in each and every task. So for one workflow I have 20 different places where I have to g...

Data Engineering

2008 Views
4 replies
2 kudos

07-10-2024 10:03:19 AM

View Replies

Latest Reply

brian999
Contributor

07-11-2024 8:22:53 AM

2 kudos

Actually I think I found most of a solution here in one of the replies: https://community.databricks.com/t5/administration-architecture/installing-libraries-on-job-clusters/m-p/37365/highlight/true#M245It seems like I only have to define libs for the...

2 kudos

07-11-2024 8:22:53 AM

3 More Replies

by Jreco • Contributor

07-11-2024 10:44:43 AM

3320 Views
2 replies
1 kudos

Resolved! SQLServer Incorrect syntax near the keyword 'WITH'

Hi Mates!I'm trying to get some data from an SQLServer using a query; the query has a WITH statement but I'm getting the following error:raise convert_exception(pyspark.errors.exceptions.connect.SparkConnectGrpcException: (com.microsoft.sqlserver.jdb...

Data Engineering

3320 Views
2 replies
1 kudos

07-11-2024 10:44:43 AM

View Replies

Latest Reply

szymon_dybczak
Esteemed Contributor III

07-11-2024 1:11:48 PM

1 kudos

Hi @Jreco ,You need to use prepare query option and then query like below: url = "jdbc:sqlserver://server_name:1433;database=db_name" df = spark.read \ .format("jdbc") \ .option("url", url) \ .option("prepareQuery", "with cte as ( SELECT ...

1 kudos

07-11-2024 1:11:48 PM

1 More Replies

by gaurav_khanna • New Contributor II

03-12-2022 5:54:13 AM

6226 Views
4 replies
3 kudos

Resolved! Notebook is not attaching to a cluster, asks to contact your administrator. Completely stumped. Please help.

Data Engineering

6226 Views
4 replies
3 kudos

03-12-2022 5:54:13 AM

View Replies

Latest Reply

BartRJD
New Contributor II

07-11-2024 12:32:19 PM

3 kudos

I am having the same issue (Azure Databricks).I have a running compute cluster analytics-compute-cluster running in Single User access mode. The Event Log for the cluster says the cluster is running and the "Driver is healthy".I have Manage permissi...

3 kudos

07-11-2024 12:32:19 PM

3 More Replies

by Magesh2798 • New Contributor II

07-10-2024 1:48:34 PM

661 Views
1 replies
0 kudos

Databricks to IDT Connection using Databricks Service Principal

Hello all,I'm trying to create a connection from Databricks to Information Design Tool using access token generated using Databricks Service Principal.While testing the connection I'm getting this error: [Databricks][JDBCDriver](500593) Communication...

Data Engineering

661 Views
1 replies
0 kudos

07-10-2024 1:48:34 PM

View Replies

by leungi • Contributor

07-09-2024 7:51:30 AM

901 Views
1 replies
0 kudos

Error on ANALYZE TABLE [...] COMPUTE STATISTICS FOR COLUMNS [...]

End goal is to apply OPTIMIZE and ZORDER table.However, one of the columns to be ZORDER doesn't have stats collected.Running ANALYZE generates the error below.QueryANALYZE TABLE <catalog>.<schema>.<table> COMPUTE STATISTICS FOR COLUMNS my_col_1, my_c...

Data Engineering

901 Views
1 replies
0 kudos

07-09-2024 7:51:30 AM

View Replies

by duttong • New Contributor III

06-26-2024 7:53:07 AM

4088 Views
8 replies
7 kudos

[Errno 11] resource temporarily unavailable

Hi Databricks Community,We faced a strange error today where the error below was returned when a notebook was being run. It only happens on git connected notebooks and on rerun it succeeds. What is the issue?

Data Engineering

4088 Views
8 replies
7 kudos

06-26-2024 7:53:07 AM

View Replies

Latest Reply

Witold
Honored Contributor

07-11-2024 3:13:11 AM

7 kudos

Just follow https://status.azuredatabricks.net, there you'll see an active incident in West Europe

7 kudos

07-11-2024 3:13:11 AM

7 More Replies

by jim12321 • New Contributor II

03-12-2024 11:15:34 AM

3045 Views
3 replies
0 kudos

Databricks CLI how to start a job and pass the parameters?

I try to start a job ID 85218616788189 and pass one parameters 'demo' in Windows Shell.This works:databricks jobs run-now 85218616788189 If I try this one,databricks jobs run-now --json '{"job_id":85218616788189,"notebook_params": {"demo":"parameter...

Data Engineering

3045 Views
3 replies
0 kudos

03-12-2024 11:15:34 AM

View Replies

Latest Reply

Husky
New Contributor III

07-11-2024 2:09:33 AM

0 kudos

@jim12321 I got the same error with CLI version 0.223.1, but I could resolve it by escaping the quotation marks:databricks jobs run-now --json '{\"job_id\":85218616788189,\"notebook_params\": {\"demo\":\"parameter successful\"}}'Seems to be fixed by ...

0 kudos

07-11-2024 2:09:33 AM

2 More Replies

by SrinuM • New Contributor III

06-26-2024 3:21:54 AM

1236 Views
4 replies
1 kudos

CLOUD_PROVIDER_LAUNCH_FAILURE (CLOUD_FAILURE) for workflow job with all-purpose cluster

One of our databricks workflow job is failing occasionally with below error, after re-running and working fine without any issue.What is the exact reason for the issue and how can we fix itError:Unexpected failure while waiting for the cluster to be ...

Data Engineering

1236 Views
4 replies
1 kudos

06-26-2024 3:21:54 AM

View Replies

Latest Reply

PSR100
New Contributor III

07-11-2024 2:06:38 AM

1 kudos

These are cloud provider related errors and we will not have much error details from the error message. Based on the error message and also, that you have enough CPU/VM quota available, I think the issue might be due to the storage creation stage in ...

1 kudos

07-11-2024 2:06:38 AM

3 More Replies

by RKNutalapati • Valued Contributor

07-11-2022 6:41:22 AM

1987 Views
3 replies
0 kudos

Jobs API "run now" - How to set task wise parameters

I have a job with multiple tasks like Task1 -> Task2 -> Task3. I am trying to call the job using api "run now". Task details are belowTask1 - It executes a Note Book with some input parametersTask2 - It runs using "ABC.jar", so its a jar based task ...

Data Engineering

1987 Views
3 replies
0 kudos

07-11-2022 6:41:22 AM

View Replies

Latest Reply

Harsha777
New Contributor III

07-10-2024 11:14:58 AM

0 kudos

Hi,It would be a good feature to pass parameters at task level. We have scenarios where we would like to create a job with multiple tasks (notebook/dbt) and pass parameters at task level.

0 kudos

07-10-2024 11:14:58 AM

2 More Replies

by hadoan • New Contributor II

07-03-2024 2:46:58 AM

999 Views
2 replies
1 kudos

How to define DLT table with cyclic reference

@Dlt.table def table_A(): return ( dlt.read_stream(...) ) @dlt.table def table_join_A_and_C(): df_A = dlt.read_stream(table_A) df_C = dlt.read_stream(table_C) return ( ....df_A.join(df_C) ) @dlt.table def table_C(): return ( ...

Data Engineering

999 Views
2 replies
1 kudos

07-03-2024 2:46:58 AM

View Replies

Latest Reply

Rishabh_Tiwari
Databricks Employee

07-10-2024 9:59:49 AM

1 kudos

Hi @hadoan , Thank you for reaching out to our community! We're here to help you. To ensure we provide you with the best support, could you please take a moment to review the response and choose the one that best answers your question? Your feedback ...

1 kudos

07-10-2024 9:59:49 AM

1 More Replies

by safoineext • New Contributor

07-03-2024 5:33:48 AM

1058 Views
1 replies
0 kudos

Uploading wheel using `dbutils.fs.cp` to workspace and install it in Runtime>15

I have been trying to find an alternative to copying a wheel file from my local file system to Databricks and then installing it into the cluster. Doing this databricks_client.dbutils.fs.cp("file:/local..../..whl", "dbfs:/Workspace/users/..../..whl")...

Data Engineering

1058 Views
1 replies
0 kudos

07-03-2024 5:33:48 AM

View Replies

Latest Reply

Rishabh_Tiwari
Databricks Employee

07-10-2024 9:53:49 AM

0 kudos

Hi @safoineext , Thank you for reaching out to our community! We're here to help you. To ensure we provide you with the best support, could you please take a moment to review the response and choose the one that best answers your question? Your feedb...

0 kudos

07-10-2024 9:53:49 AM

by Mahesh_Yadav • New Contributor II

07-02-2024 11:22:38 PM

664 Views
1 replies
0 kudos

System Access Column lineage showing inaccurate results

Hi All,I have been trying to leverage the system column lineage table to check the overall journey of a column. But i am getting inaccurate results wherever unpivot transformations are used.Instead of showing the results in a way that 20 columns are ...

Data Engineering

664 Views
1 replies
0 kudos

07-02-2024 11:22:38 PM

View Replies

Latest Reply

Rishabh_Tiwari
Databricks Employee

07-10-2024 9:49:07 AM

0 kudos

Hi @Mahesh_Yadav , Thank you for reaching out to our community! We're here to help you. To ensure we provide you with the best support, could you please take a moment to review the response and choose the one that best answers your question? Your fee...

0 kudos

07-10-2024 9:49:07 AM

by beautrincia • New Contributor

07-03-2024 1:54:04 PM

721 Views
1 replies
0 kudos

How to get data permissions from Sharepoint and Confluence to Unity Catalog for RAG LLM chatbot

We're implementing a chatbot where documents in SharePoint and pages in Confluence augment the results. We want to adhere to existing RBAC policies in these data sources so that the chatbot doesn't produce results that someone should not see. Are you...

Data Engineering

721 Views
1 replies
0 kudos

07-03-2024 1:54:04 PM

View Replies

Latest Reply

Rishabh_Tiwari
Databricks Employee

07-10-2024 9:48:45 AM

0 kudos

Hi @beautrincia , Thank you for reaching out to our community! We're here to help you. To ensure we provide you with the best support, could you please take a moment to review the response and choose the one that best answers your question? Your feed...

0 kudos

07-10-2024 9:48:45 AM

by Tiwarisk • New Contributor III

06-11-2024 9:40:16 PM

1936 Views
5 replies
3 kudos

How can I preserve the data type of the delta tables while writing to Azure blob storage ?

I am writing a file using this but the data type of columns get changed while reading. df.write.format("com.crealytics.spark.excel").option("header", "true").mode("overwrite").save(path) Due to this I have to manual change every time as I can't chang...

Data Engineering

1936 Views
5 replies
3 kudos

06-11-2024 9:40:16 PM

View Replies

Latest Reply

Rishabh_Tiwari
Databricks Employee

07-10-2024 9:16:26 AM

3 kudos

Hi @Tiwarisk , Thank you for reaching out to our community! We're here to help you.To ensure we provide you with the best support, could you please take a moment to review the response and choose the one that best answers your question? Your feedback...

3 kudos

07-10-2024 9:16:26 AM

4 More Replies

by 938452 • New Contributor III

08-22-2023 4:46:30 PM

15941 Views
3 replies
2 kudos

Resolved! Executor memory increase limitation based on node type

Hi Databricks community,I'm using Databricks Jobs Cluster to run some jobs. I'm setting the worker and driver type to AWS m6gd.large, which has 2 cores and 8G of memory each.After seeing it's defaulting executor memory to 2G, I wanted to increase it,...

Data Engineering

15941 Views
3 replies
2 kudos

08-22-2023 4:46:30 PM

View Replies

Latest Reply

938452
New Contributor III

08-28-2023 12:07:07 PM

2 kudos

I think I found the right answer here: https://kb.databricks.com/en_US/clusters/spark-shows-less-memoryIt seems it sets fixed size of ~4GB is used for internal node services. So depending on the node type, `spark.executor.memory` is fixed by Databric...

2 kudos

08-28-2023 12:07:07 PM

2 More Replies

User

Count

1611

763

345

286

252

Databricks Community

Forum Posts

Resolved! Managing libraries in workflows with multiple tasks - need to configure a list of libs for all tasks

Resolved! SQLServer Incorrect syntax near the keyword 'WITH'

Resolved! Notebook is not attaching to a cluster, asks to contact your administrator. Completely stumped. Please help.

Databricks to IDT Connection using Databricks Service Principal

Error on ANALYZE TABLE [...] COMPUTE STATISTICS FOR COLUMNS [...]

[Errno 11] resource temporarily unavailable

Databricks CLI how to start a job and pass the parameters?

CLOUD_PROVIDER_LAUNCH_FAILURE (CLOUD_FAILURE) for workflow job with all-purpose cluster

Jobs API "run now" - How to set task wise parameters

How to define DLT table with cyclic reference

Uploading wheel using `dbutils.fs.cp` to workspace and install it in Runtime>15

System Access Column lineage showing inaccurate results

How to get data permissions from Sharepoint and Confluence to Unity Catalog for RAG LLM chatbot

How can I preserve the data type of the delta tables while writing to Azure blob storage ?

Resolved! Executor memory increase limitation based on node type

Connect with Databricks Users in Your Area

how to get schedule information about a job in dat...

Scheduled Workflow options and DST Change

How to capture dlt pipeline id / name using dynami...

Databricks Workflow design

Not readable format of event_logs traceback in dlt...