Data Engineering

Forum Posts

Sorted by:

by vamsivarun007 • New Contributor II

03-13-2020 1:53:24 AM

32548 Views
5 replies
2 kudos

Driver is up but is not responsive, likely due to GC.

Hi all, "Driver is up but is not responsive, likely due to GC." This is the message in cluster event logs. Can anyone help me with this. What does GC means? Garbage collection? Can we control it externally?

Data Engineering

32548 Views
5 replies
2 kudos

03-13-2020 1:53:24 AM

View Replies

Latest Reply

jacovangelder
Honored Contributor

06-18-2024 12:12:53 AM

2 kudos

9/10 times GC is due to out of memory exceptions.@Jaron spark.catalog.clearCache() is not a configurable option, but rather a command to submit.

2 kudos

06-18-2024 12:12:53 AM

4 More Replies

by brickster_2018 • Esteemed Contributor

06-25-2021 11:43:48 AM

2502 Views
2 replies
0 kudos

Resolved! The driver is temporarily unavailable

My job fails with Driver is temporarily unavailable. Apparently, it's permanently unavailable, because the job is not pausing but failing.

Data Engineering

2502 Views
2 replies
0 kudos

06-25-2021 11:43:48 AM

View Replies

Latest Reply

Chalki
New Contributor III

08-14-2023 1:10:17 PM

0 kudos

I am facing the same issues . I am writing in batches using a simple for loop. I don't have any collect statements inside the loop. I am rewriting the partitions with partition overwrite dynamic mode in a huge wide delta table - several tb. The incr...

0 kudos

08-14-2023 1:10:17 PM

1 More Replies

by MarsSu • New Contributor II

05-11-2023 7:23:35 PM

2414 Views
3 replies
3 kudos

Resolved! Does driver node of job compute have HA?

I would like to confirm and discuss HA mechanism about driver node of job compute. Because we can image driver node just like master node of cluster. In AWS EMR, we can setup 2 master node so that one of master node failed, another master node can re...

Data Engineering

2414 Views
3 replies
3 kudos

05-11-2023 7:23:35 PM

View Replies

Latest Reply

Anonymous
Not applicable

05-22-2023 12:23:36 AM

3 kudos

Hi @Mars Su We haven't heard from you since the last response from @Werner Stinckens and @karthik p , and I was checking back to see if her suggestions helped you.Or else, If you have any solution, please share it with the community, as it can be...

3 kudos

05-22-2023 12:23:36 AM

2 More Replies

by testname1 • New Contributor II

03-26-2023 2:36:16 PM

1762 Views
1 replies
1 kudos

Is it possible to use the databricks-sql-nodejs driver in a create-react-app app?

I'm using the typescript example for the databricks sql driver but I'm getting errors when compiling:

Data Engineering

1762 Views
1 replies
1 kudos

03-26-2023 2:36:16 PM

View Replies

Latest Reply

User16502773013
Contributor II

04-20-2023 4:01:23 PM

1 kudos

Hello @asdf fdsa ,The NodeJS connector is built for NodeJS environment it will not integrate ReactJSFor cases where a web execution is needed we advise to use SQL Exec APIPlease check documentation here for the same:https://docs.databricks.com/sql/a...

1 kudos

04-20-2023 4:01:23 PM

by Anonymous • Not applicable

11-23-2022 10:32:33 PM

8323 Views
3 replies
14 kudos

Resolved! No suitable driver error When configure the Databricks ODBC and JDBC drivers

Hi all,I've just encountered with this issue. Before I launched an My SQL database in RDS of AWS after use this simple code to create connection to it but it all fails with this error.Is there any additional step? or could anyone can take a look on i...

Data Engineering

8323 Views
3 replies
14 kudos

11-23-2022 10:32:33 PM

View Replies

Latest Reply

Jag
New Contributor III

03-30-2023 11:40:16 AM

14 kudos

Hello, It looks issue with JDBC URL. When I am trying to access the Azure SQL database. I was facing the same issue. So I have created JDBC URL as below and it went well.jdbc:sqlserver://<serverurl>:1433;database=<databasename>;user=<username>@<serve...

14 kudos

03-30-2023 11:40:16 AM

2 More Replies

by jonathan-dufaul • Valued Contributor

12-30-2022 10:56:02 AM

1490 Views
3 replies
3 kudos

Resolved! Why does chaining spark.read from one system/driver and .write to another system/driver take so much longer than doing each piece individually?

i am reading data from IBM DB2 and saving into a MS SQL server (the first step is moving the code itself to databricks, and then we will move the databases to databricks itself). Problem I'm running into is doing something like the below will take > ...

Data Engineering

1490 Views
3 replies
3 kudos

12-30-2022 10:56:02 AM

View Replies

Latest Reply

Hubert-Dudek
Esteemed Contributor III

01-02-2023 7:56:11 AM

3 kudos

Hi, it is related to partitioning optimization. By default, the JDBC driver queries the source database with only a single thread. So write was from one partition as one partition was created, so it was using a single core. When you used pandas, it d...

3 kudos

01-02-2023 7:56:11 AM

2 More Replies

by Ossian • New Contributor

07-21-2021 12:08:18 AM

1725 Views
1 replies
0 kudos

Driver restarts and job dies after 10-20 hours (Structured Streaming)

I am running a java/jar Structured Streaming job on a single node cluster (Databricks runtime 8.3). The job contains a single query which reads records from multiple Azure Event Hubs using Spark Kafka functionality and outputs results to a mssql dat...

Data Engineering

1725 Views
1 replies
0 kudos

07-21-2021 12:08:18 AM

View Replies

Latest Reply

Aviral-Bhardwaj
Esteemed Contributor III

12-10-2022 6:31:30 AM

0 kudos

its seems that when your nodes are increasing it is seeking for init script and it is failing so you can use reserve instances for this activity instead of spot instances it will increase your overall costor alternatively, you can use depended librar...

0 kudos

12-10-2022 6:31:30 AM

by sriramkumar • New Contributor II

05-25-2022 11:27:22 AM

1172 Views
2 replies
1 kudos

Reasons for new Databricks driver

What are the reasons behind Databricks going for their own driver? What differences are made when switching between the previous Spark driver and the new Databricks driver?Is there any specific document I can look at or just the release notes?Also, w...

Data Engineering

1172 Views
2 replies
1 kudos

05-25-2022 11:27:22 AM

View Replies

Latest Reply

Anonymous
Not applicable

07-22-2022 8:39:18 AM

1 kudos

Hey @Sriramkumar Thamizharasan Hope all is well! Just wanted to check in if you were able to resolve your issue would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from...

1 kudos

07-22-2022 8:39:18 AM

1 More Replies

by GeorgeP • New Contributor II

06-16-2022 4:30:59 AM

1454 Views
2 replies
2 kudos

Errors when querying Azure DataBricks through DBeaver on macos

Configured DBeaver to work with either databricks latest driver or simba. I can connect and see databases, schemas, tables and columns. However, when a select statement is executed 30-40 seconds go by before I get the following error message: SQL...

Data Engineering

1454 Views
2 replies
2 kudos

06-16-2022 4:30:59 AM

View Replies

Latest Reply

sage5616
Valued Contributor

07-12-2022 7:56:18 AM

2 kudos

Has this issue been resolved? @aravhish solution did not help me. Any other options?I am experiencing the exact same issue with the same configuration on a Mac. Much help would be appreciated.

2 kudos

07-12-2022 7:56:18 AM

1 More Replies

by abd • Contributor

06-28-2022 6:23:06 AM

5668 Views
8 replies
16 kudos

Resolved! What will happen if a driver or worker node fails?

What will happen if a driver node will fail?What will happen if one of the worker node fails?Is it same in Spark and Databricks or Databricks provide additional features to overcome these situations?

Data Engineering

5668 Views
8 replies
16 kudos

06-28-2022 6:23:06 AM

View Replies

Latest Reply

Kaniz_Fatma
Community Manager

06-28-2022 1:35:30 PM

16 kudos

Hi @Abdullah Durrani, I'm glad to see that the suggestions provided here helped you. Well, in that case, would you please help us select the best answer for the community?

16 kudos

06-28-2022 1:35:30 PM

7 More Replies

by knight007 • New Contributor II

04-08-2022 1:33:32 AM

3186 Views
8 replies
5 kudos

Containerized Databricks/Spark database

Hello. I'm fairly new to Databricks and Spark.I have a requirement to connect to Databricks using JDBC and that works perfectly using the driver I downloaded from the Databricks website ("com.simba.spark.jdbc.Driver")What I would like to do now is ha...

Data Engineering

3186 Views
8 replies
5 kudos

04-08-2022 1:33:32 AM

View Replies

Latest Reply

Kaniz_Fatma
Community Manager

04-26-2022 4:12:47 AM

5 kudos

Hi @Gurps Bassi , Just a friendly follow-up. Do you still need help, or do @Hubert Dudek (Customer) and @Werner Stinckens 's responses help you find the solution? Please let us know.

5 kudos

04-26-2022 4:12:47 AM

7 More Replies

by sh_abrishami_ie • New Contributor II

01-12-2022 1:37:42 AM

4337 Views
3 replies
3 kudos

Resolved! Driver is up but is not responsive, likely due to GC.

Hi,I have a problem with writing an excel file into the mounted file.after 10 mins I see the Driver is up but is not responsive, likely due to GC on the log events.I'm using the following script:df.repartition(1).write .format("com.crealytics.spark....

Data Engineering

4337 Views
3 replies
3 kudos

01-12-2022 1:37:42 AM

View Replies

Latest Reply

Kaniz_Fatma
Community Manager

01-25-2022 5:45:58 AM

3 kudos

Hi @Shokoufeh Abrishami , Can you show the error stack or the logs?

3 kudos

01-25-2022 5:45:58 AM

2 More Replies

by User16826992666 • Valued Contributor

06-15-2021 11:44:07 AM

6527 Views
1 replies
0 kudos

Resolved! When should I choose a different driver type on my cluster vs the worker type?

When creating a cluster the driver type defaults to choose the same type as the workers, and this is what I usually choose. But in what of situation would I want to choose a different driver type?

Data Engineering

6527 Views
1 replies
0 kudos

06-15-2021 11:44:07 AM

View Replies

Latest Reply

sean_owen
Honored Contributor II

06-17-2021 4:12:36 PM

0 kudos

Using the same instance type is a fine default. If you know that you need very large workers, but little happens on the driver, maybe you can save money with a smaller driver. Conversely, you may know that some parts of your notebook involve a lot of...

0 kudos

06-17-2021 4:12:36 PM

by Anonymous • Not applicable

06-02-2021 5:25:49 PM

707 Views
0 replies
0 kudos

How large is considered a “large” dataset to put on the driver node?

Data Engineering

707 Views
0 replies
0 kudos

06-02-2021 5:25:49 PM

by User16873043212 • New Contributor III

05-07-2021 3:00:48 AM

580 Views
0 replies
0 kudos

We can now launch pools on databricks with different instance types. Hybrid Pools allows customers to create clusters and select different Databricks ...

We can now launch pools on databricks with different instance types. Hybrid Pools allows customers to create clusters and select different Databricks pools for driver and workers. It provides a way to support driver vs. worker heterogeneity, and ther...

Data Engineering

580 Views
0 replies
0 kudos

05-07-2021 3:00:48 AM