Data Engineering

Forum Posts

TheDataDexter
by New Contributor III
  • 2222 Views
  • 4 replies
  • 3 kudos

Resolved! Single-Node cluster works but Multi-Node clusters do not read data.

I am currently working with a VNet-injected Databricks workspace. At the moment I have mounted an ADLS Gen2 resource on the Databricks cluster. When running notebooks on a single node that read, transform, and write data we do not encounter any probl...

Latest Reply
ellafj
New Contributor II
  • 3 kudos

@TheDataDexter Did you find a solution to your problem? I am facing the same issue.

3 More Replies
User16869510359
by Esteemed Contributor
  • 8270 Views
  • 2 replies
  • 0 kudos
Latest Reply
Mooune_DBU
Valued Contributor
  • 0 kudos

It's set as an environment variable called `DATABRICKS_RUNTIME_VERSION`. In your init scripts, you just need to add a line to display or save the info (see the Python example below): import os; print("DATABRICKS_RUNTIME_VERSION:", os.environ.get('DATABRICKS_R...

1 More Replies
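The snippet in the reply above can be expanded into a self-contained sketch. It assumes it runs on a Databricks node where `DATABRICKS_RUNTIME_VERSION` is set; elsewhere the variable is absent, so the helper falls back to a default instead of raising:

```python
import os

def get_runtime_version(default="unknown"):
    """Return the Databricks runtime version from the environment.

    On a Databricks cluster the DATABRICKS_RUNTIME_VERSION variable is
    set for every node; outside Databricks it is missing, so we fall
    back to a default rather than raise KeyError.
    """
    return os.environ.get("DATABRICKS_RUNTIME_VERSION", default)

# In an init script this could be logged or written to a file.
print("DATABRICKS_RUNTIME_VERSION:", get_runtime_version())
```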
SaravananPalani
by New Contributor II
  • 18639 Views
  • 8 replies
  • 9 kudos

Is there any way to monitor the CPU, disk and memory usage of a cluster while a job is running?

I am looking for something preferably similar to Windows task manager which we can use for monitoring the CPU, memory and disk usage for local desktop.

Latest Reply
hitech88
New Contributor II
  • 9 kudos

Some important things to look at in the Ganglia UI CPU, memory, and server-load charts to spot the problem: CPU chart: User %, Idle %. A high user % indicates heavy CPU usage in the cluster. Memory chart: Use %, Free %, Swap %. If you see a purple line ove...

7 More Replies
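The rules of thumb in the reply above can be expressed as a small helper that flags the conditions described. The exact thresholds here are illustrative assumptions, not Databricks recommendations:

```python
def diagnose(cpu_user_pct, cpu_idle_pct, mem_free_pct, swap_used_pct):
    """Apply rough Ganglia-style rules of thumb to one metrics snapshot.

    All arguments are percentages. Thresholds are illustrative only.
    Returns a list of human-readable findings (empty if nothing stands out).
    """
    findings = []
    if cpu_user_pct > 80:
        findings.append("heavy CPU usage in the cluster")
    if cpu_idle_pct > 80:
        findings.append("cluster is mostly idle (possibly oversized)")
    if swap_used_pct > 0:
        findings.append("swapping: executors are under memory pressure")
    if mem_free_pct < 10:
        findings.append("low free memory")
    return findings

print(diagnose(cpu_user_pct=92, cpu_idle_pct=5, mem_free_pct=6, swap_used_pct=3))
```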
supremefist
by New Contributor III
  • 3312 Views
  • 5 replies
  • 2 kudos

Resolved! New spark cluster being configured in local mode

Hi, We have two workspaces on Databricks, prod and dev. On prod, if we create a new all-purpose cluster through the web interface and go to Environment in the Spark UI, the spark.master setting is correctly set to be the host IP. This results in a...

Latest Reply
scottb
New Contributor II
  • 2 kudos

I found the same issue when choosing the default cluster setup on first setup: when I went to edit the cluster to add an instance profile, I was not able to save without fixing this. Thanks for the tip.

4 More Replies
Pat
by Honored Contributor III
  • 5101 Views
  • 7 replies
  • 18 kudos

Resolved! Cluster Modes - High Concurrency

It took me quite some time to find the option to create a cluster in High Concurrency mode. It was hidden in the new UI. What should be the way to access the data with TAC? What is the equivalent mode to work with TAC? Does it mean that we are being pu...

Latest Reply
Prabakar
Esteemed Contributor III
  • 18 kudos

Thanks. Always happy to help.

6 More Replies
MadelynM
by New Contributor III
  • 3277 Views
  • 3 replies
  • 3 kudos

How do I move existing workflows and jobs running on an all-purpose cluster to a shared jobs cluster?

A Databricks cluster is a set of computation resources that performs the heavy lifting of all of the data workloads you run in Databricks. Databricks provides a number of options when you create and configure clusters to help you get the best perform...

[Screenshots: left navigation bar with Data Science & Engineering selected; Workflows view]
Latest Reply
Kaniz
Community Manager
  • 3 kudos

Hi @Madelyn Mullen​ , Thank you for sharing such an excellent and informative post. We hope to see these very often.

  • 3 kudos
2 More Replies
Megan05
by New Contributor III
  • 1673 Views
  • 4 replies
  • 4 kudos

Resolved! Out of Memory/Connection Lost When Writing to External SQL Server from Databricks Using JDBC Connection

I am working on writing a large amount of data from Databricks to an external SQL Server using a JDBC connection. I keep getting timeout errors / connection lost, but digging deeper it appears to be a memory problem. I am wondering what cluster configura...

Latest Reply
hotrabattecom
New Contributor II
  • 4 kudos

Thanks for the answer. I am also running into this problem.

3 More Replies
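For large JDBC writes like the one in this thread, the usual knobs are the JDBC writer's `batchsize` and the number of partitions writing in parallel. A minimal sketch of assembling those options; the values are illustrative assumptions, not tuned recommendations:

```python
def jdbc_write_options(batch_size=10_000, num_partitions=8):
    """Build an options dict for a Spark DataFrame JDBC write.

    batch_size:     rows sent per JDBC batch insert ("batchsize" option).
    num_partitions: parallel connections opened to the target server
                    ("numPartitions" option); fewer partitions means
                    less concurrent memory and connection pressure.
    Values are illustrative, not tuned recommendations.
    """
    return {
        "batchsize": str(batch_size),
        "numPartitions": str(num_partitions),
        "isolationLevel": "READ_COMMITTED",
    }

# Usage on a Databricks cluster (not executed here; jdbc_url and df
# are assumed to exist in the notebook):
# (df.write.format("jdbc")
#    .option("url", jdbc_url)
#    .option("dbtable", "dbo.target")
#    .options(**jdbc_write_options())
#    .mode("append")
#    .save())
print(jdbc_write_options())
```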
rbarrero
by New Contributor III
  • 3171 Views
  • 13 replies
  • 7 kudos

Resolved! Error saving changes on Job Cluster

Hello all, and thanks. After deploying a model for serving, I went to edit the corresponding Job Cluster to configure its init_script, but when I try to save the changes (Confirm and restart) it throws the following error: Error: Cannot edit cluster 0503-141315-hu3wd4i...

Latest Reply
rbarrero
New Contributor III
  • 7 kudos

Sorry for the delay in responding. A colleague was finally able to fix the problem; he can now edit the cluster and add the init_script without issues. Thank you!

12 More Replies
reedzhang
by New Contributor III
  • 2096 Views
  • 6 replies
  • 5 kudos

Resolved! uninstalled libraries continue to get installed on cluster startup

We have been trying to update some library versions by uninstalling the old versions and installing new ones. However, the old libraries continue to get installed on cluster startup despite not showing up in the "libraries" tab of the cluster page. W...

Latest Reply
reedzhang
New Contributor III
  • 5 kudos

The issue seemed to go away on its own. At some point the libraries page started showing what was getting installed to the cluster, and removing libraries from the page caused them to stop getting installed on cluster startup. I'm guessing there was ...

5 More Replies
rachelk05
by New Contributor II
  • 1249 Views
  • 2 replies
  • 4 kudos

Resolved! Databricks Community: Cluster Terminated Reason: Unexpected Launch Failure

Hi, I've been encountering the following error when I try to start a cluster, but the status page says everything is fine. Is something happening, or are there other steps I can try? Time: 2022-03-13 14:40:51 EDT. Message: Cluster terminated. Reason: Unexpected...

Latest Reply
Kaniz
Community Manager
  • 4 kudos

Hi @Rachel Kelley​, has the error stopped occurring for you?

1 More Replies
mikep
by New Contributor II
  • 2390 Views
  • 7 replies
  • 0 kudos

Resolved! Kubernetes or ZooKeeper for HA?

Hello. I am trying to understand High Availability in Databricks. I understand that Databricks uses Kubernetes for the cluster manager and to manage Docker containers. And while Databricks runs on top of AWS, Azure, or GCP, is HA automatically provisioned when I st...

Latest Reply
Anonymous
Not applicable
  • 0 kudos

6 More Replies
GoldenTuna
by New Contributor II
  • 2448 Views
  • 5 replies
  • 2 kudos

Resolved! Mounting an Azure Storage Account in a cluster init script?

We are trying to configure our environment so that when our cluster starts up, it checks whether our Azure storage account container is mounted and, if not, mounts it. We can do this fine in a notebook but have had no luck doing this through an in...

Latest Reply
Anonymous
Not applicable
  • 2 kudos

@David Kruetzkamp​ - Would you be happy to mark whichever answer helped the most as best? That will help other members find the solution more quickly.

4 More Replies
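In a notebook, the check-then-mount pattern from this question can be written roughly as below. The `dbutils` calls only exist inside Databricks, so the decision logic is factored into a plain function for illustration; the names and the `mount_fn` wiring are assumptions:

```python
def ensure_mounted(mount_point, existing_mounts, mount_fn):
    """Mount only if the mount point is not already present.

    existing_mounts: list of mount-point paths (on Databricks, e.g.
                     [m.mountPoint for m in dbutils.fs.mounts()]).
    mount_fn:        zero-arg callable that performs the actual mount
                     (e.g. a wrapper around dbutils.fs.mount).
    Returns True if a new mount was created, False if already mounted.
    """
    if mount_point in existing_mounts:
        return False
    mount_fn()
    return True

# In a Databricks notebook this would be wired up roughly as:
# ensure_mounted(
#     "/mnt/data",
#     [m.mountPoint for m in dbutils.fs.mounts()],
#     lambda: dbutils.fs.mount(source=..., mount_point="/mnt/data",
#                              extra_configs=configs),
# )
```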
timothy_uk
by New Contributor III
  • 1520 Views
  • 4 replies
  • 0 kudos

Resolved! Zombie .Net Spark Databricks Job (CourseGrainedExecutorBackend)

Hi all, Environment: Nodes: Standard_E8s_v3; Databricks Runtime: 9.0; .NET for Apache Spark 2.0.0. I'm invoking spark-submit to run a .NET Spark job hosted in Azure Databricks. The job is written in C#/.NET with its only transformation and action reading a C...

Latest Reply
jose_gonzalez
Moderator
  • 0 kudos

Hi @Timothy Lin​, I recommend not using spark.stop() or System.exit(0) in your code, because they explicitly stop the Spark context but the graceful shutdown and handshake with Databricks' job service does not happen.

3 More Replies
Mariusz_Cyp
by New Contributor II
  • 3682 Views
  • 3 replies
  • 7 kudos

When the billing time starts for the cluster?

Hi All, I'm just wondering when exactly billing starts for a Databricks cluster. Is startup time included? If cluster creation takes 3 minutes and query execution only 2, will I pay for 2 or 5? Thanks in advance! MC

Latest Reply
franco_patano
New Contributor III
  • 7 kudos

Billing for databricks DBUs starts when Spark Context becomes available. Billing for the cloud provider starts when the request for compute is received and the VMs are starting up.

2 More Replies
Hubert-Dudek
by Esteemed Contributor III
  • 510 Views
  • 0 replies
  • 19 kudos

docs.databricks.com

Databricks Runtime 10.2 Beta is available as of yesterday. More details here: https://docs.databricks.com/release-notes/runtime/10.2.html New features and improvements: Use Files in Repos with Spark Streaming; Databricks Utilities adds an update mount comma...
