Topics with Label: Interactive cluster

Forum Posts

by FranPérez • New Contributor III

08-01-2022 12:37:10 AM

15675 Views
8 replies
4 kudos

set PYTHONPATH when executing workflows

I set up a workflow using 2 tasks. Just for demo purposes, I'm using an interactive cluster for running the workflow. { "task_key": "prepare", "spark_python_task": { "python_file": "file...

Data Engineering

Latest Reply

jose_gonzalez
Databricks Employee

08-30-2022 10:07:06 AM

4 kudos

Hi @Fran Pérez, just a friendly follow-up. Did any of the responses help you resolve your question? If so, please mark it as best. Otherwise, please let us know if you still need help.

7 More Replies

by JKR • Contributor

04-25-2023 4:57:58 PM

3801 Views
2 replies
0 kudos

The spark driver has stopped unexpectedly and is restarting. Your notebook will be automatically reattached.

I'm getting the error below. Context: I'm using a Databricks shared interactive cluster to run multiple scheduled jobs in parallel every 5 minutes. When I check Ganglia, the driver node's memory reaches almost the maximum and then a restart of the driver happens an...

Data Engineering

Latest Reply

jose_gonzalez
Databricks Employee

04-26-2023 2:20:11 PM

0 kudos

Please check the driver's logs, for example the log4j and GC logs.

1 More Replies

by Soma • Valued Contributor

04-23-2023 8:51:03 AM

8889 Views
7 replies
0 kudos

Databricks Workflow cost on running in interactive cluster

Data Engineering

Latest Reply

Soma
Valued Contributor

04-24-2023 8:39:05 PM

0 kudos

DLT live tables are costlier

6 More Replies

by shan_chandra • Databricks Employee

02-23-2023 12:31:12 PM

3724 Views
1 reply
1 kudos

Resolved! Adding spark_conf tag on Jobs API

Using the Jobs API, when we create a new job to run on an interactive cluster, can we add a spark_conf tag and specify Spark config tuning parameters?

Data Engineering

Latest Reply

shan_chandra
Databricks Employee

02-23-2023 12:53:26 PM

1 kudos

spark_conf has to be set before the cluster starts; otherwise the existing cluster has to be restarted. Hence, the spark_conf tag is available only on the job_cluster. You may have to set the configs manually on the interactive cluster prior to using ...
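
For anyone landing here, a minimal sketch of what that could look like: a Jobs API 2.1 style request body where the task gets its own job cluster (`new_cluster`) carrying `spark_conf`, rather than pointing at an interactive cluster through `existing_cluster_id`. The helper name `build_job_payload`, the task key, the file path, the runtime version, and the node type are all placeholders I made up for illustration, not values from this thread.

```python
import json


def build_job_payload(job_name, python_file, spark_conf):
    """Build a create-job request body whose task runs on a new job cluster.

    spark_conf is attached to the job cluster definition, so it is applied
    when that cluster starts. All concrete values are hypothetical.
    """
    return {
        "name": job_name,
        "tasks": [
            {
                "task_key": "main",  # hypothetical task key
                "spark_python_task": {"python_file": python_file},
                "new_cluster": {
                    "spark_version": "13.3.x-scala2.12",  # placeholder runtime
                    "node_type_id": "Standard_DS3_v2",    # placeholder node type
                    "num_workers": 2,
                    "spark_conf": dict(spark_conf),
                },
            }
        ],
    }


payload = build_job_payload(
    "demo-job",
    "dbfs:/example/main.py",  # placeholder path
    {"spark.sql.shuffle.partitions": "200"},
)
print(json.dumps(payload, indent=2))
```

The sketch only builds and prints the body; sending it to the jobs create endpoint is left out. For an interactive cluster, the same settings would instead go into the cluster's own Spark config before it is (re)started, as the reply says.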

by Praveen2609 • New Contributor

09-19-2022 5:17:22 AM

3182 Views
2 replies
0 kudos

dbfs access for job clusters and interactive cluster

Hi all, I am new to Databricks and need some help understanding how to meet my requirement. Our requirement: a: we have a zip file in Azure Blob Storage, and we are bringing that file to DBFS, unzipping it, and executing our transformations in multiple steps (3 steps...

Data Engineering

Latest Reply

Anonymous
Not applicable

10-02-2022 3:57:09 AM

0 kudos

Hi @praveen rajak, does @Debayan Mukherjee's response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly? We'd love to hear from you. Thanks!

1 More Replies

by Dineshkumar_Raj • New Contributor

05-30-2022 10:25:16 PM

3671 Views
2 replies
1 kudos

why the job running time and command execution time not matching in databricks

I have an Azure Databricks job that is triggered via ADF using an API call. I want to see why the job has been taking n minutes to complete the tasks. In the job execution results, the job execution time says 15 mins and the individual cells/commands d...

Data Engineering

Latest Reply

Anonymous
Not applicable

07-29-2022 9:49:02 AM

1 kudos

Hey there @DineshKumar, does @Prabakar Ammeappin's response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly? Otherwise, please let us know if you need more help. Cheers!

1 More Replies

by Alix • New Contributor III

02-21-2022 9:00:10 AM

13631 Views
8 replies
3 kudos

Resolved! Remote RPC client disassociated error

Hello, I've been trying to submit a job to a transient cluster, but it is failing with this error: Caused by: org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 0.0 failed 4 times, most recent failure: Lost task 0.3 in ...

Data Engineering

Latest Reply

shan_chandra
Databricks Employee

05-10-2022 7:02:51 PM

3 kudos

@Alix Métivier - The error is thrown from the user code (please investigate the jar file attached to the cluster). at m80.dbruniv_0_1.dbruniv.tFixedFlowInput_1Process(dbruniv.java:941) at m80.dbruniv_0_1.dbruniv.run(dbruniv.java:1654) at m80.dbruniv_...

7 More Replies

by User16783852686 • Databricks Employee

10-05-2021 9:52:50 AM

5926 Views
4 replies
2 kudos

Resolved! Slow first time run, jar based jobs

When running a jar-based job, I've noticed that the first run always takes extra time to complete and consecutive runs take less time to finish. This behavior is reproducible on an interactive cluster. What's causing this? Is this e...

Data Engineering

Latest Reply

User16783852686
Databricks Employee

10-06-2021 5:58:18 AM

2 kudos

@Sandeep Katta, this is a fat jar that does read-transform-write. @DD Sharma's response matches @Werner Stinckens' and my intuition that the second job was more efficient because the jar was already loaded. I would not have noticed this had job run...

3 More Replies

by User15813097110 • Databricks Employee

05-07-2021 7:24:28 AM

2474 Views
1 reply
0 kudos

Can we update the jars on a running interactive cluster? Is there a way we can reload the Jars and make them available for use on the Notebook/Jobs ?

Data Engineering

Latest Reply

User15813097110
Databricks Employee

05-07-2021 7:31:43 AM

0 kudos

Since the SparkContext is already up and running, it requires a restart. Technically, it might be possible to kill the JVM process and restart it but we do not recommend that approach. In this case, we recommend restarting the cluster so that the Sp...
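
If the restart needs to be scripted, here is a rough sketch of building the request for the Clusters API restart endpoint (`POST /api/2.0/clusters/restart` with a `cluster_id` body). The helper name `build_restart_request`, the workspace URL, the token, and the cluster ID are placeholders for illustration; the sketch only constructs the request and does not send it.

```python
import json
import urllib.request


def build_restart_request(host, token, cluster_id):
    """Build (but do not send) a cluster restart request.

    host, token, and cluster_id are caller-supplied; the values used
    below are hypothetical placeholders.
    """
    body = json.dumps({"cluster_id": cluster_id}).encode("utf-8")
    return urllib.request.Request(
        url=f"{host.rstrip('/')}/api/2.0/clusters/restart",
        data=body,
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_restart_request(
    "https://example.cloud.databricks.com",  # placeholder workspace URL
    "dapi-EXAMPLE",                          # placeholder token
    "0000-000000-example",                   # placeholder cluster ID
)
print(req.get_method(), req.full_url)
```

Passing `req` to `urllib.request.urlopen` would actually trigger the restart, which interrupts anything running on the cluster, so that step is deliberately left out here.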
