cancel
Showing results for 
Search instead for 
Did you mean: 
Get Started Discussions
Start your journey with Databricks by joining discussions on getting started guides, tutorials, and introductory topics. Connect with beginners and experts alike to kickstart your Databricks experience.
cancel
Showing results for 
Search instead for 
Did you mean: 

Databricks cluster launch time

Farzana
New Contributor II

Hi Team,

We have an @adf pipeline which will run some set of activities before #Azure databricks notebooks get called.As and when the notebooks are called our pipeline will launch a new cluster for every job with job compute as Standard F4 with a single worker node.To launch the cluster itself it is taking ~7mins which increases the overall ADF pipeline run time.

Could you please suggest a solution to reduce the cluster launch time?

Note:Our ADF pipeline has an event based trigger which will run as and when there is a file comes to ADLS. We cannot have a cluster created and running all the time as it impacts the cost.

Thanks

1 REPLY 1

Farzana
New Contributor II

@Retired_mod Thanks for the response. Could you please elaborate what do you mean by preloading the runtime on instance pool?

Even the cluster pool needs to run continuously(as there is no specific time period defined for the files to come to ADLS) in order to reduce the launch time of cluster for each and every job so that the over all ADF pipeline run time can be fast.isn't it?

 

Please help me in understanding "preload the runtime on the instance pool"

 

Thanks

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group