cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

How concurrent runs in a job matches to cluster configuration?

AmanSehgal
Honored Contributor III

In databricks jobs, there's a field to add concurrent runs which can be set to 1000.

If I've a cluster with 4 worker nodes and 8 cores each, then at max how many concurrent jobs I'll be able to execute?

What will happen if I launch 100 instances of same job at once over a cluster with 4 worker nodes (8 cores each)?

2 REPLIES 2

Prabakar
Databricks Employee
Databricks Employee

@Aman Sehgal​ On E2 workspace the limit is 1000 concurrent runs. If you trigger 100 runs​ at the same time, 100 clusters will be created and the runs will be executed. If you use the same cluster for 100 runs, then you might face a lot of failed jobs or even the cluster might go down as it will be overloaded.

AmanSehgal
Honored Contributor III

Ok. So at max how many jobs can run concurrently on a cluster with 4 nodes and 8 cores each and 32gb memory.

Let's say incoming data in each file is 100kb and there are 100 such files arriving at once.

How can I do estimate calculations?

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group