cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Concurrent Workflow Jobs

holychs
New Contributor II

Hi Community, 

I am trying to run a Databricks workflow job using run_job_task under a for_loop. I have set the concurrent jobs as 2. 

I can see 2 iteration jobs getting triggered successfully. But both fail with an error:

"ConnectException: Connection refused (Connection refused)

Error while obtaining a new communication channel"
Contradictory to the fact that same job gets completed successfully with concurrency as 1. 
Cluster Configuration: Driver: Standard_DS5_v2 ยท Workers: Standard_DS5_v2 ยท 4-8 workers
 
1 ACCEPTED SOLUTION

Accepted Solutions

holychs
New Contributor II

It was an internal bug resolved with managing different parameters for each loop jobs.

View solution in original post

2 REPLIES 2

TakuyaOmi
Contributor III

Hi, @holychs 

Did you encounter any error messages related to an OOM (Out of Memory) error?
Itโ€™s possible that the driver node of the cluster doesnโ€™t have sufficient resources (CPU, memory) to handle multiple concurrent jobs.

holychs
New Contributor II

It was an internal bug resolved with managing different parameters for each loop jobs.

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโ€™t want to miss the chance to attend and share knowledge.

If there isnโ€™t a group near you, start one and help create a community that brings people together.

Request a New Group