Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Concurrent Workflow Jobs

holychs
New Contributor III

Hi Community, 

I am trying to run a Databricks workflow job using run_job_task inside a for_loop, with the concurrency set to 2.

I can see 2 iteration jobs getting triggered successfully. But both fail with an error:

"ConnectException: Connection refused (Connection refused)

Error while obtaining a new communication channel"
By contrast, the same job completes successfully when the concurrency is set to 1.
Cluster Configuration: Driver: Standard_DS5_v2 · Workers: Standard_DS5_v2 · 4-8 workers
 
1 ACCEPTED SOLUTION

Accepted Solutions

holychs
New Contributor III

It was an internal bug, resolved by managing different parameters for each loop job.
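
For readers hitting the same error, here is a minimal sketch of what "different parameters for each loop job" could look like when the task definition is built in Python. The thread does not say which parameters were involved, so `source_table`, `output_path`, the task keys, and the job ID are all hypothetical. The field names (`for_each_task`, `inputs`, `concurrency`, `run_job_task`, `job_parameters`) and the `{{input.<key>}}` reference follow the Jobs API as I understand it; treat them as assumptions and check them against the current documentation before use.

```python
import json


def build_for_each_task(job_id, iteration_inputs, concurrency=2):
    """Build a Jobs-API-style task dict that runs a child job once per input.

    Hypothetical helper: it only assembles a payload and makes no API call.
    """
    # Serialize each input with sorted keys so equal dicts compare equal.
    serialized = [json.dumps(item, sort_keys=True) for item in iteration_inputs]
    # Refuse duplicate inputs so no two concurrent iterations share parameters.
    if len(set(serialized)) != len(serialized):
        raise ValueError("each loop iteration needs its own parameter set")

    return {
        "task_key": "run_child_jobs",  # hypothetical task key
        "for_each_task": {
            # Assumption: the inputs are passed as a JSON-encoded array string.
            "inputs": json.dumps(iteration_inputs),
            "concurrency": concurrency,
            "task": {
                "task_key": "run_child_job_iteration",  # hypothetical task key
                "run_job_task": {
                    "job_id": job_id,
                    # Each iteration resolves these from its own input object.
                    "job_parameters": {
                        "source_table": "{{input.source_table}}",
                        "output_path": "{{input.output_path}}",
                    },
                },
            },
        },
    }


task = build_for_each_task(
    job_id=123,  # hypothetical child job ID
    iteration_inputs=[
        {"source_table": "sales_eu", "output_path": "/tmp/out/eu"},
        {"source_table": "sales_us", "output_path": "/tmp/out/us"},
    ],
)
print(json.dumps(task, indent=2))
```

The duplicate check is there because two concurrent iterations writing to the same target is one plausible way for a loop to pass at concurrency 1 and fail at 2. It is not a confirmed explanation of the error above.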


2 REPLIES

Takuya-Omi
Valued Contributor III

Hi, @holychs 

Did you encounter any error messages related to an OOM (Out of Memory) error?
It's possible that the driver node of the cluster doesn't have sufficient resources (CPU, memory) to handle multiple concurrent jobs.

--------------------------
Takuya Omi (尾美拓哉)

holychs
New Contributor III

It was an internal bug, resolved by managing different parameters for each loop job.