We recently observed that when we run driver-intensive code on an all-purpose compute, parallel runs of jobs of the same pattern/kind fail. Example: jobs triggered on an all-purpose compute with 4 cores and 8 GB RAM for the driver.
Let's say my job is driver-expensive and will exhaust all of the driver's resources, and I have jobs of the same pattern (kind: driver-expensive) running in parallel (assume 5 parallel jobs have been triggered).
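To make the pattern concrete, here is a simplified sketch of the kind of driver-heavy step each of these jobs runs (the table name and aggregation are placeholders, not our actual code):

```python
# Simplified sketch of a driver-heavy workload.
# "sales_raw" and the aggregation below are placeholders.
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

df = spark.table("sales_raw")

# toPandas() collects the full result into the driver's memory;
# with a 4-core / 8 GB driver, several such jobs running at once
# can exhaust the driver heap and trigger the OOM we see.
pdf = df.toPandas()

# ...single-node processing continues on the driver (pandas-style)...
summary = pdf.groupby("region").agg({"amount": "sum"})
print(summary.head())
```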
If my first job exhausts all of the driver's compute (CPU/memory), I would expect the other 4 jobs to be queued until resources free up. Instead, the other jobs fail with an OOM on the driver. Yes, we can use job clusters for this kind of workload (see the sketch at the end of this post), but is there a reason why jobs are not queued when there aren't enough driver resources, whereas when executor resources are exhausted, jobs do get queued until resources become available for the workload?
I don't feel this should be the expected behaviour. Please share your insights if I am missing something.
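For completeness, this is roughly the workaround we are considering: moving these workloads onto dedicated job clusters with run queueing enabled. A minimal sketch using the Jobs 2.1 REST API (workspace URL, token, notebook path, node type, and DBR version are placeholders; exact field names should be verified against the Jobs API docs):

```python
# Sketch: create a job on a dedicated job cluster with run queueing enabled,
# so additional runs wait instead of competing for the same driver.
# HOST, TOKEN, notebook_path, node_type_id and spark_version are placeholders.
import requests

HOST = "https://<workspace-url>"
TOKEN = "<personal-access-token>"

payload = {
    "name": "driver-heavy-job",
    "max_concurrent_runs": 1,       # serialize runs of this job
    "queue": {"enabled": True},     # extra runs are queued rather than failed/skipped
    "tasks": [
        {
            "task_key": "main",
            "notebook_task": {"notebook_path": "/Repos/team/driver_heavy_notebook"},
            "new_cluster": {        # each run gets its own driver, so no shared-driver OOM
                "spark_version": "14.3.x-scala2.12",
                "node_type_id": "Standard_DS3_v2",
                "num_workers": 2,
            },
        }
    ],
}

resp = requests.post(
    f"{HOST}/api/2.1/jobs/create",
    headers={"Authorization": f"Bearer {TOKEN}"},
    json=payload,
)
resp.raise_for_status()
print(resp.json())  # returns the new job_id
```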