06-25-2021 11:43 AM
My job fails with the error "Driver is temporarily unavailable." Apparently it's permanently unavailable, because the job doesn't pause, it fails outright.
Labels: Apache Spark, Driver, Job
Accepted Solutions
06-25-2021 11:47 AM
This error message means the Spark driver has insufficient resources. Increasing the driver memory or using a bigger instance type is a quick workaround, but identifying the root cause is the key thing.
On the application side, review whether there are driver-intensive operations, such as collect() or toPandas().
Also check whether the instance type used is adequate for the workload.
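For illustration, here is a minimal PySpark sketch contrasting the driver-intensive calls mentioned above with driver-friendly alternatives. The `events` table, the `event_date` column, and the output path are hypothetical placeholders, not anything from the original job.

```python
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.getOrCreate()

# Hypothetical input table; any large DataFrame works here.
df = spark.table("events")

# Driver-intensive: both of these pull the full dataset into driver memory
# and are a common cause of "Driver is temporarily unavailable" failures.
# rows = df.collect()
# pdf  = df.toPandas()

# Driver-friendly: aggregate on the executors, then inspect a small result.
summary = df.groupBy("event_date").agg(F.count("*").alias("n"))
summary.show(20)

# Driver-friendly: write results out in a distributed way instead of collecting.
df.write.mode("overwrite").parquet("/tmp/events_out")
```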
08-14-2023 01:10 PM
I am facing the same issue. I am writing in batches using a simple for loop, and I don't have any collect() statements inside the loop. I am rewriting partitions with dynamic partition overwrite mode into a huge, wide Delta table of several TB. The incremental load is sometimes 1-2 TB.
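For reference, a minimal sketch of the write pattern described above (batched dynamic partition overwrite into a Delta table). The batch DataFrames, the `load_date` partition column, and the `big_wide_table` name are toy placeholders; the `partitionOverwriteMode` write option assumes Delta Lake 2.0+ or a recent Databricks Runtime.

```python
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.getOrCreate()

# Toy stand-in for the incremental batches described above (assumption).
batches = [
    spark.range(100).withColumn("load_date", F.lit(f"2023-08-{d:02d}"))
    for d in (1, 2)
]

for batch_df in batches:
    (batch_df.write
        .format("delta")
        .mode("overwrite")
        .option("partitionOverwriteMode", "dynamic")  # replace only the partitions touched by this batch
        .partitionBy("load_date")
        .saveAsTable("big_wide_table"))  # hypothetical target table
```

Even with no collect() in the loop, planning and committing writes against a very wide, multi-TB table keeps metadata work on the driver, so driver sizing can still matter for this pattern.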