Data Engineering

The driver is temporarily unavailable

User16869510359
Esteemed Contributor

My job fails with the error "Driver is temporarily unavailable". Apparently it is permanently unavailable, because the job does not pause, it fails.

1 ACCEPTED SOLUTION

Accepted Solutions

User16869510359
Esteemed Contributor

The error message means the Spark driver is running out of resources. Increasing the driver memory or using a larger driver instance type is a quick workaround, but identifying the root cause is the key.

On the application side, review whether there are driver-intensive operations, such as collect() or toPandas().

Also check whether the driver instance type is adequate for the workload.
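To make the advice above concrete, here is a minimal PySpark sketch contrasting driver-heavy patterns with safer alternatives. It assumes an active SparkSession named `spark` and a table called `events` with a `date` column; those names are illustrative, not from the original thread.

```python
from pyspark.sql import functions as F

df = spark.table("events")  # hypothetical table name

# Driver-heavy: collect() pulls every row into driver memory.
# rows = df.collect()

# Driver-heavy: toPandas() materializes the full dataset on the driver.
# pdf = df.toPandas()

# Safer: aggregate on the executors first, then bring back a small,
# bounded result to the driver.
summary = df.groupBy("date").agg(F.count("*").alias("n"))
small_pdf = summary.limit(1000).toPandas()

# Safer still: write large results out instead of collecting them.
summary.write.mode("overwrite").saveAsTable("events_daily_counts")
```

The general rule: keep full-dataset materialization on the executors, and only move small, already-aggregated results to the driver.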


2 REPLIES


Chalki
New Contributor III

I am facing the same issue. I am writing in batches using a simple for loop, with no collect statements inside the loop. I am rewriting partitions with dynamic partition overwrite mode into a huge, wide Delta table of several TB; the incremental load is sometimes 1-2 TB.
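A minimal sketch of the write pattern described above, assuming an active SparkSession `spark`, a partitioned Delta table, and an iterable of batch DataFrames; all table, column, and variable names here are hypothetical:

```python
# Dynamic partition overwrite: with mode("overwrite"), only the
# partitions present in the incoming batch are replaced, not the
# whole table.
spark.conf.set("spark.sql.sources.partitionOverwriteMode", "dynamic")

for batch_df in incremental_batches:  # hypothetical iterable of DataFrames
    (batch_df.write
        .format("delta")
        .mode("overwrite")
        .partitionBy("event_date")            # illustrative partition column
        .saveAsTable("warehouse.big_wide_table"))
```

Even without collect(), this pattern can pressure the driver: on a multi-TB table the driver tracks the file listing and transaction metadata for every touched partition, so many small files or very wide partition counts inflate driver memory. Compacting small files (e.g. with Delta's OPTIMIZE) and limiting the number of partitions rewritten per batch are common mitigations.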
