Data Engineering

Spark Driver failed due to DRIVER_UNAVAILABLE but not due to memory pressure

duliu
New Contributor II

Hello,

I have a job cluster running a streaming job, and it failed unexpectedly on 19 March with DRIVER_UNAVAILABLE (Request timed out, Driver is temporarily unavailable) in the event log. This is the run: https://atlassian-discover.cloud.databricks.com/jobs/323849284041517/runs/395169892801478?o=44820012... 

I'm aware of a KB article covering the same problem, https://kb.databricks.com/en_US/jobs/driver-unavailable, which points out that memory pressure is a common cause. However, according to the driver stdout, there were only minor GCs, each taking around 30-40 ms, around the time the driver became unavailable:

[Screenshot: duliu_0-1712192893352.png]
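
For readers who want to repeat this GC check on the text of the driver stdout rather than by eye, here is a minimal sketch. It assumes JDK 8-style GC log lines ending in `<seconds> secs]`. The actual format depends on the JVM version and GC flags on the cluster, so the regular expression may need adjusting. The function names and the 1000 ms threshold are illustrative choices, not anything defined by Databricks.

```python
import re

# Assumption: JDK 8-style GC log lines such as
#   "[GC (Allocation Failure) [PSYoungGen: 100K->10K(200K)] 300K->210K(600K), 0.0345678 secs]"
#   "[Full GC (Ergonomics) ... , 2.5000000 secs]"
# Other JVM versions or GC flags produce different formats.
GC_LINE = re.compile(r"\[(Full GC|GC)\b.*?(\d+\.\d+) secs\]")


def gc_pauses(lines):
    """Yield (kind, pause_ms) for every line that looks like a GC event."""
    for line in lines:
        match = GC_LINE.search(line)
        if match:
            yield match.group(1), float(match.group(2)) * 1000.0


def long_pauses(lines, threshold_ms=1000.0):
    """Return full GCs and any pause at or above threshold_ms (hypothetical cutoff)."""
    return [
        (kind, ms)
        for kind, ms in gc_pauses(lines)
        if kind == "Full GC" or ms >= threshold_ms
    ]
```

Calling `long_pauses(open("stdout.txt"))` on a downloaded copy of the driver stdout (the file name is a placeholder) returns an empty list when the log contains only short minor collections, which would be consistent with the observation above.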

I also checked the driver log (log4j logs). It contains no error messages, and the few warnings it does contain are unrelated. In fact, the driver kept writing logs for several minutes after the DRIVER_UNAVAILABLE message appeared in the event log.
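
A related check is to look for silent periods in the log4j output, since a long gap between consecutive log lines can hint at a stalled driver even when GC looks healthy. The sketch below assumes each log line starts with a `yy/MM/dd HH:mm:ss` timestamp. That prefix is an assumption about the log4j pattern in use, and `largest_gaps` is a hypothetical helper name.

```python
import re
from datetime import datetime

# Assumption: lines start with a "yy/MM/dd HH:mm:ss" timestamp,
# e.g. "24/03/19 10:15:32 INFO SomeClass: message".
# Lines without that prefix (stack traces, wrapped text) are skipped.
TIMESTAMP = re.compile(r"^(\d{2}/\d{2}/\d{2} \d{2}:\d{2}:\d{2}) ")


def largest_gaps(lines, top=3):
    """Return the `top` largest (gap_seconds, start, end) between consecutive timestamped lines."""
    stamps = []
    for line in lines:
        match = TIMESTAMP.match(line)
        if match:
            stamps.append(datetime.strptime(match.group(1), "%y/%m/%d %H:%M:%S"))
    gaps = [
        ((later - earlier).total_seconds(), earlier, later)
        for earlier, later in zip(stamps, stamps[1:])
    ]
    return sorted(gaps, key=lambda gap: gap[0], reverse=True)[:top]
```

If the largest gaps around the failure time are only a few seconds, the driver process was most likely still running and logging, which matches what is described above.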

I tried loading the Spark UI, but after a long wait with messages saying it was processing files, it failed with the following error, so I can't view the Spark history UI either:

[Screenshot: duliu_1-1712192913524.png]

Could anyone help please?

1 REPLY

Ayushi_Suthar
Databricks Employee

Hi @duliu, hope you are doing well!

Would you kindly see if the KB article below addresses your problem?

https://kb.databricks.com/en_US/jobs/driver-unavailable

Please let me know if this helps, and leave a like if this information is useful. Follow-ups are appreciated.
Kudos
Ayushi
