cancel
Showing results for 
Search instead for 
Did you mean: 
Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.
cancel
Showing results for 
Search instead for 
Did you mean: 

Does Databricks Container Services (DCS) support for GPU containers with Databricks Runtime 11.3 LTS and higher?

ppang
New Contributor III

I have been trying to start a cluster using DCS with GPU containers (https://github.com/databricks/containers/tree/master/ubuntu/gpu), but was only successful with Databricks Runtime 10.4 LTS and lower.

With Databricks Runtime 11.3 LTS and higher, I got the error:

"Internal error message: Spark error: Driver down cause: driver state change"

Does DCS supports GPU containers for 11.3 LTS and higher?

1 REPLY 1

jessysantos
New Contributor III
New Contributor III

Hello @ppang !

Since you posted your question, the repository you shared has received an update, which includes the following warning:

"Using conda in DCS images is no longer supported starting Databricks Runtime 9.0. We highly recommend users to extend cuda-11.8 examples. We no longer support cuda-10.1 and cuda-11.0 compatibility with latest databricks runtime."

It's likely that the issue you encountered was related to a CUDA incompatibility.

Best regards,

Jéssica Santos

Join 100K+ Data Experts: Register Now & Grow with Us!

Excited to expand your horizons with us? Click here to Register and begin your journey to success!

Already a member? Login and join your local regional user group! If there isn’t one near you, fill out this form and we’ll create one for you to join!