cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

How do I implement and train a custom PyTorch model on Databricks using distributed training?

Suheb
Contributor

How can I build my own PyTorch machine-learning model and train it faster on Databricks by using multiple machines/GPUs instead of just one?

1 REPLY 1

KaushalVachhani
Databricks Employee
Databricks Employee

@Suheb , You may look at the torch distributor. It provides multiple distributed training options, including single-node with multiple-GPU training and multi-node training. Below are the references for you.

https://docs.databricks.com/aws/en/machine-learning/train-model/distributed-training/spark-pytorch-d...

https://docs.databricks.com/aws/en/notebooks/source/deep-learning/torch-distributor-lightning.html 

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local communityโ€”sign up today to get started!

Sign Up Now