Resolved! Does Databricks supports the Pytorch Distributed Training for multiple devices?
Hi, Im trying to use the databricks platform to do the pytorch distributed training, but I didnt find any info about this. What I expected is using multiple clusters to run a common job using pytorch distributed data parallel (DDP) with the code belo...
- 2666 Views
- 6 replies
- 1 kudos
Latest Reply
If only the driver is active, this probably means you are not using Spark. When running pure python,... code, the driver will execute that.If Spark is active, workers receive their tasks from the driver. Generally the driver is not that active, the...
- 1 kudos