Hey @-werners-, thanks for answering. First, why then are the CPU and memory utilisation metrics we are getting showing activity on the driver only, while the workers seem idle, with barely any utilisation from training? With TorchDistributor I would think at least one worker should be in use...
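As a quick sanity check, here is a minimal sketch (assuming a Spark 3.4+/Databricks ML runtime where `pyspark.ml.torch.distributor` is available, and a hypothetical `probe` function and `num_processes=2`) that logs the rank and hostname from inside the distributed function, so you can confirm whether the processes actually land on the workers rather than the driver:

```python
from pyspark.ml.torch.distributor import TorchDistributor


def probe():
    import os
    import socket

    # TorchDistributor launches this function with the standard
    # torch.distributed env vars (RANK, WORLD_SIZE, ...) already set.
    rank = os.environ.get("RANK", "?")
    world = os.environ.get("WORLD_SIZE", "?")
    print(f"rank {rank}/{world} running on host {socket.gethostname()}")


# local_mode=False runs the processes on the workers rather than the driver;
# num_processes=2 is an assumption -- match it to your worker/GPU count.
TorchDistributor(num_processes=2, local_mode=False, use_gpu=False).run(probe)
```

If every printed hostname is the driver's, the work never reached the workers, which would explain the flat worker metrics.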
Hey, so can we not even use TorchDistributor with Distributed Data Parallel to achieve distributed training in my code? `TorchDistributor` is a distribution library written for Spark, yet with this setup I am not able to get the requir...
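For reference, a minimal DDP training function wired through TorchDistributor could look like the sketch below (assumptions: CPU workers with the `gloo` backend, a placeholder `nn.Linear` model on random data, and `num_processes=2` — substitute your own model, data, and process count):

```python
import torch
import torch.distributed as dist
import torch.nn as nn
from torch.nn.parallel import DistributedDataParallel as DDP

from pyspark.ml.torch.distributor import TorchDistributor


def train():
    # The env vars TorchDistributor sets (MASTER_ADDR, RANK, WORLD_SIZE, ...)
    # let init_process_group use the default env:// rendezvous.
    dist.init_process_group(backend="gloo")  # use "nccl" on GPU clusters
    rank = dist.get_rank()

    # Placeholder model and random data -- substitute your own.
    model = DDP(nn.Linear(10, 1))
    optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
    loss_fn = nn.MSELoss()

    for step in range(10):
        x = torch.randn(32, 10)
        y = torch.randn(32, 1)
        optimizer.zero_grad()
        loss = loss_fn(model(x), y)
        loss.backward()  # DDP all-reduces gradients across workers here
        optimizer.step()
        if rank == 0:
            print(f"step {step}: loss {loss.item():.4f}")

    dist.destroy_process_group()


# num_processes=2 and CPU-only are assumptions; adjust to your cluster.
TorchDistributor(num_processes=2, local_mode=False, use_gpu=False).run(train)
```

If a skeleton like this shows worker-side activity but your real code does not, the gap is likely in how the training function or its data loading is set up rather than in TorchDistributor itself.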