I'm currently immersed in a project where I'm leveraging PyTorch to develop an object detection model using satellite imagery. My immediate objective is to perform distributed training on this model using PySpark.
While I have found several tutorials and examples on image classification, I'm having trouble translating these resources to suit my needs.
Specifically, to the best of my knowledge, I think that I need to load images and annotation files using PySpark, and subsequently convert or transform these files into a format that's compatible with PyTorch for the purpose of object detection model building. I'm eagerly seeking advice or any pointers towards helpful tutorials or examples that can aid me in refining and constructing my model.
During my search, I have stumbled upon some resources related to using SparkTorch
or pyspark.ml.torch.distributor, and horovod. However, I've been encountering
difficulties in successfully installing horovod. I appreciate any guidance on
this issue as well.