Delta tables and YOLO computer vision tasks
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
โ03-10-2024 07:09 AM
Hi all,
I would really appreciate if someone could help me out. I feel itโs both a data engineering and ML question.
One thing we use at wo is YOLO for object detection. Iโve managed to run YOLO by loading data from the blob storage, but Iโve seen that the best way to do deep learning tasks in Databricks is to train your ML models on Delta Live Tables.
I currently have my training dataset as a Delta table, and I was wondering if anyone has managed to train computer vision models on Delta tables.
Iโve read the documentations and have seen repos such as petastorm that try to implement training on delta tables, but I canโt for the life of me understand how to actually run yolo this way, especially since YOLO uses yaml for config.
Thank in advance for your help! ๐
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
โ11-20-2024 04:00 PM
I am also looking for an answer to this question. Did you manage to find a solution @Andrewcon ?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
โ01-15-2025 07:29 AM
Hello @Andrewcon and @jnap ,
Have a look at Mosaic Streaming Dataset. You could load your data from your delta table and then train it on your PyTorch YOLO model. In that example, it use mobilenet model but you can adapt it to use YOLO. Petastorm is another option, but the project is a bit outdated.
https://docs.databricks.com/en/machine-learning/load-data/streaming.html

