<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Delta tables and YOLO computer vision tasks in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/delta-tables-and-yolo-computer-vision-tasks/m-p/63145#M32186</link>
    <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;I would really appreciate if someone could help me out. I feel it’s both a data engineering and ML question.&lt;/P&gt;&lt;P&gt;One thing we use at wo is YOLO for object detection. I’ve managed to run YOLO by loading data from the blob storage, but I’ve seen that the best way to do deep learning tasks in Databricks is to train your ML models on Delta Live Tables.&lt;/P&gt;&lt;P&gt;I currently have my training dataset as a Delta table, and I was wondering if anyone has managed to train computer vision models on Delta tables.&lt;/P&gt;&lt;P&gt;I’ve read the documentations and have seen repos such as petastorm that try to implement training on delta tables, but I can’t for the life of me understand how to actually run yolo this way, especially since YOLO uses yaml for config.&lt;/P&gt;&lt;P&gt;Thank in advance for your help! &lt;span class="lia-unicode-emoji" title=":smiling_face_with_halo:"&gt;😇&lt;/span&gt;&lt;/P&gt;</description>
    <pubDate>Sun, 10 Mar 2024 14:09:07 GMT</pubDate>
    <dc:creator>Andrewcon</dc:creator>
    <dc:date>2024-03-10T14:09:07Z</dc:date>
    <item>
      <title>Delta tables and YOLO computer vision tasks</title>
      <link>https://community.databricks.com/t5/data-engineering/delta-tables-and-yolo-computer-vision-tasks/m-p/63145#M32186</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;I would really appreciate if someone could help me out. I feel it’s both a data engineering and ML question.&lt;/P&gt;&lt;P&gt;One thing we use at wo is YOLO for object detection. I’ve managed to run YOLO by loading data from the blob storage, but I’ve seen that the best way to do deep learning tasks in Databricks is to train your ML models on Delta Live Tables.&lt;/P&gt;&lt;P&gt;I currently have my training dataset as a Delta table, and I was wondering if anyone has managed to train computer vision models on Delta tables.&lt;/P&gt;&lt;P&gt;I’ve read the documentations and have seen repos such as petastorm that try to implement training on delta tables, but I can’t for the life of me understand how to actually run yolo this way, especially since YOLO uses yaml for config.&lt;/P&gt;&lt;P&gt;Thank in advance for your help! &lt;span class="lia-unicode-emoji" title=":smiling_face_with_halo:"&gt;😇&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 10 Mar 2024 14:09:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/delta-tables-and-yolo-computer-vision-tasks/m-p/63145#M32186</guid>
      <dc:creator>Andrewcon</dc:creator>
      <dc:date>2024-03-10T14:09:07Z</dc:date>
    </item>
    <item>
      <title>Re: Delta tables and YOLO computer vision tasks</title>
      <link>https://community.databricks.com/t5/data-engineering/delta-tables-and-yolo-computer-vision-tasks/m-p/99565#M40028</link>
      <description>&lt;P&gt;I am also looking for an answer to this question. Did you manage to find a solution&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/102039"&gt;@Andrewcon&lt;/a&gt;&amp;nbsp;?&lt;/P&gt;</description>
      <pubDate>Thu, 21 Nov 2024 00:00:38 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/delta-tables-and-yolo-computer-vision-tasks/m-p/99565#M40028</guid>
      <dc:creator>jnap</dc:creator>
      <dc:date>2024-11-21T00:00:38Z</dc:date>
    </item>
    <item>
      <title>Re: Delta tables and YOLO computer vision tasks</title>
      <link>https://community.databricks.com/t5/data-engineering/delta-tables-and-yolo-computer-vision-tasks/m-p/105733#M42253</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/102039"&gt;@Andrewcon&lt;/a&gt;&amp;nbsp;and&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/133211"&gt;@jnap&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;
&lt;P&gt;Have a look at Mosaic Streaming Dataset. You could load your data from your delta table and then train it on your PyTorch YOLO model. In that example, it use mobilenet model but you can adapt it to use YOLO. Petastorm is another option, but the project is a bit outdated.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://docs.databricks.com/en/machine-learning/load-data/streaming.html" target="_blank"&gt;https://docs.databricks.com/en/machine-learning/load-data/streaming.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 15 Jan 2025 15:29:42 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/delta-tables-and-yolo-computer-vision-tasks/m-p/105733#M42253</guid>
      <dc:creator>MathieuDB</dc:creator>
      <dc:date>2025-01-15T15:29:42Z</dc:date>
    </item>
  </channel>
</rss>

