<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: dbfs not found in Machine Learning</title>
    <link>https://community.databricks.com/t5/machine-learning/dbfs-not-found/m-p/102413#M3858</link>
    <description>&lt;P class="p1"&gt;This problem can often be attributed to the model artifacts not being available on all the executors, especially in a distributed environment.&lt;/P&gt;
&lt;P class="p1"&gt;Can you try using the dbutils.fs.refreshMounts() in your code?&lt;/P&gt;
&lt;P class="p1"&gt;If the model is small enough, broadcast it to all executors using sc.broadcast&lt;/P&gt;</description>
    <pubDate>Tue, 17 Dec 2024 16:47:16 GMT</pubDate>
    <dc:creator>Alberto_Umana</dc:creator>
    <dc:date>2024-12-17T16:47:16Z</dc:date>
    <item>
      <title>dbfs not found</title>
      <link>https://community.databricks.com/t5/machine-learning/dbfs-not-found/m-p/102409#M3856</link>
      <description>&lt;P&gt;Hi,&amp;nbsp;&lt;/P&gt;&lt;P&gt;I've saved a custom pyfunc and now I'm trying to load it in a pandas_udf. It works on small samples or if I repartition everything to 1 partition, but when I try to run it on a larger sample and distribute it across my cluster it fails repeatably and says the model in the dbfs cannot be found.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any ideas?&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Andrew&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 17 Dec 2024 16:21:03 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/dbfs-not-found/m-p/102409#M3856</guid>
      <dc:creator>byrnesy5</dc:creator>
      <dc:date>2024-12-17T16:21:03Z</dc:date>
    </item>
    <item>
      <title>Re: dbfs not found</title>
      <link>https://community.databricks.com/t5/machine-learning/dbfs-not-found/m-p/102412#M3857</link>
      <description>&lt;P&gt;Here you can see the progress. 15 tasks succeeded and 110 have failed (eventually the job will fail completely).&amp;nbsp; Seems like some workers can see the file and others cannot. Not sure why some would fail and others would succeed in this instance.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="byrnesy5_0-1734453715975.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/13579i26F0771696AF26FA/image-size/medium?v=v2&amp;amp;px=400" role="button" title="byrnesy5_0-1734453715975.png" alt="byrnesy5_0-1734453715975.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 17 Dec 2024 16:43:18 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/dbfs-not-found/m-p/102412#M3857</guid>
      <dc:creator>byrnesy5</dc:creator>
      <dc:date>2024-12-17T16:43:18Z</dc:date>
    </item>
    <item>
      <title>Re: dbfs not found</title>
      <link>https://community.databricks.com/t5/machine-learning/dbfs-not-found/m-p/102413#M3858</link>
      <description>&lt;P class="p1"&gt;This problem can often be attributed to the model artifacts not being available on all the executors, especially in a distributed environment.&lt;/P&gt;
&lt;P class="p1"&gt;Can you try using the dbutils.fs.refreshMounts() in your code?&lt;/P&gt;
&lt;P class="p1"&gt;If the model is small enough, broadcast it to all executors using sc.broadcast&lt;/P&gt;</description>
      <pubDate>Tue, 17 Dec 2024 16:47:16 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/dbfs-not-found/m-p/102413#M3858</guid>
      <dc:creator>Alberto_Umana</dc:creator>
      <dc:date>2024-12-17T16:47:16Z</dc:date>
    </item>
    <item>
      <title>Re: dbfs not found</title>
      <link>https://community.databricks.com/t5/machine-learning/dbfs-not-found/m-p/102415#M3859</link>
      <description>&lt;P&gt;Thanks for getting back to me. Didn't have any luck though.&amp;nbsp;&lt;/P&gt;&lt;P&gt;I tried dbutils.fs.refreshMounts() and still getting the same errors.&amp;nbsp;&lt;/P&gt;&lt;P&gt;I tried broadcasting the model, but it's not able to be pickled and broadcast.&lt;/P&gt;&lt;P&gt;Any other ideas?&lt;/P&gt;</description>
      <pubDate>Tue, 17 Dec 2024 17:58:08 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/dbfs-not-found/m-p/102415#M3859</guid>
      <dc:creator>byrnesy5</dc:creator>
      <dc:date>2024-12-17T17:58:08Z</dc:date>
    </item>
  </channel>
</rss>

