<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Mlflowexception: &amp;quot;Connection broken: ConnectionResetError(104, \\\'Connection reset by peer in Machine Learning</title>
    <link>https://community.databricks.com/t5/machine-learning/mlflowexception-quot-connection-broken-connectionreseterror-104/m-p/59834#M2988</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/99386"&gt;@cl2&lt;/a&gt;&amp;nbsp;, Thank you for writing us back!&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You can optionally configure a retry policy for your task within a Job. The retry interval is calculated in milliseconds between the start of the failed run and the subsequent retry run.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You can go through these documents for more details:&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://docs.databricks.com/en/workflows/jobs/settings.html#configure-a-retry-policy-for-a-task" target="_blank"&gt;https://docs.databricks.com/en/workflows/jobs/settings.html#configure-a-retry-policy-for-a-task&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://docs.databricks.com/en/workflows/jobs/create-run-jobs.html#:~:text=To%20optionally%20configure%20a%20retry%20policy%20for%20the%20task%2C%20click%20%2B%20Add%20next%20to%20Retries" target="_blank"&gt;https://docs.databricks.com/en/workflows/jobs/create-run-jobs.html#:~:text=To%20optionally%20configure%20a%20retry%20policy%20for%20the%20task%2C%20click%20%2B%20Add%20next%20to%20Retries&lt;/A&gt;.&lt;/P&gt;
&lt;P&gt;Please let me know if this helps and leave a like if this helps, followups are appreciated.&lt;BR /&gt;Kudos&lt;BR /&gt;Ayushi&lt;/P&gt;</description>
    <pubDate>Sat, 10 Feb 2024 12:29:55 GMT</pubDate>
    <dc:creator>Ayushi_Suthar</dc:creator>
    <dc:date>2024-02-10T12:29:55Z</dc:date>
    <item>
      <title>Mlflowexception: "Connection broken: ConnectionResetError(104, \\\'Connection reset by peer\\\')"</title>
      <link>https://community.databricks.com/t5/machine-learning/mlflowexception-quot-connection-broken-connectionreseterror-104/m-p/59373#M2958</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I have a workflow running which from time to time crashes with the error:&lt;/P&gt;&lt;LI-CODE lang="python"&gt;MlflowException: The following failures occurred while downloading one or more artifacts from models:/incubator-forecast-charging-demand-power-and-io-dk2/Production: {'python_model.pkl': 'MlflowException(\'("Connection broken: ConnectionResetError(104, \\\'Connection reset by peer\\\')", ConnectionResetError(104, \\\'Connection reset by peer\\\'))\')'}&lt;/LI-CODE&gt;&lt;P&gt;I don't really know how to interpret this exception. Further more it does not happen every day, and I have yet to find any reason to why it crashes.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any suggestion what might be wrong?&lt;/P&gt;</description>
      <pubDate>Tue, 06 Feb 2024 07:39:45 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/mlflowexception-quot-connection-broken-connectionreseterror-104/m-p/59373#M2958</guid>
      <dc:creator>cl2</dc:creator>
      <dc:date>2024-02-06T07:39:45Z</dc:date>
    </item>
    <item>
      <title>Re: Mlflowexception: "Connection broken: ConnectionResetError(104, \\\'Connection reset by peer</title>
      <link>https://community.databricks.com/t5/machine-learning/mlflowexception-quot-connection-broken-connectionreseterror-104/m-p/59647#M2974</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/99386"&gt;@cl2&lt;/a&gt;,Thanks for bringing up your concerns; always happy to help&amp;nbsp;&lt;span class="lia-unicode-emoji" title=":beaming_face_with_smiling_eyes:"&gt;😁&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Upon going through the details, it appears there was an HTTP connection error downloading artifacts. This typically shouldn’t happen, but it can occur intermittently as a transient&amp;nbsp;network issue resulting in artifactory failure.&lt;/P&gt;
&lt;P&gt;As an immediate workaround, we would recommend adding a retry around the model download logic.&amp;nbsp;&lt;SPAN&gt;We could also add a couple of retries to this logic with 1 second of sleep between the retries - time.sleep(1) to clear the issue.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Leave a like if this helps, followups are appreciated.&lt;BR /&gt;Kudos&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Ayushi&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 08 Feb 2024 05:57:11 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/mlflowexception-quot-connection-broken-connectionreseterror-104/m-p/59647#M2974</guid>
      <dc:creator>Ayushi_Suthar</dc:creator>
      <dc:date>2024-02-08T05:57:11Z</dc:date>
    </item>
    <item>
      <title>Re: Mlflowexception: "Connection broken: ConnectionResetError(104, \\\'Connection reset by peer</title>
      <link>https://community.databricks.com/t5/machine-learning/mlflowexception-quot-connection-broken-connectionreseterror-104/m-p/59781#M2982</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/76914"&gt;@Ayushi_Suthar&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;It happens 1 or 2 times a week, so quiet frequently.&lt;BR /&gt;&lt;BR /&gt;Can you elaborate a bit more on how I should integrate the retry and retries logic? If there's some examples or documentation somewhere it would be much appreciated &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 09 Feb 2024 10:34:28 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/mlflowexception-quot-connection-broken-connectionreseterror-104/m-p/59781#M2982</guid>
      <dc:creator>cl2</dc:creator>
      <dc:date>2024-02-09T10:34:28Z</dc:date>
    </item>
    <item>
      <title>Re: Mlflowexception: "Connection broken: ConnectionResetError(104, \\\'Connection reset by peer</title>
      <link>https://community.databricks.com/t5/machine-learning/mlflowexception-quot-connection-broken-connectionreseterror-104/m-p/59834#M2988</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/99386"&gt;@cl2&lt;/a&gt;&amp;nbsp;, Thank you for writing us back!&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You can optionally configure a retry policy for your task within a Job. The retry interval is calculated in milliseconds between the start of the failed run and the subsequent retry run.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You can go through these documents for more details:&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://docs.databricks.com/en/workflows/jobs/settings.html#configure-a-retry-policy-for-a-task" target="_blank"&gt;https://docs.databricks.com/en/workflows/jobs/settings.html#configure-a-retry-policy-for-a-task&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://docs.databricks.com/en/workflows/jobs/create-run-jobs.html#:~:text=To%20optionally%20configure%20a%20retry%20policy%20for%20the%20task%2C%20click%20%2B%20Add%20next%20to%20Retries" target="_blank"&gt;https://docs.databricks.com/en/workflows/jobs/create-run-jobs.html#:~:text=To%20optionally%20configure%20a%20retry%20policy%20for%20the%20task%2C%20click%20%2B%20Add%20next%20to%20Retries&lt;/A&gt;.&lt;/P&gt;
&lt;P&gt;Please let me know if this helps and leave a like if this helps, followups are appreciated.&lt;BR /&gt;Kudos&lt;BR /&gt;Ayushi&lt;/P&gt;</description>
      <pubDate>Sat, 10 Feb 2024 12:29:55 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/mlflowexception-quot-connection-broken-connectionreseterror-104/m-p/59834#M2988</guid>
      <dc:creator>Ayushi_Suthar</dc:creator>
      <dc:date>2024-02-10T12:29:55Z</dc:date>
    </item>
    <item>
      <title>Re: Mlflowexception: "Connection broken: ConnectionResetError(104, \\\'Connection reset by peer</title>
      <link>https://community.databricks.com/t5/machine-learning/mlflowexception-quot-connection-broken-connectionreseterror-104/m-p/60358#M2999</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/76914"&gt;@Ayushi_Suthar&lt;/a&gt;,&lt;BR /&gt;&lt;BR /&gt;I am using DBX to deploy the workflows - do you have some documents with details how to implement it with DBX?&lt;/P&gt;</description>
      <pubDate>Fri, 16 Feb 2024 07:25:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/mlflowexception-quot-connection-broken-connectionreseterror-104/m-p/60358#M2999</guid>
      <dc:creator>cl2</dc:creator>
      <dc:date>2024-02-16T07:25:43Z</dc:date>
    </item>
  </channel>
</rss>

