<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Bug - task retry fails to load cluster dependencies in Get Started Discussions</title>
    <link>https://community.databricks.com/t5/get-started-discussions/bug-task-retry-fails-to-load-cluster-dependencies/m-p/37353#M5439</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I'm currently facing an issue with task retries on a structured streaming set to have unlimited retries. Our job crashes frequently due to out of memory problems, so we set the task retry limit to -1&amp;nbsp; (unlimited) as a workaround. This is also the suggested practice for production streaming jobs, according to the Databricks documentation.&lt;/P&gt;&lt;P&gt;However, quite often after a couple of crashes, the cluster fails to find the dependencies. We have specifically setup&amp;nbsp; a custom one, which loads fine on first runs. This bug does not always happen, so it's not so easy to reproduce.&lt;/P&gt;&lt;P&gt;This is the error I see:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="java"&gt;: java.lang.ClassNotFoundException: 
Failed to find data source: solacemqtt. Please find packages at
https://spark.apache.org/third-party-projects.html&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 11 Jul 2023 06:35:15 GMT</pubDate>
    <dc:creator>Paolo</dc:creator>
    <dc:date>2023-07-11T06:35:15Z</dc:date>
    <item>
      <title>Bug - task retry fails to load cluster dependencies</title>
      <link>https://community.databricks.com/t5/get-started-discussions/bug-task-retry-fails-to-load-cluster-dependencies/m-p/37353#M5439</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I'm currently facing an issue with task retries on a structured streaming set to have unlimited retries. Our job crashes frequently due to out of memory problems, so we set the task retry limit to -1&amp;nbsp; (unlimited) as a workaround. This is also the suggested practice for production streaming jobs, according to the Databricks documentation.&lt;/P&gt;&lt;P&gt;However, quite often after a couple of crashes, the cluster fails to find the dependencies. We have specifically setup&amp;nbsp; a custom one, which loads fine on first runs. This bug does not always happen, so it's not so easy to reproduce.&lt;/P&gt;&lt;P&gt;This is the error I see:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="java"&gt;: java.lang.ClassNotFoundException: 
Failed to find data source: solacemqtt. Please find packages at
https://spark.apache.org/third-party-projects.html&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jul 2023 06:35:15 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/bug-task-retry-fails-to-load-cluster-dependencies/m-p/37353#M5439</guid>
      <dc:creator>Paolo</dc:creator>
      <dc:date>2023-07-11T06:35:15Z</dc:date>
    </item>
  </channel>
</rss>

