<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Init Script Fails Intermittently on Workflow Job in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/init-script-fails-intermittently-on-workflow-job/m-p/73801#M34670</link>
    <description>&lt;P&gt;An init script is used to install system libraries, per below.&lt;/P&gt;&lt;P&gt;Adding the script to a Personal Compute consistently works. The same script is added to a Workflows job via cluster config, which intermittently fails, as shown in error message below.&lt;/P&gt;&lt;P&gt;Both Personal and Workflow clusters are on 14.3 LTS runtime; surprised with the instability of the latter.&lt;/P&gt;&lt;P&gt;Any troubleshooting advice is appreciated.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Init Script&lt;/STRONG&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;EM&gt;#!/bin/bash&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;set -euxo pipefail&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;if [[ $DB_IS_DRIVER = "TRUE" ]]; then&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;&amp;nbsp; sudo apt-get -y update &amp;amp;&amp;amp; apt-get install -y libudunits2-dev libgdal-dev libgeos-dev libproj-dev&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;fi&lt;/EM&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;&lt;STRONG&gt;Error&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="leungi_0-1718291897408.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/8533i0BFE941ABD4960E0/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400" role="button" title="leungi_0-1718291897408.png" alt="leungi_0-1718291897408.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Thu, 13 Jun 2024 15:41:39 GMT</pubDate>
    <dc:creator>leungi</dc:creator>
    <dc:date>2024-06-13T15:41:39Z</dc:date>
    <item>
      <title>Init Script Fails Intermittently on Workflow Job</title>
      <link>https://community.databricks.com/t5/data-engineering/init-script-fails-intermittently-on-workflow-job/m-p/73801#M34670</link>
      <description>&lt;P&gt;An init script is used to install system libraries, per below.&lt;/P&gt;&lt;P&gt;Adding the script to a Personal Compute consistently works. The same script is added to a Workflows job via cluster config, which intermittently fails, as shown in error message below.&lt;/P&gt;&lt;P&gt;Both Personal and Workflow clusters are on 14.3 LTS runtime; surprised with the instability of the latter.&lt;/P&gt;&lt;P&gt;Any troubleshooting advice is appreciated.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Init Script&lt;/STRONG&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;EM&gt;#!/bin/bash&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;set -euxo pipefail&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;if [[ $DB_IS_DRIVER = "TRUE" ]]; then&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;&amp;nbsp; sudo apt-get -y update &amp;amp;&amp;amp; apt-get install -y libudunits2-dev libgdal-dev libgeos-dev libproj-dev&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;fi&lt;/EM&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;&lt;STRONG&gt;Error&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="leungi_0-1718291897408.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/8533i0BFE941ABD4960E0/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400" role="button" title="leungi_0-1718291897408.png" alt="leungi_0-1718291897408.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 13 Jun 2024 15:41:39 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/init-script-fails-intermittently-on-workflow-job/m-p/73801#M34670</guid>
      <dc:creator>leungi</dc:creator>
      <dc:date>2024-06-13T15:41:39Z</dc:date>
    </item>
    <item>
      <title>Re: Init Script Fails Intermittently on Workflow Job</title>
      <link>https://community.databricks.com/t5/data-engineering/init-script-fails-intermittently-on-workflow-job/m-p/76811#M35332</link>
      <description>&lt;P&gt;Check the cluster event log to see if there is a clue why the script is failing. if the script failed and returned none zero status the cluster wont start&lt;/P&gt;</description>
      <pubDate>Thu, 04 Jul 2024 18:34:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/init-script-fails-intermittently-on-workflow-job/m-p/76811#M35332</guid>
      <dc:creator>amr</dc:creator>
      <dc:date>2024-07-04T18:34:09Z</dc:date>
    </item>
    <item>
      <title>Re: Init Script Fails Intermittently on Workflow Job</title>
      <link>https://community.databricks.com/t5/data-engineering/init-script-fails-intermittently-on-workflow-job/m-p/77163#M35410</link>
      <description>&lt;P&gt;Thanks for the suggestion&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/770"&gt;@amr&lt;/a&gt;.&lt;/P&gt;&lt;P&gt;Courtesy of a DBX solution engineer, the key was to remove all the files in the &lt;EM&gt;/var/lib/apt/lists/&lt;/EM&gt; directory to force &lt;EM&gt;apt&lt;/EM&gt; to download fresh package lists during subsequent update.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Init Script&lt;/STRONG&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;EM&gt;#!/bin/bash&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;set -euxo pipefail&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;if [[ $DB_IS_DRIVER = "TRUE" ]]; then&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;&amp;nbsp; # --- Clear cache&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;EM&gt;&amp;nbsp; rm -r /var/cache/apt/archives/* /var/lib/apt/lists/* &lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;&amp;nbsp; sudo apt-get clean -y&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;&amp;nbsp; sudo apt-get update -y&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;&amp;nbsp; #---&lt;/EM&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;&amp;nbsp; sudo apt-get -y update &amp;amp;&amp;amp; apt-get install -y libudunits2-dev libgdal-dev libgeos-dev libproj-dev&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;fi&lt;/EM&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Mon, 08 Jul 2024 14:58:25 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/init-script-fails-intermittently-on-workflow-job/m-p/77163#M35410</guid>
      <dc:creator>leungi</dc:creator>
      <dc:date>2024-07-08T14:58:25Z</dc:date>
    </item>
  </channel>
</rss>

