<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Use init script for Databricks job cluster via Azure Data Factory in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/use-init-script-for-databricks-job-cluster-via-azure-data/m-p/110703#M43651</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/106294"&gt;@Alberto_Umana&lt;/a&gt;&amp;nbsp;, do you have a solution for such issue ?&amp;nbsp;&lt;BR /&gt;Thanks a lot for your help,&lt;BR /&gt;Sacha&lt;/P&gt;</description>
    <pubDate>Thu, 20 Feb 2025 08:28:55 GMT</pubDate>
    <dc:creator>sachamourier</dc:creator>
    <dc:date>2025-02-20T08:28:55Z</dc:date>
    <item>
      <title>Use init script for Databricks job cluster via Azure Data Factory</title>
      <link>https://community.databricks.com/t5/data-engineering/use-init-script-for-databricks-job-cluster-via-azure-data/m-p/110049#M43472</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I would like to install some libraries (both public and private) on a job cluster. I am using Azure Data Factory to run my Databricks notebooks and hence would like to use job clusters to run these jobs.&lt;/P&gt;&lt;P&gt;I have passed my init script to the job cluster but sometimes the package installs work, sometimes not, with no real pattern. The workspace paths to my packages well exist and are correctly set up.&amp;nbsp;&lt;/P&gt;&lt;P&gt;What's wrong ? Is there anything I should check ? Is there another more robust way to do it so that it always works? Since it's not robust, sometimes my libraries are well installed on my job cluster, sometimes not.&lt;BR /&gt;&lt;BR /&gt;I have attached the configuration I am using in Azure Data Factory to use my init script, and also a screenshot of what my init script looks like.&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="adf config for init script" style="width: 621px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/14854i6CD0A46C6B03E8E5/image-size/large?v=v2&amp;amp;px=999" role="button" title="adf_init_script_config.png" alt="adf config for init script" /&gt;&lt;span class="lia-inline-image-caption" onclick="event.preventDefault();"&gt;adf config for init script&lt;/span&gt;&lt;/span&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="init script" style="width: 999px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/14855i4C97CE6FE2B0EAA9/image-size/large?v=v2&amp;amp;px=999" role="button" title="init_script.png" alt="init script" /&gt;&lt;span class="lia-inline-image-caption" onclick="event.preventDefault();"&gt;init script&lt;/span&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Thank you very much in advance for the help,&lt;/P&gt;&lt;P&gt;Sacha&lt;/P&gt;</description>
      <pubDate>Wed, 12 Feb 2025 21:45:02 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/use-init-script-for-databricks-job-cluster-via-azure-data/m-p/110049#M43472</guid>
      <dc:creator>sachamourier</dc:creator>
      <dc:date>2025-02-12T21:45:02Z</dc:date>
    </item>
    <item>
      <title>Re: Use init script for Databricks job cluster via Azure Data Factory</title>
      <link>https://community.databricks.com/t5/data-engineering/use-init-script-for-databricks-job-cluster-via-azure-data/m-p/110065#M43476</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/122091"&gt;@sachamourier&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;What is the failure that it gives you when the init script fails?&lt;/P&gt;</description>
      <pubDate>Thu, 13 Feb 2025 02:46:20 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/use-init-script-for-databricks-job-cluster-via-azure-data/m-p/110065#M43476</guid>
      <dc:creator>Alberto_Umana</dc:creator>
      <dc:date>2025-02-13T02:46:20Z</dc:date>
    </item>
    <item>
      <title>Re: Use init script for Databricks job cluster via Azure Data Factory</title>
      <link>https://community.databricks.com/t5/data-engineering/use-init-script-for-databricks-job-cluster-via-azure-data/m-p/110075#M43481</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/106294"&gt;@Alberto_Umana&lt;/a&gt;&amp;nbsp;, I don’t get any failure. When my notebook gets run on the newly created job cluster, my package imports fail as they have not been installed on my cluster.&amp;nbsp;&lt;/P&gt;&lt;P&gt;As you can see on the attached images, it looks like it's searching or finding my init script though. Is there another way to do it otherwise ?&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="init script finished JSON" style="width: 999px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/14864i54DDC088DC313FEC/image-size/large?v=v2&amp;amp;px=999" role="button" title="Screenshot 2025-02-13 065425.png" alt="init script finished JSON" /&gt;&lt;span class="lia-inline-image-caption" onclick="event.preventDefault();"&gt;init script finished JSON&lt;/span&gt;&lt;/span&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="imports issue" style="width: 717px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/14863iAE982AABED3E5743/image-size/large?v=v2&amp;amp;px=999" role="button" title="Screenshot 2025-02-13 065222.png" alt="imports issue" /&gt;&lt;span class="lia-inline-image-caption" onclick="event.preventDefault();"&gt;imports issue&lt;/span&gt;&lt;/span&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="job cluster event log" style="width: 999px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/14862i9F25D25293D21C48/image-size/large?v=v2&amp;amp;px=999" role="button" title="Screenshot 2025-02-13 065155.png" alt="job cluster event log" /&gt;&lt;span class="lia-inline-image-caption" onclick="event.preventDefault();"&gt;job cluster event log&lt;/span&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 13 Feb 2025 05:56:12 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/use-init-script-for-databricks-job-cluster-via-azure-data/m-p/110075#M43481</guid>
      <dc:creator>sachamourier</dc:creator>
      <dc:date>2025-02-13T05:56:12Z</dc:date>
    </item>
    <item>
      <title>Re: Use init script for Databricks job cluster via Azure Data Factory</title>
      <link>https://community.databricks.com/t5/data-engineering/use-init-script-for-databricks-job-cluster-via-azure-data/m-p/110703#M43651</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/106294"&gt;@Alberto_Umana&lt;/a&gt;&amp;nbsp;, do you have a solution for such issue ?&amp;nbsp;&lt;BR /&gt;Thanks a lot for your help,&lt;BR /&gt;Sacha&lt;/P&gt;</description>
      <pubDate>Thu, 20 Feb 2025 08:28:55 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/use-init-script-for-databricks-job-cluster-via-azure-data/m-p/110703#M43651</guid>
      <dc:creator>sachamourier</dc:creator>
      <dc:date>2025-02-20T08:28:55Z</dc:date>
    </item>
    <item>
      <title>Re: Use init script for Databricks job cluster via Azure Data Factory</title>
      <link>https://community.databricks.com/t5/data-engineering/use-init-script-for-databricks-job-cluster-via-azure-data/m-p/110744#M43669</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/122091"&gt;@sachamourier&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;Have you considered using cluster libraries? The behavior you are observing you require additional debugging since init script is installed successfully, can you enable cluster logging and research through the logs:&amp;nbsp;&lt;A href="https://docs.databricks.com/aws/en/compute/configure#compute-log-delivery" target="_blank"&gt;https://docs.databricks.com/aws/en/compute/configure#compute-log-delivery&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;Also as a test can you run the init via a notebook to ensure it works fine?&lt;/P&gt;</description>
      <pubDate>Thu, 20 Feb 2025 13:01:02 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/use-init-script-for-databricks-job-cluster-via-azure-data/m-p/110744#M43669</guid>
      <dc:creator>Alberto_Umana</dc:creator>
      <dc:date>2025-02-20T13:01:02Z</dc:date>
    </item>
  </channel>
</rss>

