<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Databricks Default package repositories in Administration &amp; Architecture</title>
    <link>https://community.databricks.com/t5/administration-architecture/databricks-default-package-repositories/m-p/133042#M4100</link>
    <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/63366"&gt;@tariq&lt;/a&gt;!&lt;/P&gt;
&lt;P&gt;Did the suggestions shared above help address your issue? If so, please consider marking one of the responses as the accepted solution. If you found a different approach that worked for you, sharing it with the community would be really helpful.&lt;/P&gt;</description>
    <pubDate>Thu, 25 Sep 2025 15:18:28 GMT</pubDate>
    <dc:creator>Advika_</dc:creator>
    <dc:date>2025-09-25T15:18:28Z</dc:date>
    <item>
      <title>Databricks Default package repositories</title>
      <link>https://community.databricks.com/t5/administration-architecture/databricks-default-package-repositories/m-p/131495#M4005</link>
      <description>&lt;P&gt;I have added an extra-index-url in the default package repository in databricks which points to a repository in azure artifact. The libraries from it are getting installed on job cluster but is not working on the all purpose cluster. Below is the relevant config for the all purpose cluster:&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"cluster_id"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;"some-id"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"data_security_mode"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;"USER_ISOLATION"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"effective_spark_version"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;"16.4.x-scala2.12"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"release_version"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;"16.4.8"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"autoscale"&lt;/SPAN&gt;&lt;SPAN&gt;: {&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"min_workers"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;1&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"max_workers"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;15&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"target_workers"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;1&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; },&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"init_scripts_safe_mode"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;false&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"spec"&lt;/SPAN&gt;&lt;SPAN&gt;: {&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"cluster_name"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;"cluster_name"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"spark_version"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;"16.4.x-scala2.12"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"autotermination_minutes"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;15&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"instance_pool_id"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;"some-pool"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"driver_instance_pool_id"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;"some-pool"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"data_security_mode"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;"USER_ISOLATION"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"autoscale"&lt;/SPAN&gt;&lt;SPAN&gt;: {&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"min_workers"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;1&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"max_workers"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;15&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; },&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"apply_policy_default_values"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;false&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; }&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Wed, 10 Sep 2025 08:20:41 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/databricks-default-package-repositories/m-p/131495#M4005</guid>
      <dc:creator>tariq</dc:creator>
      <dc:date>2025-09-10T08:20:41Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Default package repositories</title>
      <link>https://community.databricks.com/t5/administration-architecture/databricks-default-package-repositories/m-p/132000#M4043</link>
      <description>&lt;P&gt;Hola&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/63366"&gt;@tariq&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;My recomendation is create a init_script and attach it to the all-porpouse cluster.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;#!/bin/bash

echo "[global]
index-url = https://&amp;lt;user&amp;gt;:&amp;lt;token&amp;gt;@pkgs.dev.azure.com/&amp;lt;org&amp;gt;/&amp;lt;project&amp;gt;/_packaging/&amp;lt;feed&amp;gt;/pypi/simple/
extra-index-url = https://pypi.org/simple
trusted-host = pkgs.dev.azure.com
" &amp;gt; /etc/pip.conf&lt;/LI-CODE&gt;&lt;P&gt;You can upload it to your cloud storage and must add it under&lt;STRONG&gt; Catalog&amp;gt;Metastore&amp;gt;Allowed JARs/Init Scripts&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/STRONG&gt;Hope this helps &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;Isi&lt;/P&gt;</description>
      <pubDate>Mon, 15 Sep 2025 14:30:44 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/databricks-default-package-repositories/m-p/132000#M4043</guid>
      <dc:creator>Isi</dc:creator>
      <dc:date>2025-09-15T14:30:44Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Default package repositories</title>
      <link>https://community.databricks.com/t5/administration-architecture/databricks-default-package-repositories/m-p/132226#M4055</link>
      <description>&lt;P class="p1"&gt;Greeting &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/63366"&gt;@tariq&lt;/a&gt;&amp;nbsp;, this is a great question (thank you &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/145555"&gt;@Isi&lt;/a&gt;&amp;nbsp;for raising the suggestion). I looked into our internal documentation, and it turns out that it is &lt;STRONG&gt;not recommended&lt;/STRONG&gt;&amp;nbsp;to install libraries cluster-wide on "All Purpose" compute in `USER_ISOLATION` mode. Databricks enforces strict separation between users—including how libraries are installed, loaded, and how environment variables are managed.&amp;nbsp;&lt;/P&gt;
&lt;P class="p1"&gt;&lt;STRONG&gt;Key Points to Consider&lt;/STRONG&gt;&lt;/P&gt;
&lt;P class="p1"&gt;- USER_ISOLATION clusters strictly restrict cross-user contamination. Init scripts and global environment variables are not always passed into the per-user, per-notebook Python context.&amp;nbsp;&lt;/P&gt;
&lt;P class="p1"&gt;- Cluster-scoped installations (e.g., via init scripts with 'pip install', or through the cluster “Libraries” UI) often do not work as expected in notebook sessions under USER_ISOLATION.&amp;nbsp;&lt;/P&gt;
&lt;P class="p1"&gt;- Instead, &lt;STRONG&gt;per-user '%pip' installs are isolated and recommended.&lt;/STRONG&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="p1"&gt;&lt;STRONG&gt;Recommended Approach&lt;/STRONG&gt;&lt;/P&gt;
&lt;P class="p1"&gt;&lt;STRONG&gt;Install within Notebook Sessions&lt;/STRONG&gt;&lt;/P&gt;
&lt;P class="p1"&gt;```python&lt;/P&gt;
&lt;P class="p1"&gt;%pip install --extra-index-url &amp;lt;azure-artifact-repo-url&amp;gt; package-name&lt;/P&gt;
&lt;P class="p1"&gt;```&lt;/P&gt;
&lt;P class="p1"&gt;Run this directly in your own notebook. This ensures the package is installed in your user session and correctly respects isolation and credentials.&amp;nbsp;&lt;/P&gt;
&lt;P class="p1"&gt;&lt;STRONG&gt;If You Must Use Cluster Init Scripts&lt;/STRONG&gt;&lt;/P&gt;
&lt;P class="p1"&gt;If pre-installation cluster-wide is absolutely necessary, you can test with an init script that explicitly uses the Python executable for notebook sessions. For example:&amp;nbsp;&lt;/P&gt;
&lt;P class="p1"&gt;```bash&lt;/P&gt;
&lt;P class="p1"&gt;#!/bin/bash&lt;/P&gt;
&lt;P class="p1"&gt;/databricks/python/bin/pip install --extra-index-url &amp;lt;azure-artifact-repo-url&amp;gt; &amp;lt;package-name&amp;gt;&lt;/P&gt;
&lt;P class="p1"&gt;```&lt;/P&gt;
&lt;P class="p1"&gt;- Save this script in workspace storage or a mounted volume.&amp;nbsp;&lt;/P&gt;
&lt;P class="p1"&gt;- Add it to the `init_scripts` section of the cluster configuration.&amp;nbsp;&lt;/P&gt;
&lt;P class="p1"&gt;Afterwards, verify installation:&amp;nbsp;&lt;/P&gt;
&lt;P class="p1"&gt;```python&lt;/P&gt;
&lt;P class="p1"&gt;import sys&lt;/P&gt;
&lt;P class="p1"&gt;print(sys.executable)&lt;/P&gt;
&lt;P class="p1"&gt;!pip list&lt;/P&gt;
&lt;P class="p1"&gt;```&lt;/P&gt;
&lt;P class="p1"&gt;Keep in mind: due to `USER_ISOLATION` boundaries, even init scripts may not guarantee availability across all user sessions. Installing with `%pip` inside each notebook is usually more reliable.&amp;nbsp;&lt;/P&gt;
&lt;P class="p1"&gt;&lt;STRONG&gt;In Short&lt;/STRONG&gt;&lt;/P&gt;
&lt;P class="p1"&gt;On all-purpose clusters with `USER_ISOLATION`, use `%pip install` with your extra index URL directly in your notebook, and ensure authentication is set for each user session. Init scripts are possible but less reliable for propagating libraries across users.&amp;nbsp;&lt;/P&gt;
&lt;P class="p1"&gt;Hope this helps point you in the right direction!&amp;nbsp;&lt;/P&gt;
&lt;P class="p1"&gt;Cheers, Louis.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 17 Sep 2025 11:09:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/databricks-default-package-repositories/m-p/132226#M4055</guid>
      <dc:creator>Louis_Frolio</dc:creator>
      <dc:date>2025-09-17T11:09:43Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Default package repositories</title>
      <link>https://community.databricks.com/t5/administration-architecture/databricks-default-package-repositories/m-p/132325#M4062</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/34815"&gt;@Louis_Frolio&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P class=""&gt;I understand Databricks’ best practices, but in my experience, for libraries that are already present in the cluster runtime (e.g pydantic), I haven’t been able to make &lt;SPAN class=""&gt;%pip install&lt;/SPAN&gt; consistently overwrite the preinstalled version. The only reliable way I’ve found is by installing them through the &lt;SPAN class=""&gt;&lt;STRONG&gt;cluster Libraries UI&lt;/STRONG&gt;&lt;/SPAN&gt;—otherwise, when I run &lt;SPAN class=""&gt;%pip install&lt;/SPAN&gt;, the version I specify doesn’t seem to take effect because of how the environment isolation works.&lt;/P&gt;&lt;P class=""&gt;&amp;nbsp;&lt;/P&gt;&lt;P class=""&gt;That said, when working directly at the cluster level, I &lt;I&gt;have&lt;/I&gt; been able to get it to work. I understand the point about &lt;SPAN class=""&gt;USER_ISOLATION&lt;/SPAN&gt;, but when you face restrictions (e.g., ML runtimes or other special environments), sometimes you have to rely on these “tricks” to get things working.&lt;/P&gt;&lt;P class=""&gt;&amp;nbsp;&lt;/P&gt;&lt;P class=""&gt;Still, I really appreciate your explanation. It would be great if Databricks could put more emphasis on making this kind of internal documentation available more openly, since it would save users a lot of trial and error.&lt;BR /&gt;&lt;BR /&gt;Thanks,&lt;BR /&gt;Isi&lt;/P&gt;</description>
      <pubDate>Wed, 17 Sep 2025 19:21:34 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/databricks-default-package-repositories/m-p/132325#M4062</guid>
      <dc:creator>Isi</dc:creator>
      <dc:date>2025-09-17T19:21:34Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Default package repositories</title>
      <link>https://community.databricks.com/t5/administration-architecture/databricks-default-package-repositories/m-p/133042#M4100</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/63366"&gt;@tariq&lt;/a&gt;!&lt;/P&gt;
&lt;P&gt;Did the suggestions shared above help address your issue? If so, please consider marking one of the responses as the accepted solution. If you found a different approach that worked for you, sharing it with the community would be really helpful.&lt;/P&gt;</description>
      <pubDate>Thu, 25 Sep 2025 15:18:28 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/databricks-default-package-repositories/m-p/133042#M4100</guid>
      <dc:creator>Advika_</dc:creator>
      <dc:date>2025-09-25T15:18:28Z</dc:date>
    </item>
  </channel>
</rss>

