<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Cluster install of Python libraries versus notebook install in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/cluster-install-of-python-libraries-versus-notebook-install/m-p/109041#M43214</link>
    <description>&lt;P&gt;When a base set of libraries is installed on a cluster, can indeed conflict with a notebook submitted to the cluster that defines a conflicting set of libraries for installation. This is because the libraries installed at the cluster level take precedence over those specified in the notebook.&amp;nbsp;&lt;/P&gt;
&lt;P class="_1t7bu9h1 paragraph"&gt;&lt;SPAN&gt;If you need to use custom Python libraries or modules, the recommended approach is to install the required libraries directly to the cluster using the cluster library installation feature. This ensures that the libraries are available to all notebooks running on the cluster.&lt;/SPAN&gt;&lt;/P&gt;
&lt;DIV class="_1sijkvt0"&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;DIV class="_1sijkvt0"&gt;About your pythonpath variable, please see;&lt;/DIV&gt;
&lt;P&gt;&lt;A href="https://kb.databricks.com/en_US/clusters/cannot-set-a-custom-pythonpath" target="_blank"&gt;https://kb.databricks.com/en_US/clusters/cannot-set-a-custom-pythonpath&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 06 Feb 2025 01:12:06 GMT</pubDate>
    <dc:creator>Alberto_Umana</dc:creator>
    <dc:date>2025-02-06T01:12:06Z</dc:date>
    <item>
      <title>Cluster install of Python libraries versus notebook install</title>
      <link>https://community.databricks.com/t5/data-engineering/cluster-install-of-python-libraries-versus-notebook-install/m-p/109034#M43211</link>
      <description>&lt;P&gt;If a base set of libraries is installed on the cluster and pinned to a specific version, can/would this conflict with a notebook submitted to the cluster that defines a conflicting set of libraries for install?&lt;BR /&gt;Is there a way to override the cluster python env via modifying the PYTHONPATH or some other Databricks variable in these cases or some order of precedence?&lt;/P&gt;</description>
      <pubDate>Wed, 05 Feb 2025 23:56:13 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/cluster-install-of-python-libraries-versus-notebook-install/m-p/109034#M43211</guid>
      <dc:creator>lmorrissey</dc:creator>
      <dc:date>2025-02-05T23:56:13Z</dc:date>
    </item>
    <item>
      <title>Re: Cluster install of Python libraries versus notebook install</title>
      <link>https://community.databricks.com/t5/data-engineering/cluster-install-of-python-libraries-versus-notebook-install/m-p/109041#M43214</link>
      <description>&lt;P&gt;When a base set of libraries is installed on a cluster, can indeed conflict with a notebook submitted to the cluster that defines a conflicting set of libraries for installation. This is because the libraries installed at the cluster level take precedence over those specified in the notebook.&amp;nbsp;&lt;/P&gt;
&lt;P class="_1t7bu9h1 paragraph"&gt;&lt;SPAN&gt;If you need to use custom Python libraries or modules, the recommended approach is to install the required libraries directly to the cluster using the cluster library installation feature. This ensures that the libraries are available to all notebooks running on the cluster.&lt;/SPAN&gt;&lt;/P&gt;
&lt;DIV class="_1sijkvt0"&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;DIV class="_1sijkvt0"&gt;About your pythonpath variable, please see;&lt;/DIV&gt;
&lt;P&gt;&lt;A href="https://kb.databricks.com/en_US/clusters/cannot-set-a-custom-pythonpath" target="_blank"&gt;https://kb.databricks.com/en_US/clusters/cannot-set-a-custom-pythonpath&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 06 Feb 2025 01:12:06 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/cluster-install-of-python-libraries-versus-notebook-install/m-p/109041#M43214</guid>
      <dc:creator>Alberto_Umana</dc:creator>
      <dc:date>2025-02-06T01:12:06Z</dc:date>
    </item>
  </channel>
</rss>

