<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Job fails on clusters only with library dependency in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/job-fails-on-clusters-only-with-library-dependency/m-p/144385#M52315</link>
    <description>&lt;P&gt;Hi!&lt;/P&gt;&lt;P&gt;Sure! As said, I&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;* set the default catalog for the workspace to the proper catalog: Workspace settings &amp;gt; Advanced &amp;gt; Default catalog for the workspace: Set to our project workspace (that was created by our admins before)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;* disabled "Legacy access": Workspace settings &amp;gt; Security &amp;gt; Disable legacy access: Disabled&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 19 Jan 2026 06:38:16 GMT</pubDate>
    <dc:creator>matmad</dc:creator>
    <dc:date>2026-01-19T06:38:16Z</dc:date>
    <item>
      <title>Job fails on clusters only with library dependency</title>
      <link>https://community.databricks.com/t5/data-engineering/job-fails-on-clusters-only-with-library-dependency/m-p/127590#M48022</link>
      <description>&lt;P&gt;Hello!&lt;/P&gt;&lt;P&gt;I have following problem: All my job runs fail when the job uses a library. Even the most basic job (print a string) and the most basic library package (no secondary dependencies, the script does not even import/use the library) fails with `Failed to reach the driver`:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="matmad_0-1754491920584.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/18771i8EAA87702A9C6650/image-size/medium?v=v2&amp;amp;px=400" role="button" title="matmad_0-1754491920584.png" alt="matmad_0-1754491920584.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;* All my libraries are python wheels&lt;BR /&gt;* I use `spark_python_task` (but also tested `python_wheel_task` with same error)&lt;BR /&gt;* If I use serverless (same script, same .whl), all works fine&lt;BR /&gt;* If I remove the package from the job's library section, all works fine (as said: I don't even import/use the library)&lt;BR /&gt;* I also used a different python wheel package and created a wheel using&amp;nbsp;&lt;A href="https://docs.databricks.com/aws/en/jobs/how-to/use-python-wheels-in-workflows#step-6-run-the-job-and-view-the-job-run-details" target="_blank"&gt;https://docs.databricks.com/aws/en/jobs/how-to/use-python-wheels-in-workflows#step-6-run-the-job-and-view-the-job-run-details&lt;/A&gt;&lt;BR /&gt;* It doesn't matter if I configure the job via yml in asset bundle or "manually" in the UI&lt;BR /&gt;&lt;BR /&gt;The logs of the clusters don't really help me.&lt;/P&gt;&lt;P&gt;I really appreciate your ideas - thank you!&lt;BR /&gt;&lt;BR /&gt;The script:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="matmad_1-1754492300025.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/18772i1769A05F944215EF/image-size/medium?v=v2&amp;amp;px=400" role="button" title="matmad_1-1754492300025.png" alt="matmad_1-1754492300025.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;The job yml:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="matmad_2-1754492381339.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/18773i51255F411B56023B/image-size/medium?v=v2&amp;amp;px=400" role="button" title="matmad_2-1754492381339.png" alt="matmad_2-1754492381339.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 06 Aug 2025 15:01:19 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/job-fails-on-clusters-only-with-library-dependency/m-p/127590#M48022</guid>
      <dc:creator>matmad</dc:creator>
      <dc:date>2025-08-06T15:01:19Z</dc:date>
    </item>
    <item>
      <title>Re: Job fails on clusters only with library dependency</title>
      <link>https://community.databricks.com/t5/data-engineering/job-fails-on-clusters-only-with-library-dependency/m-p/127592#M48023</link>
      <description>&lt;P&gt;Maybe worth mentioning: If I install the library in a python notebook using&lt;BR /&gt;%pip install /Workspace/Shared/code/my_package-0.1-py3-none-any.whl&lt;BR /&gt;all works fine.&lt;/P&gt;</description>
      <pubDate>Wed, 06 Aug 2025 15:04:00 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/job-fails-on-clusters-only-with-library-dependency/m-p/127592#M48023</guid>
      <dc:creator>matmad</dc:creator>
      <dc:date>2025-08-06T15:04:00Z</dc:date>
    </item>
    <item>
      <title>Re: Job fails on clusters only with library dependency</title>
      <link>https://community.databricks.com/t5/data-engineering/job-fails-on-clusters-only-with-library-dependency/m-p/127751#M48071</link>
      <description>&lt;P&gt;My current workaround (I'm surprised that this works) is to install the library via "pypi" (actually: an internal pypi mirror using artifactory) instead of via the .whl file. I would still be interested in a reason of and a solution to the problem, though.&amp;nbsp;&lt;BR /&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Fri, 08 Aug 2025 04:23:08 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/job-fails-on-clusters-only-with-library-dependency/m-p/127751#M48071</guid>
      <dc:creator>matmad</dc:creator>
      <dc:date>2025-08-08T04:23:08Z</dc:date>
    </item>
    <item>
      <title>Re: Job fails on clusters only with library dependency</title>
      <link>https://community.databricks.com/t5/data-engineering/job-fails-on-clusters-only-with-library-dependency/m-p/128129#M48166</link>
      <description>&lt;P&gt;I think I found a (the?) solution. The cluster tried to connect to the legacy Hive Catalog, so I&lt;/P&gt;&lt;P&gt;* set the default catalog for the workspace to the proper catalog&lt;BR /&gt;* disabled "Legacy access"&lt;/P&gt;&lt;P&gt;These steps solve my `DriverError`. This log4j error message gave the hint: `&lt;SPAN&gt;Caused by: com.zaxxer.hikari.pool.HikariPool$PoolInitializationException: Failed to initialize pool: Could not connect to address=(host=consolidated-westeuropec2-prod-metastore-3.mysql.database.azure.com)(port=3306)(type=master) : Socket fail to connect to host:consolidated-westeuropec2-prod-metastore-3.mysql.database.azure.com, port:3306. Connect timed out`&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;I still don't know why installing the wheel made such problems, but I consider my problem now as solved.&lt;/P&gt;</description>
      <pubDate>Tue, 12 Aug 2025 07:08:59 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/job-fails-on-clusters-only-with-library-dependency/m-p/128129#M48166</guid>
      <dc:creator>matmad</dc:creator>
      <dc:date>2025-08-12T07:08:59Z</dc:date>
    </item>
    <item>
      <title>Re: Job fails on clusters only with library dependency</title>
      <link>https://community.databricks.com/t5/data-engineering/job-fails-on-clusters-only-with-library-dependency/m-p/144363#M52313</link>
      <description>&lt;P&gt;Hi, can you please share detailed soluton, did you include any config in&amp;nbsp;&lt;SPAN&gt;databricks.yml? I'm also having same issue.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 18 Jan 2026 19:45:40 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/job-fails-on-clusters-only-with-library-dependency/m-p/144363#M52313</guid>
      <dc:creator>gopal2026</dc:creator>
      <dc:date>2026-01-18T19:45:40Z</dc:date>
    </item>
    <item>
      <title>Re: Job fails on clusters only with library dependency</title>
      <link>https://community.databricks.com/t5/data-engineering/job-fails-on-clusters-only-with-library-dependency/m-p/144385#M52315</link>
      <description>&lt;P&gt;Hi!&lt;/P&gt;&lt;P&gt;Sure! As said, I&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;* set the default catalog for the workspace to the proper catalog: Workspace settings &amp;gt; Advanced &amp;gt; Default catalog for the workspace: Set to our project workspace (that was created by our admins before)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;* disabled "Legacy access": Workspace settings &amp;gt; Security &amp;gt; Disable legacy access: Disabled&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 19 Jan 2026 06:38:16 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/job-fails-on-clusters-only-with-library-dependency/m-p/144385#M52315</guid>
      <dc:creator>matmad</dc:creator>
      <dc:date>2026-01-19T06:38:16Z</dc:date>
    </item>
  </channel>
</rss>

