<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: databricks bundle install: Error: Maximum file size of 524288000 exceeded in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/databricks-bundle-install-error-maximum-file-size-of-524288000/m-p/125313#M47416</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/171773"&gt;@weakliemg&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;When you deploy with Databricks Asset Bundles (DAB) from your CI/CD server, the deploy tries to upload any local dependencies referenced in the bundle config, even if they’re already installed, because it treats them as local files. That’s likely why it tries to upload the large PyTorch wheel and fails.&lt;BR /&gt;To avoid this:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Upload the dependency to a workspace or Unity Catalog location and reference that path.&lt;/LI&gt;&lt;LI&gt;Use a PyPI reference in your bundle config instead of a local file.&lt;/LI&gt;&lt;/UL&gt;</description>
    <pubDate>Tue, 15 Jul 2025 13:45:31 GMT</pubDate>
    <dc:creator>SP_6721</dc:creator>
    <dc:date>2025-07-15T13:45:31Z</dc:date>
    <item>
      <title>databricks bundle install: Error: Maximum file size of 524288000 exceeded</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-bundle-install-error-maximum-file-size-of-524288000/m-p/125208#M47370</link>
      <description>&lt;P&gt;I have a job that runs some ML classification models. It uses PyTorch 2.5.0, and I've configured the project with that dependency.&lt;/P&gt;&lt;P&gt;I can deploy the job to our dev system from my laptop and all goes well. When I run the deploy from our CI/CD server, however, databricks bundle deploy appears to try to upload the dependencies, and it fails on the PyTorch wheel, which is apparently huge.&lt;/P&gt;&lt;P&gt;I shouldn't need to upload dependencies with the DAB, so why is it trying to deploy them? Run from my laptop, it doesn't try to upload the dependencies, and the deploy finishes in a few seconds.&lt;/P&gt;</description>
      <pubDate>Mon, 14 Jul 2025 21:00:45 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-bundle-install-error-maximum-file-size-of-524288000/m-p/125208#M47370</guid>
      <dc:creator>weakliemg</dc:creator>
      <dc:date>2025-07-14T21:00:45Z</dc:date>
    </item>
    <item>
      <title>Re: databricks bundle install: Error: Maximum file size of 524288000 exceeded</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-bundle-install-error-maximum-file-size-of-524288000/m-p/125313#M47416</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/171773"&gt;@weakliemg&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;When you deploy with Databricks Asset Bundles (DAB) from your CI/CD server, the deploy tries to upload any local dependencies referenced in the bundle config, even if they’re already installed, because it treats them as local files. That’s likely why it tries to upload the large PyTorch wheel and fails.&lt;BR /&gt;To avoid this:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Upload the dependency to a workspace or Unity Catalog location and reference that path.&lt;/LI&gt;&lt;LI&gt;Use a PyPI reference in your bundle config instead of a local file.&lt;/LI&gt;&lt;/UL&gt;</description>
      <pubDate>Tue, 15 Jul 2025 13:45:31 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-bundle-install-error-maximum-file-size-of-524288000/m-p/125313#M47416</guid>
      <dc:creator>SP_6721</dc:creator>
      <dc:date>2025-07-15T13:45:31Z</dc:date>
    </item>
    <item>
      <title>Re: databricks bundle install: Error: Maximum file size of 524288000 exceeded</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-bundle-install-error-maximum-file-size-of-524288000/m-p/125322#M47418</link>
      <description>&lt;P&gt;Thanks, but why doesn't this happen locally? Also, the bundle config doesn't reference torch; it's used in code and included as a dev dependency in pyproject.toml. My libraries are just this:&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;          libraries:
            - whl: ../dist/*.whl
            - pypi:
                package: pydantic&amp;gt;=2.0
            - maven:
                coordinates: io.dataflint:spark_2.12:0.4.0&lt;/LI-CODE&gt;</description>
      <pubDate>Tue, 15 Jul 2025 14:56:17 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-bundle-install-error-maximum-file-size-of-524288000/m-p/125322#M47418</guid>
      <dc:creator>weakliemg</dc:creator>
      <dc:date>2025-07-15T14:56:17Z</dc:date>
    </item>
  </channel>
</rss>

