<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Asset Bundles - Workspace or GIT? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/asset-bundles-workspace-or-git/m-p/89545#M37850</link>
    <description>&lt;P&gt;We are just starting down the path of migrating from DBX to DAB.&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have been able to successfully use DAB as per all the available documentation.&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We are very keen to use DAB for development deployments by the data engineering team and the benefits it will bring are many.&lt;/P&gt;&lt;P&gt;One thing I'm not clear on is the best practice of where and how to configure the source for Notebook tasks once we move beyond development.&lt;/P&gt;&lt;P&gt;With DBX we deployed all Jobs (300+) with a GIT source pointing to a specific release TAG, which gave us confidence that was being executed was what we expected it to be and nobody could accidently/intentionally edit the source on the Workspace.&lt;/P&gt;&lt;P&gt;Your Notebook Best Practices still indicates the use of GIT source:&amp;nbsp;&lt;A href="https://docs.databricks.com/en/notebooks/best-practices.html" target="_blank" rel="noopener"&gt;https://docs.databricks.com/en/notebooks/best-practices.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Should we be trying to override the notebook source when deploying to Production to use a GIT source?&amp;nbsp;&lt;/P&gt;&lt;P&gt;If so how can we easily do this without having to duplicate config for 300+ jobs? (i.e. every job we deploy needs to be at the same GIT tag).&lt;/P&gt;&lt;P&gt;If not, what steps can we take to have the same level of confidence as when configured to use GIT tags?&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Thu, 12 Sep 2024 06:18:17 GMT</pubDate>
    <dc:creator>JeremyFord</dc:creator>
    <dc:date>2024-09-12T06:18:17Z</dc:date>
    <item>
      <title>Asset Bundles - Workspace or GIT?</title>
      <link>https://community.databricks.com/t5/data-engineering/asset-bundles-workspace-or-git/m-p/89545#M37850</link>
      <description>&lt;P&gt;We are just starting down the path of migrating from DBX to DAB.&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have been able to successfully use DAB as per all the available documentation.&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We are very keen to use DAB for development deployments by the data engineering team and the benefits it will bring are many.&lt;/P&gt;&lt;P&gt;One thing I'm not clear on is the best practice of where and how to configure the source for Notebook tasks once we move beyond development.&lt;/P&gt;&lt;P&gt;With DBX we deployed all Jobs (300+) with a GIT source pointing to a specific release TAG, which gave us confidence that was being executed was what we expected it to be and nobody could accidently/intentionally edit the source on the Workspace.&lt;/P&gt;&lt;P&gt;Your Notebook Best Practices still indicates the use of GIT source:&amp;nbsp;&lt;A href="https://docs.databricks.com/en/notebooks/best-practices.html" target="_blank" rel="noopener"&gt;https://docs.databricks.com/en/notebooks/best-practices.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Should we be trying to override the notebook source when deploying to Production to use a GIT source?&amp;nbsp;&lt;/P&gt;&lt;P&gt;If so how can we easily do this without having to duplicate config for 300+ jobs? (i.e. every job we deploy needs to be at the same GIT tag).&lt;/P&gt;&lt;P&gt;If not, what steps can we take to have the same level of confidence as when configured to use GIT tags?&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 12 Sep 2024 06:18:17 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/asset-bundles-workspace-or-git/m-p/89545#M37850</guid>
      <dc:creator>JeremyFord</dc:creator>
      <dc:date>2024-09-12T06:18:17Z</dc:date>
    </item>
    <item>
      <title>Re: Asset Bundles - Workspace or GIT?</title>
      <link>https://community.databricks.com/t5/data-engineering/asset-bundles-workspace-or-git/m-p/89568#M37857</link>
      <description>&lt;P&gt;Hello!&lt;/P&gt;&lt;P&gt;Migrating from DBX to DAB is an exciting step! For configuring the source for Notebook tasks beyond development, it’s best practice to use a GIT source for production deployments to ensure consistency and prevent accidental changes. To avoid duplicating configuration for 300+ jobs, you can use a centralized configuration management system, environment variables, and automation scripts. Centralized configuration allows you to set the GIT tag centrally and apply it across all jobs. Environment variables can dynamically set the GIT tag, and automation scripts can fetch the latest GIT tag and update the job configurations accordingly. By implementing these practices, you can maintain the same level of confidence as with DBX, ensuring that your production environment is stable and consistent.&lt;/P&gt;</description>
      <pubDate>Thu, 12 Sep 2024 08:20:37 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/asset-bundles-workspace-or-git/m-p/89568#M37857</guid>
      <dc:creator>Anne165Hernadez</dc:creator>
      <dc:date>2024-09-12T08:20:37Z</dc:date>
    </item>
    <item>
      <title>Re: Asset Bundles - Workspace or GIT?</title>
      <link>https://community.databricks.com/t5/data-engineering/asset-bundles-workspace-or-git/m-p/92133#M38365</link>
      <description>&lt;P&gt;Hi Jeremy,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;When using a DAB, the job reads from the workspace source, not the Git source. We will update the &lt;A href="https://docs.databricks.com/en/repos/ci-cd-techniques-with-repos.html#production-job-workflow" target="_self"&gt;doc page&lt;/A&gt; to include DAB as an option and specifically call out this point to avoid future confusion.&lt;/P&gt;
&lt;P&gt;Check out this example in our talk where we illustrate an end-to-end CICD journey with DAB:&amp;nbsp; &lt;A href="https://github.com/databricks/dais-cow-bff" target="_blank" rel="noopener"&gt;https://github.com/databricks/dais-cow-bff&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 27 Sep 2024 19:06:36 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/asset-bundles-workspace-or-git/m-p/92133#M38365</guid>
      <dc:creator>nicole_lu_PM</dc:creator>
      <dc:date>2024-09-27T19:06:36Z</dc:date>
    </item>
  </channel>
</rss>

