<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Databricks asset bundle occasionally duplicating jobs in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/databricks-asset-bundle-occasionally-duplicating-jobs/m-p/113757#M44624</link>
    <description>&lt;P class=""&gt;&lt;SPAN class=""&gt;Since last year, we have adopted Databricks Asset Bundles for deploying our workflows to the production and staging environments. The tool has proven to be quite effective, and we currently use Azure DevOps Pipelines to automate bundle deployment, triggered by pull requests and changes to our repository.&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;However, we have observed unexpected behavior: on two occasions, including one instance two weeks ago, our jobs deployed in production were duplicated, without any changes to our deployment routine. It is important to note that this issue manifested exclusively in the production environment, while the staging workspace remains stable and without duplications, despite both using the same deployment process, varying only the target ('uat' or 'prod') in the pipeline's YAML file.&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;While searching for solutions, I found reports from other Databricks community users who have experienced a similar issue, as discussed in this thread: &lt;A class="" href="https://www.google.com/url?sa=E&amp;amp;q=https%3A%2F%2Fcommunity.databricks.com%2Ft5%2Fdata-engineering%2Fazure-pipeline-databricks-bundle-deploy-duplicating-jobs%2Ftd-p%2F82591" target="_blank" rel="noopener"&gt;&lt;SPAN class=""&gt;https://community.databricks.com/t5/data-engineering/azure-pipeline-databricks-bundle-deploy-duplicating-jobs/td-p/82591&lt;SPAN class=""&gt;. &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;&lt;SPAN class=""&gt;&lt;SPAN class=""&gt;Although the discussion does not provide a definitive solution or an official acknowledgment of the issue by Databricks, it reinforces the possibility that we might be facing a genuine, existing issue.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;So, here are a few points I've noticed:&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;Based on my analysis, I suspect the problem might be related to the Databricks Asset Bundle's state management. The bundle maintains state files that record important information about the deployment, such as the job name and ID, and the path to the YAML file used for the deployment.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN class=""&gt;Normally, during a standard deployment, these files are updated with the new information if there are changes to the job or when a new job is added. However, during the duplication incidents, we noticed that the 'Created at' column of the files is updated to the date the issue occurred. This suggests that, instead of simply being updated, the state files are being recreated during these events. This recreation results in the assignment of new job IDs in the YAML file, leading to the creation of a new job with the same name but with a different ID, causing the duplication.&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;The only way we can resolve this is by manually changing these state files back to the previous job IDs. After performing a new bundle deployment, the system returns to normal operation without duplication.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;CLI version: 0.245&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="gsouza_0-1743021507944.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/15648i5CF0FCD61FB4C365/image-size/medium?v=v2&amp;amp;px=400" role="button" title="gsouza_0-1743021507944.png" alt="gsouza_0-1743021507944.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
    <pubDate>Wed, 26 Mar 2025 20:40:39 GMT</pubDate>
    <dc:creator>gsouza</dc:creator>
    <dc:date>2025-03-26T20:40:39Z</dc:date>
    <item>
      <title>Databricks asset bundle occasionally duplicating jobs</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-asset-bundle-occasionally-duplicating-jobs/m-p/113757#M44624</link>
      <description>&lt;P class=""&gt;&lt;SPAN class=""&gt;Since last year, we have adopted Databricks Asset Bundles for deploying our workflows to the production and staging environments. The tool has proven to be quite effective, and we currently use Azure DevOps Pipelines to automate bundle deployment, triggered by pull requests and changes to our repository.&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;However, we have observed unexpected behavior: on two occasions, including one instance two weeks ago, our jobs deployed in production were duplicated, without any changes to our deployment routine. It is important to note that this issue manifested exclusively in the production environment, while the staging workspace remains stable and without duplications, despite both using the same deployment process, varying only the target ('uat' or 'prod') in the pipeline's YAML file.&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;While searching for solutions, I found reports from other Databricks community users who have experienced a similar issue, as discussed in this thread: &lt;A class="" href="https://www.google.com/url?sa=E&amp;amp;q=https%3A%2F%2Fcommunity.databricks.com%2Ft5%2Fdata-engineering%2Fazure-pipeline-databricks-bundle-deploy-duplicating-jobs%2Ftd-p%2F82591" target="_blank" rel="noopener"&gt;&lt;SPAN class=""&gt;https://community.databricks.com/t5/data-engineering/azure-pipeline-databricks-bundle-deploy-duplicating-jobs/td-p/82591&lt;SPAN class=""&gt;. &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;&lt;SPAN class=""&gt;&lt;SPAN class=""&gt;Although the discussion does not provide a definitive solution or an official acknowledgment of the issue by Databricks, it reinforces the possibility that we might be facing a genuine, existing issue.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;So, here are a few points I've noticed:&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;Based on my analysis, I suspect the problem might be related to the Databricks Asset Bundle's state management. The bundle maintains state files that record important information about the deployment, such as the job name and ID, and the path to the YAML file used for the deployment.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN class=""&gt;Normally, during a standard deployment, these files are updated with the new information if there are changes to the job or when a new job is added. However, during the duplication incidents, we noticed that the 'Created at' column of the files is updated to the date the issue occurred. This suggests that, instead of simply being updated, the state files are being recreated during these events. This recreation results in the assignment of new job IDs in the YAML file, leading to the creation of a new job with the same name but with a different ID, causing the duplication.&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;The only way we can resolve this is by manually changing these state files back to the previous job IDs. After performing a new bundle deployment, the system returns to normal operation without duplication.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;CLI version: 0.245&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="gsouza_0-1743021507944.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/15648i5CF0FCD61FB4C365/image-size/medium?v=v2&amp;amp;px=400" role="button" title="gsouza_0-1743021507944.png" alt="gsouza_0-1743021507944.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 26 Mar 2025 20:40:39 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-asset-bundle-occasionally-duplicating-jobs/m-p/113757#M44624</guid>
      <dc:creator>gsouza</dc:creator>
      <dc:date>2025-03-26T20:40:39Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks asset bundle occasionally duplicating jobs</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-asset-bundle-occasionally-duplicating-jobs/m-p/117300#M45472</link>
      <description>&lt;P&gt;Hi,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;There have been internal discussions on this issue in the past and this issue is reported. There are internal trackers for this issue.&lt;/P&gt;
&lt;P&gt;Thanks!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 01 May 2025 06:20:01 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-asset-bundle-occasionally-duplicating-jobs/m-p/117300#M45472</guid>
      <dc:creator>NandiniN</dc:creator>
      <dc:date>2025-05-01T06:20:01Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks asset bundle occasionally duplicating jobs</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-asset-bundle-occasionally-duplicating-jobs/m-p/117704#M45551</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/23233"&gt;@NandiniN&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you for your reply, but we would like to know if there is any way for us to monitor the progress of the solution to this problem. It is a recurring error and it is very disruptive when it occurs.&lt;/P&gt;&lt;P&gt;Best regards.&lt;/P&gt;</description>
      <pubDate>Mon, 05 May 2025 12:53:57 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-asset-bundle-occasionally-duplicating-jobs/m-p/117704#M45551</guid>
      <dc:creator>isabelgontijo</dc:creator>
      <dc:date>2025-05-05T12:53:57Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks asset bundle occasionally duplicating jobs</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-asset-bundle-occasionally-duplicating-jobs/m-p/120763#M46245</link>
      <description>&lt;P&gt;This is negatively impacting us as well.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 02 Jun 2025 20:00:14 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-asset-bundle-occasionally-duplicating-jobs/m-p/120763#M46245</guid>
      <dc:creator>kevin_w_edwards</dc:creator>
      <dc:date>2025-06-02T20:00:14Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks asset bundle occasionally duplicating jobs</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-asset-bundle-occasionally-duplicating-jobs/m-p/141600#M51767</link>
      <description>&lt;P&gt;This is a recurring issue for my org as well.&lt;/P&gt;</description>
      <pubDate>Wed, 10 Dec 2025 15:21:36 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-asset-bundle-occasionally-duplicating-jobs/m-p/141600#M51767</guid>
      <dc:creator>cmantilla</dc:creator>
      <dc:date>2025-12-10T15:21:36Z</dc:date>
    </item>
  </channel>
</rss>

