<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Issue with Resetting Checkpoint Metadata in DLT with Unity Catalog in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/issue-with-resetting-checkpoint-metadata-in-dlt-with-unity/m-p/127319#M47920</link>
    <description>&lt;P&gt;Hi Community,&lt;BR /&gt;&lt;BR /&gt;Hope someone can help with this DLT question.&lt;BR /&gt;&lt;BR /&gt;I am currently working in a Databricks environment using Delta Live Tables (DLT) with Unity Catalog enabled, and I'm encountering a blocker related to schema evolution and checkpoint metadata.&lt;/P&gt;&lt;P&gt;I am developing a streaming DLT pipeline in Python to build out dimensional and fact tables iteratively. During this process, I frequently drop and re-create tables (e.g. dev.dds.dim_office) to refine schema and logic. However, I'm running into the following error when trying to re-run the pipeline after modifying the schema:&lt;/P&gt;&lt;P&gt;com.databricks.pipelines.common.errors.DLTAnalysisException: Table 'dev.dds.dim_office' has a user-specified schema that is incompatible with the schema inferred from its query.&lt;/P&gt;&lt;P&gt;Streaming tables are stateful and remember data that has already been processed. If you want to recompute the table from scratch, please full refresh the table.&lt;/P&gt;&lt;P&gt;Declared schema:&lt;BR /&gt;root&lt;BR /&gt;|-- id: long (nullable = true)&lt;BR /&gt;|-- account_siv_id: string (nullable = true)&lt;/P&gt;&lt;P&gt;Inferred schema:&lt;BR /&gt;root&lt;BR /&gt;|-- account_type: string (nullable = false)&lt;BR /&gt;|-- account_siv_id: string (nullable = true)&lt;/P&gt;&lt;P&gt;I’ve attempted to reset the pipeline by dropping the table using:&lt;BR /&gt;&lt;BR /&gt;DROP TABLE dev.dds.dim_office PURGE;&lt;BR /&gt;&lt;BR /&gt;This appears to drop the table successfully, but does &lt;STRONG&gt;not&lt;/STRONG&gt; remove the associated checkpoint metadata.
I have also tried manually deleting the checkpoint folder:&lt;BR /&gt;&lt;BR /&gt;dbutils.fs.rm("abfss://test@xxxxxx.dfs.core.windows.net/managed/__unitystorage/catalogs/&amp;lt;catalog_id&amp;gt;/checkpoints/dim_office", recurse=True)&lt;BR /&gt;&lt;BR /&gt;However, this returns an error:&lt;BR /&gt;&lt;BR /&gt;overlaps with managed storage within 'CheckPathAccess' call&lt;BR /&gt;&lt;BR /&gt;It seems that checkpoint metadata is not being fully cleared when the table is dropped, and I am unable to force a fresh recomputation or schema reset due to this residual state.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;My Use Case:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;As part of the early development/POC phase, I need to iterate quickly, dropping and recreating tables (including schema changes) without residual metadata interference. This is proving to be a major limitation for adopting DLT pipelines in production under Unity Catalog.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Request/Advice Sought:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;How can I &lt;STRONG&gt;fully reset the DLT table&lt;/STRONG&gt;, including its &lt;STRONG&gt;schema, checkpoint, and lineage metadata&lt;/STRONG&gt;?&lt;/P&gt;</description>
    <pubDate>Mon, 04 Aug 2025 11:49:45 GMT</pubDate>
    <dc:creator>Suki</dc:creator>
    <dc:date>2025-08-04T11:49:45Z</dc:date>
    <item>
      <title>Issue with Resetting Checkpoint Metadata in DLT with Unity Catalog</title>
      <link>https://community.databricks.com/t5/data-engineering/issue-with-resetting-checkpoint-metadata-in-dlt-with-unity/m-p/127319#M47920</link>
      <description>&lt;P&gt;Hi Community,&lt;BR /&gt;&lt;BR /&gt;Hope someone can help with this DLT question.&lt;BR /&gt;&lt;BR /&gt;I am currently working in a Databricks environment using Delta Live Tables (DLT) with Unity Catalog enabled, and I'm encountering a blocker related to schema evolution and checkpoint metadata.&lt;/P&gt;&lt;P&gt;I am developing a streaming DLT pipeline in Python to build out dimensional and fact tables iteratively. During this process, I frequently drop and re-create tables (e.g. dev.dds.dim_office) to refine schema and logic. However, I'm running into the following error when trying to re-run the pipeline after modifying the schema:&lt;/P&gt;&lt;P&gt;com.databricks.pipelines.common.errors.DLTAnalysisException: Table 'dev.dds.dim_office' has a user-specified schema that is incompatible with the schema inferred from its query.&lt;/P&gt;&lt;P&gt;Streaming tables are stateful and remember data that has already been processed. If you want to recompute the table from scratch, please full refresh the table.&lt;/P&gt;&lt;P&gt;Declared schema:&lt;BR /&gt;root&lt;BR /&gt;|-- id: long (nullable = true)&lt;BR /&gt;|-- account_siv_id: string (nullable = true)&lt;/P&gt;&lt;P&gt;Inferred schema:&lt;BR /&gt;root&lt;BR /&gt;|-- account_type: string (nullable = false)&lt;BR /&gt;|-- account_siv_id: string (nullable = true)&lt;/P&gt;&lt;P&gt;I’ve attempted to reset the pipeline by dropping the table using:&lt;BR /&gt;&lt;BR /&gt;DROP TABLE dev.dds.dim_office PURGE;&lt;BR /&gt;&lt;BR /&gt;This appears to drop the table successfully, but does &lt;STRONG&gt;not&lt;/STRONG&gt; remove the associated checkpoint metadata.
I have also tried manually deleting the checkpoint folder:&lt;BR /&gt;&lt;BR /&gt;dbutils.fs.rm("abfss://test@xxxxxx.dfs.core.windows.net/managed/__unitystorage/catalogs/&amp;lt;catalog_id&amp;gt;/checkpoints/dim_office", recurse=True)&lt;BR /&gt;&lt;BR /&gt;However, this returns an error:&lt;BR /&gt;&lt;BR /&gt;overlaps with managed storage within 'CheckPathAccess' call&lt;BR /&gt;&lt;BR /&gt;It seems that checkpoint metadata is not being fully cleared when the table is dropped, and I am unable to force a fresh recomputation or schema reset due to this residual state.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;My Use Case:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;As part of the early development/POC phase, I need to iterate quickly, dropping and recreating tables (including schema changes) without residual metadata interference. This is proving to be a major limitation for adopting DLT pipelines in production under Unity Catalog.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Request/Advice Sought:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;How can I &lt;STRONG&gt;fully reset the DLT table&lt;/STRONG&gt;, including its &lt;STRONG&gt;schema, checkpoint, and lineage metadata&lt;/STRONG&gt;?&lt;/P&gt;</description>
      <pubDate>Mon, 04 Aug 2025 11:49:45 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/issue-with-resetting-checkpoint-metadata-in-dlt-with-unity/m-p/127319#M47920</guid>
      <dc:creator>Suki</dc:creator>
      <dc:date>2025-08-04T11:49:45Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Resetting Checkpoint Metadata in DLT with Unity Catalog</title>
      <link>https://community.databricks.com/t5/data-engineering/issue-with-resetting-checkpoint-metadata-in-dlt-with-unity/m-p/127321#M47921</link>
      <description>&lt;P&gt;Supplementary note:&lt;BR /&gt;&lt;BR /&gt;This table property is set: table_properties={"pipelines.reset.allowed": "true"}&lt;BR /&gt;&lt;BR /&gt;Even so, when I choose Full Refresh, I still get the same error.&lt;/P&gt;</description>
      <pubDate>Mon, 04 Aug 2025 11:52:46 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/issue-with-resetting-checkpoint-metadata-in-dlt-with-unity/m-p/127321#M47921</guid>
      <dc:creator>Suki</dc:creator>
      <dc:date>2025-08-04T11:52:46Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Resetting Checkpoint Metadata in DLT with Unity Catalog</title>
      <link>https://community.databricks.com/t5/data-engineering/issue-with-resetting-checkpoint-metadata-in-dlt-with-unity/m-p/127336#M47926</link>
      <description>&lt;P&gt;I feel you.&lt;/P&gt;&lt;P&gt;It's probably not the ideal approach, but did you try destroying and re-deploying your pipeline?&lt;/P&gt;</description>
      <pubDate>Mon, 04 Aug 2025 13:26:24 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/issue-with-resetting-checkpoint-metadata-in-dlt-with-unity/m-p/127336#M47926</guid>
      <dc:creator>T0M</dc:creator>
      <dc:date>2025-08-04T13:26:24Z</dc:date>
    </item>
  </channel>
</rss>

