<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic replicate the behaviour of DLT create auto cdc flow in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/replicate-the-behaviour-of-dlt-create-auto-cdc-flow/m-p/141406#M51714</link>
    <description>&lt;P&gt;I want to custom write the behaviour of&amp;nbsp;DLT create auto cdc flow . how can we do it&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 08 Dec 2025 12:49:13 GMT</pubDate>
    <dc:creator>hidden</dc:creator>
    <dc:date>2025-12-08T12:49:13Z</dc:date>
    <item>
      <title>replicate the behaviour of DLT create auto cdc flow</title>
      <link>https://community.databricks.com/t5/data-engineering/replicate-the-behaviour-of-dlt-create-auto-cdc-flow/m-p/141406#M51714</link>
      <description>&lt;P&gt;I want to custom write the behaviour of&amp;nbsp;DLT create auto cdc flow . how can we do it&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 08 Dec 2025 12:49:13 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/replicate-the-behaviour-of-dlt-create-auto-cdc-flow/m-p/141406#M51714</guid>
      <dc:creator>hidden</dc:creator>
      <dc:date>2025-12-08T12:49:13Z</dc:date>
    </item>
    <item>
      <title>Re: replicate the behaviour of DLT create auto cdc flow</title>
      <link>https://community.databricks.com/t5/data-engineering/replicate-the-behaviour-of-dlt-create-auto-cdc-flow/m-p/141417#M51718</link>
      <description>&lt;P&gt;As of late 2025, Databricks’ Lakeflow Spark Declarative Pipelines (SDP) introduced create_auto_cdc_flow() (Python) and AUTO CDC ... INTO (SQL), which replace the older DLT apply_changes API and let you customize the CDC behavior declaratively—keys, sequencing, delete/truncate handling, SCD1 vs SCD2, column-level history, null-update rules, and more.&lt;BR /&gt;&lt;A href="https://docs.databricks.com/aws/en/ldp/cdc" target="_blank"&gt;https://docs.databricks.com/aws/en/ldp/cdc&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 08 Dec 2025 16:06:19 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/replicate-the-behaviour-of-dlt-create-auto-cdc-flow/m-p/141417#M51718</guid>
      <dc:creator>nayan_wylde</dc:creator>
      <dc:date>2025-12-08T16:06:19Z</dc:date>
    </item>
    <item>
      <title>Re: replicate the behaviour of DLT create auto cdc flow</title>
      <link>https://community.databricks.com/t5/data-engineering/replicate-the-behaviour-of-dlt-create-auto-cdc-flow/m-p/141420#M51720</link>
      <description>&lt;P&gt;create_auto_cdc_flow() is the new API replacing DLT apply_changes(), used to build declarative CDC pipelines on Delta Change Data Feed (CDF). It ingests inserts, updates, and deletes from a CDC source and applies them into a target streaming table you define. You specify keys (PK), sequence_by (event ordering), and customize behavior like null-handling, delete logic, truncation logic, column filtering, and SCD Type 1 or 2 storage. Deletes can be interpreted via apply_as_deletes, which uses temporary tombstones with configurable retention. Full table truncation can be triggered using apply_as_truncates (SCD Type 1 only).You can include/exclude specific columns and configure which columns track history. SCD2 requires the target table to include special columns __START_AT and __END_AT with matching type to sequence_by. Supports once=True for backfills (runs as batch). Works only with target streaming tables created using create_streaming_table().&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.databricks.com/aws/en/ldp/developer/ldp-python-ref-apply-changes" target="_blank" rel="noopener"&gt;https://docs.databricks.com/aws/en/ldp/developer/ldp-python-ref-apply-changes&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 08 Dec 2025 16:41:36 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/replicate-the-behaviour-of-dlt-create-auto-cdc-flow/m-p/141420#M51720</guid>
      <dc:creator>Poorva21</dc:creator>
      <dc:date>2025-12-08T16:41:36Z</dc:date>
    </item>
    <item>
      <title>Re: replicate the behaviour of DLT create auto cdc flow</title>
      <link>https://community.databricks.com/t5/data-engineering/replicate-the-behaviour-of-dlt-create-auto-cdc-flow/m-p/141424#M51724</link>
      <description>&lt;P&gt;And you need to handle dozens of exceptions, such as late-arriving data, duplicate data, data in the wrong order, etc.&lt;/P&gt;</description>
      <pubDate>Mon, 08 Dec 2025 16:50:12 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/replicate-the-behaviour-of-dlt-create-auto-cdc-flow/m-p/141424#M51724</guid>
      <dc:creator>Hubert-Dudek</dc:creator>
      <dc:date>2025-12-08T16:50:12Z</dc:date>
    </item>
  </channel>
</rss>

