<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Is Lakeflow Connect SCD Type 2 output is incompatible with Spark dec pipeline streaming tables? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/is-lakeflow-connect-scd-type-2-output-is-incompatible-with-spark/m-p/156282#M54399</link>
    <description>&lt;P&gt;## Problem&lt;/P&gt;&lt;P&gt;When using Lakeflow Connect to ingest from SQL Server with SCD Type 2 enabled, any downstream Streaming Table (auto cdc flow) in a Spark Declarative pipeline will fail with the following error:&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;"An error occurred because we detected an update or delete to one or more rows in the source table. Streaming tables may only use append-only streaming sources."&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;This happens because Lakeflow Connect applies MERGE operations to its bronze target table when writing SCD2 history — updating __END_AT on existing rows when new versions arrive. This makes the bronze table non-append-only, which violates the streaming table contract.&lt;/P&gt;&lt;P&gt;We designed this using streaming architecture as we may want to enable continuous data processing. However, for now, we can process this in batch.&amp;nbsp;&lt;/P&gt;&lt;P&gt;These tables are large so a materialized view may not be an option. Auto CDC from snapshot is not an option as this expects a non-streaming source. What is the recommendation for processing data in later layers?&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 06 May 2026 16:28:56 GMT</pubDate>
    <dc:creator>lrm_data</dc:creator>
    <dc:date>2026-05-06T16:28:56Z</dc:date>
    <item>
      <title>Is Lakeflow Connect SCD Type 2 output is incompatible with Spark dec pipeline streaming tables?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-lakeflow-connect-scd-type-2-output-is-incompatible-with-spark/m-p/156282#M54399</link>
      <description>&lt;P&gt;## Problem&lt;/P&gt;&lt;P&gt;When using Lakeflow Connect to ingest from SQL Server with SCD Type 2 enabled, any downstream Streaming Table (auto cdc flow) in a Spark Declarative pipeline will fail with the following error:&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;"An error occurred because we detected an update or delete to one or more rows in the source table. Streaming tables may only use append-only streaming sources."&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;This happens because Lakeflow Connect applies MERGE operations to its bronze target table when writing SCD2 history — updating __END_AT on existing rows when new versions arrive. This makes the bronze table non-append-only, which violates the streaming table contract.&lt;/P&gt;&lt;P&gt;We designed this using streaming architecture as we may want to enable continuous data processing. However, for now, we can process this in batch.&amp;nbsp;&lt;/P&gt;&lt;P&gt;These tables are large so a materialized view may not be an option. Auto CDC from snapshot is not an option as this expects a non-streaming source. What is the recommendation for processing data in later layers?&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 06 May 2026 16:28:56 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-lakeflow-connect-scd-type-2-output-is-incompatible-with-spark/m-p/156282#M54399</guid>
      <dc:creator>lrm_data</dc:creator>
      <dc:date>2026-05-06T16:28:56Z</dc:date>
    </item>
    <item>
      <title>Re: Is Lakeflow Connect SCD Type 2 output is incompatible with Spark dec pipeline streaming tables?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-lakeflow-connect-scd-type-2-output-is-incompatible-with-spark/m-p/156286#M54400</link>
      <description>&lt;P&gt;Following up with a recommendation from Databricks:&lt;/P&gt;&lt;P&gt;For tables that need incremental processing -&amp;nbsp;&lt;/P&gt;&lt;P&gt;SQL Server → &amp;nbsp;Lakeflow Connect → Bronze SCD2 Streaming Table (CDF enabled → consume CDF, not base table using AUTO CDC → Silver SCD2 Streaming Table → Downstream MVs or Streaming Tables&lt;/P&gt;</description>
      <pubDate>Wed, 06 May 2026 17:14:17 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-lakeflow-connect-scd-type-2-output-is-incompatible-with-spark/m-p/156286#M54400</guid>
      <dc:creator>lrm_data</dc:creator>
      <dc:date>2026-05-06T17:14:17Z</dc:date>
    </item>
  </channel>
</rss>

