<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Materialized view always load full table instead of incremental in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/materialized-view-always-load-full-table-instead-of-incremental/m-p/141339#M51701</link>
    <description>&lt;P&gt;My delta table are stored at HANA data lake file and&amp;nbsp;I have ETL configured like below&lt;/P&gt;&lt;LI-CODE lang="python"&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/25059"&gt;@DP&lt;/a&gt;.materialized_view(temporary=True)
def source():
    return spark.read.format("delta").load("/data/source")

@dp.materialized_view
def sink():
    return spark.read.table("source").withColumnRenamed("COL_A", "COL_B")&lt;/LI-CODE&gt;&lt;P&gt;When I first ran pipeline, it show 100k records has been processed for both table.&lt;/P&gt;&lt;P&gt;For the second run, since there is no update from source table, so I'm expecting no records will be processed. But the dashboard still show 100k.&lt;/P&gt;&lt;P&gt;I'm also check whether the source table enable change data feed by executing&lt;/P&gt;&lt;LI-CODE lang="python"&gt;dt = DeltaTable.forPath(spark, "/data/source")
detail = dt.detail().collect()[0]
props = detail.asDict().get("properties", {})
for k, v in props.items():
    print(f"{k}: {v}")&lt;/LI-CODE&gt;&lt;P&gt;and the result is&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;pipelines.metastore.tableName: `default`.`source`
pipelines.pipelineId: 645fa38f-f6bf-45ab-a696-bd923457dc85
delta.enableChangeDataFeed: true&lt;/LI-CODE&gt;&lt;P&gt;Anybody knows what am I missing here?&lt;/P&gt;&lt;P&gt;Thank in advance.&lt;/P&gt;</description>
    <pubDate>Sun, 07 Dec 2025 07:42:19 GMT</pubDate>
    <dc:creator>anhnnguyen</dc:creator>
    <dc:date>2025-12-07T07:42:19Z</dc:date>
    <item>
      <title>Materialized view always load full table instead of incremental</title>
      <link>https://community.databricks.com/t5/data-engineering/materialized-view-always-load-full-table-instead-of-incremental/m-p/141339#M51701</link>
      <description>&lt;P&gt;My delta table are stored at HANA data lake file and&amp;nbsp;I have ETL configured like below&lt;/P&gt;&lt;LI-CODE lang="python"&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/25059"&gt;@DP&lt;/a&gt;.materialized_view(temporary=True)
def source():
    return spark.read.format("delta").load("/data/source")

@dp.materialized_view
def sink():
    return spark.read.table("source").withColumnRenamed("COL_A", "COL_B")&lt;/LI-CODE&gt;&lt;P&gt;When I first ran pipeline, it show 100k records has been processed for both table.&lt;/P&gt;&lt;P&gt;For the second run, since there is no update from source table, so I'm expecting no records will be processed. But the dashboard still show 100k.&lt;/P&gt;&lt;P&gt;I'm also check whether the source table enable change data feed by executing&lt;/P&gt;&lt;LI-CODE lang="python"&gt;dt = DeltaTable.forPath(spark, "/data/source")
detail = dt.detail().collect()[0]
props = detail.asDict().get("properties", {})
for k, v in props.items():
    print(f"{k}: {v}")&lt;/LI-CODE&gt;&lt;P&gt;and the result is&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;pipelines.metastore.tableName: `default`.`source`
pipelines.pipelineId: 645fa38f-f6bf-45ab-a696-bd923457dc85
delta.enableChangeDataFeed: true&lt;/LI-CODE&gt;&lt;P&gt;Anybody knows what am I missing here?&lt;/P&gt;&lt;P&gt;Thank in advance.&lt;/P&gt;</description>
      <pubDate>Sun, 07 Dec 2025 07:42:19 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/materialized-view-always-load-full-table-instead-of-incremental/m-p/141339#M51701</guid>
      <dc:creator>anhnnguyen</dc:creator>
      <dc:date>2025-12-07T07:42:19Z</dc:date>
    </item>
    <item>
      <title>Re: Materialized view always load full table instead of incremental</title>
      <link>https://community.databricks.com/t5/data-engineering/materialized-view-always-load-full-table-instead-of-incremental/m-p/141376#M51709</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/198085"&gt;@anhnnguyen&lt;/a&gt;,&amp;nbsp;&lt;/P&gt;&lt;P&gt;You defined source for you CDF as temporary view and they are always &lt;STRONG&gt;fully refreshed&lt;/STRONG&gt; on every pipeline run.&lt;/P&gt;&lt;P&gt;Try defining it without this option&lt;/P&gt;</description>
      <pubDate>Mon, 08 Dec 2025 09:11:33 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/materialized-view-always-load-full-table-instead-of-incremental/m-p/141376#M51709</guid>
      <dc:creator>GaweL</dc:creator>
      <dc:date>2025-12-08T09:11:33Z</dc:date>
    </item>
    <item>
      <title>Re: Materialized view always load full table instead of incremental</title>
      <link>https://community.databricks.com/t5/data-engineering/materialized-view-always-load-full-table-instead-of-incremental/m-p/141393#M51710</link>
      <description>&lt;P&gt;The issue comes from defining your source as a temporary materialized view:&lt;/P&gt;&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/25059"&gt;@DP&lt;/a&gt;.materialized_view(temporary=True)&lt;BR /&gt;def source():&lt;/P&gt;&lt;P&gt;Temporary materialized views do not track state between pipeline runs. Because of that, the view is fully refreshed every time, so the pipeline always reprocesses the entire dataset instead of using CDF incrementally.&lt;/P&gt;&lt;P&gt;To fix this, remove temporary=True so the materialized view can maintain state and leverage Change Data Feed properly&lt;/P&gt;&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/25059"&gt;@DP&lt;/a&gt;.materialized_view&lt;BR /&gt;def source():&lt;/P&gt;&lt;P&gt;After making this change, your pipeline should only process incremental changes.&lt;/P&gt;</description>
      <pubDate>Mon, 08 Dec 2025 11:50:03 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/materialized-view-always-load-full-table-instead-of-incremental/m-p/141393#M51710</guid>
      <dc:creator>Yogesh_Verma_</dc:creator>
      <dc:date>2025-12-08T11:50:03Z</dc:date>
    </item>
    <item>
      <title>Re: Materialized view always load full table instead of incremental</title>
      <link>https://community.databricks.com/t5/data-engineering/materialized-view-always-load-full-table-instead-of-incremental/m-p/141411#M51716</link>
      <description>&lt;P&gt;hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/78842"&gt;@Yogesh_Verma_&lt;/a&gt;&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/200035"&gt;@GaweL&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;after removing&amp;nbsp;&lt;SPAN&gt;temporary=True, pipeline still full recompute every run even though there is no change in source&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 08 Dec 2025 13:55:13 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/materialized-view-always-load-full-table-instead-of-incremental/m-p/141411#M51716</guid>
      <dc:creator>anhnnguyen</dc:creator>
      <dc:date>2025-12-08T13:55:13Z</dc:date>
    </item>
    <item>
      <title>Re: Materialized view always load full table instead of incremental</title>
      <link>https://community.databricks.com/t5/data-engineering/materialized-view-always-load-full-table-instead-of-incremental/m-p/141415#M51717</link>
      <description>&lt;P&gt;1 more note that I'm not using Unity Catalog here, not sure if it's relevant&lt;/P&gt;</description>
      <pubDate>Mon, 08 Dec 2025 15:44:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/materialized-view-always-load-full-table-instead-of-incremental/m-p/141415#M51717</guid>
      <dc:creator>anhnnguyen</dc:creator>
      <dc:date>2025-12-08T15:44:09Z</dc:date>
    </item>
    <item>
      <title>Re: Materialized view always load full table instead of incremental</title>
      <link>https://community.databricks.com/t5/data-engineering/materialized-view-always-load-full-table-instead-of-incremental/m-p/141425#M51725</link>
      <description>&lt;P&gt;Can you try to register in UC as an external table? Additionally, if there is column masking or row filtering in Delta, it will always be a full recompute.&lt;BR /&gt;&lt;BR /&gt;Also, your metadata shows CDF enabled, row tracking is different TBLPROPERTIES. The best would be like that:&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;ALTER TABLE &amp;lt;table-name&amp;gt; SET TBLPROPERTIES (
  delta.enableDeletionVectors = true,
  delta.enableRowTracking = true,
  delta.enableChangeDataFeed = true);&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 08 Dec 2025 16:55:40 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/materialized-view-always-load-full-table-instead-of-incremental/m-p/141425#M51725</guid>
      <dc:creator>Hubert-Dudek</dc:creator>
      <dc:date>2025-12-08T16:55:40Z</dc:date>
    </item>
    <item>
      <title>Re: Materialized view always load full table instead of incremental</title>
      <link>https://community.databricks.com/t5/data-engineering/materialized-view-always-load-full-table-instead-of-incremental/m-p/141466#M51733</link>
      <description>&lt;P&gt;I tried enabling 3 options as recommended in documentation&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;"delta.enableChangeDataFeed": "true"
"delta.enableRowTracking": "true"
"delta.enableDeletionVectors": "true"&lt;/LI-CODE&gt;&lt;P&gt;but no luck, will try registering as external table later since my workspace has not enabled it yet.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 09 Dec 2025 00:14:01 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/materialized-view-always-load-full-table-instead-of-incremental/m-p/141466#M51733</guid>
      <dc:creator>anhnnguyen</dc:creator>
      <dc:date>2025-12-09T00:14:01Z</dc:date>
    </item>
  </channel>
</rss>

