cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

DLT Incrimental Load And Metadata Capture

Ovasheli
New Contributor

Hello,

I'm building a Delta Live Tables (DLT) pipeline to load data from a cloud source into an on-premise warehouse. My source tables have Change Data Feed (CDF) enabled, and my pipeline code is complex, involving joins of multiple Slowly Changing Dimensions (SCDs).

The pipeline is intended to perform an incremental load, but I've noticed it's reading and processing significantly more rows than expected. This is leading to inefficient pipeline runs.

I also need to capture DLT-generated metadata, specifically the change type (_change type) and commit version (_commit version) from the final DLT output table, not the source tables.

Could you please provide guidance on how to configure the DLT pipeline for a truly incremental load while also ensuring I can capture this essential metadata from the Change Data Feed of the DLT table itself?

1 REPLY 1

szymon_dybczak
Esteemed Contributor III

Hi @Ovasheli ,

The thing is with Declarative Pipelines (former DLT) you can't always force incremental load. For example, if you're using materialized views in your pipeline there is an optimizer called Enzyme that can selectively incrementally load materialized views when the optimizer determines that an incremental update is a more optimal strategy than a full update. Enzyme chooses an incremental strategy when a number of factors are true (for example what operator you use in pipeline etc) . If you have a complex pipeline then Enzyme can estimate that it's better to perform full refresh instead of incremental one.

You can read more about it here:

Incremental refresh for materialized views - Azure Databricks | Microsoft Learn

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now