Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Change Data Capture (CDC)

subhas_hati
New Contributor

May I know what Change Data Capture is?

1 REPLY

szymon_dybczak
Esteemed Contributor III

Hi @subhas_hati ,

Change data capture (CDC) is a data integration pattern that captures changes made to data in a source system, such as inserts, updates, and deletes. These changes, represented as a list, are commonly referred to as a CDC feed. Operating on a CDC feed is usually much faster than reading the entire source dataset, because you only process the rows that changed. Transactional databases such as SQL Server, MySQL, and Oracle generate CDC feeds. Delta tables generate their own CDC feed, known as a change data feed (CDF).
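
To make the idea concrete, here is a minimal sketch in plain Python of applying a CDC feed to a target table. It is only an illustration, not how Databricks or any database implements it: the event shape (`op`, `id`, `row`) and the function name `apply_cdc_feed` are assumptions made up for this example.

```python
from typing import Any

# Hypothetical event shape (an assumption for this sketch):
#   {"op": "insert" | "update" | "delete", "id": <key>, "row": {<changed columns>}}
def apply_cdc_feed(
    target: dict[int, dict[str, Any]],
    feed: list[dict[str, Any]],
) -> dict[int, dict[str, Any]]:
    """Apply change events, in order, to a copy of a table keyed by id."""
    result = {key: dict(row) for key, row in target.items()}
    for event in feed:
        op, key = event["op"], event["id"]
        if op in ("insert", "update"):
            # Upsert: merge the changed columns into the existing row, if any.
            result[key] = {**result.get(key, {}), **event["row"]}
        elif op == "delete":
            # Deleting a key that is already gone is treated as a no-op.
            result.pop(key, None)
        else:
            raise ValueError(f"unknown operation: {op!r}")
    return result


target = {1: {"name": "Ada"}, 2: {"name": "Bob"}}
feed = [
    {"op": "insert", "id": 3, "row": {"name": "Cy"}},
    {"op": "update", "id": 1, "row": {"name": "Ada L."}},
    {"op": "delete", "id": 2},
]
updated = apply_cdc_feed(target, feed)
print(updated)
```

Only the three events in the feed are processed here, regardless of how large the target table is, which is where the speed-up over re-reading the whole source comes from.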

If you want more details, take a look at the articles below:

What is change data capture (CDC)? | Databricks on AWS

Use Delta Lake change data feed on Azure Databricks - Azure Databricks | Microsoft Learn

 
