Databricks Community

JissMathew · ‎11-29-2024

Hi everyone,

I’m working on implementing Structured Streaming in Databricks to capture Change Data Capture (CDC) as part of a Medallion Architecture (Bronze, Silver, and Gold layers). While Microsoft’s documentation provides a theoretical approach, I’m looking for hands-on examples or code snippets that you’ve successfully used in a real-world project.

Specifically, I’d like to understand:

How to ingest data into a Delta table (Bronze layer) using Auto Loader or another streaming method.
How to process this data incrementally to create CDC and propagate changes to Silver and Gold layers.
Any recommendations for configurations or optimizations to manage schema evolution and large datasets effectively.

If anyone has experience with this and can share practical examples or insights beyond the documentation, it would be greatly appreciated!

Thank you in advance!

Jiss Mathew
India .

szymon_dybczak · ‎11-29-2024

Hi @JissMathew ,

Do you have access to databricks academy? I believe in their data engineering track there's pleny of example notebooks.
Or you can try dbdemos. For example, here you can find demo notebook for autoloader

Databricks Autoloader (cloudfile)

If you'd like to test it on your databricks instance just do the following:

%pip install dbdemos

import dbdemosdbdemos.install('auto-loader')

For CDC pipeline you can use following:

CDC Pipeline With Delta | Databricks

View solution in original post

szymon_dybczak · ‎11-29-2024

Hi @JissMathew ,

Do you have access to databricks academy? I believe in their data engineering track there's pleny of example notebooks.
Or you can try dbdemos. For example, here you can find demo notebook for autoloader

Databricks Autoloader (cloudfile)

If you'd like to test it on your databricks instance just do the following:

%pip install dbdemos

import dbdemosdbdemos.install('auto-loader')

For CDC pipeline you can use following:

CDC Pipeline With Delta | Databricks

JissMathew · ‎12-03-2024

Hi @szymon_dybczak , Thank you very much. Your reply provided me with an excellent reference solution. I had been struggling with structured streaming, and your help was incredibly valuable and insightful.

Jiss Mathew
India .

Databricks Community

Seeking Practical Example for Structured Streaming with Delta Tables in Medallion Architecture

Join Us as a Local Community Builder!

PSA: Community Edition retires at the end of 2025 - move to Free Edition today to keep your work.

🎤 Call for Presentations: Data + AI Summit 2026 is Open!

Last Chance: Help Shape the 2026 Data + AI Summit | Win a Full Conference Pass

🌟 Community Pulse: Your Weekly Roundup! December 05 – 11, 2025

Jaipur Usergroup First Virtual Meetup: AI/BI Genie + Data Science Careers — 19 Dec | 6 PM IST