AUTO CDC made me curious about one practical question: if Auto CDC is now one of the easiest ways to process CDF, is it also the cheapest? To answer that, I compared 3 approaches:
- AUTO CDC pipeline (in standard and performance mode)
- Spark Structured Streaming (in standard and performance mode)
- SQL warehouse with table_changes()
test results:
https://www.sunnydata.ai/blog/auto-cdc-change-data-feed-cost-benchmark-databricks
https://medium.com/@databrickster/change-data-feed-in-databricks-delta-how-to-process-it-the-most-ef...

My blog: https://databrickster.medium.com/