Near-real-time processing with CDC from Snowflake to Databricks
01-21-2025 06:29 AM - edited 01-21-2025 06:45 AM
Hi, I would like to configure near-real-time streaming on Databricks so that data is processed as soon as new data finishes processing on Snowflake, e.g. with DLT pipelines and Auto Loader. Which option would be better for this setup?
Option A)
Export the Snowpark DataFrame to external cloud storage (e.g. S3 as Parquet) so that Databricks can read it from there.
Option B)
Use Apache Iceberg with Polaris and configure Databricks to read that data.
1 REPLY
01-21-2025 06:36 AM
It is a trade-off between latency on one side and complexity and cost on the other, so you have to choose for yourself 🙂 For me, option A sounds reasonable.
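For option A, the Databricks side could look roughly like the sketch below. It assumes Snowflake writes Parquet files to an S3 prefix and that Auto Loader (the `cloudFiles` source) picks them up as they land. The function names, the bucket path, and the schema location are hypothetical placeholders, not something from your setup.

```python
from typing import Any, Dict


def autoloader_options(schema_location: str) -> Dict[str, str]:
    """Build Auto Loader options for Parquet files exported from Snowflake."""
    return {
        # Files are assumed to be exported as Parquet.
        "cloudFiles.format": "parquet",
        # Auto Loader stores the inferred schema here (hypothetical path).
        "cloudFiles.schemaLocation": schema_location,
    }


def read_snowflake_exports(spark: Any, source_path: str, schema_location: str):
    """Return a streaming DataFrame over new files arriving in source_path."""
    return (
        spark.readStream.format("cloudFiles")
        .options(**autoloader_options(schema_location))
        .load(source_path)
    )


# Example call inside a Databricks notebook, where `spark` is predefined
# (both paths are assumptions for illustration):
# df = read_snowflake_exports(
#     spark,
#     "s3://my-bucket/snowflake_exports/orders/",
#     "s3://my-bucket/_schemas/orders/",
# )
```

In a DLT pipeline you would return this DataFrame from a function decorated with `@dlt.table`. How close to real time this gets depends mostly on how often the Snowflake side exports files, so treat it as a starting point rather than a finished design.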

