cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Near real time processing with CDC from snowflake to databricks

abelian-grape
New Contributor III
Hi I would like to configure near real time streaming on Databricks to process data as soon as a new data finish processing on snowflake e.g. with DLT pipelins and Auto Loader. Which option would be better for this setup?
 
Option A)
Export the Snowpark DataFrame to Databricks to an external cloud storage (e.g. S3 as parquet).
 
Option B)
 use apache iceberg with polaris and configure from Databricks in order to read that information.
1 REPLY 1

saurabh18cs
Honored Contributor

it is like latency vs complexity and cost. you have to choose for yourself 🙂 for me option A sounds reasonable