Databricks Community

abelian-grape · ‎01-21-2025

Hi, I would like to configure near-real-time streaming on Databricks so that data is processed as soon as new data finishes processing on Snowflake, e.g. with DLT pipelines and Auto Loader. Which option would be better for this setup?

Option A)

Export the Snowpark DataFrame to external cloud storage (e.g. S3 as Parquet) for Databricks to read.

Option B)

Use Apache Iceberg with Polaris and configure Databricks to read that data.

saurabh18cs · ‎01-21-2025

It comes down to latency vs. complexity and cost. You have to choose for yourself 🙂 For me, option A sounds reasonable.
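
For reference, the Databricks side of option A could look roughly like the sketch below: Auto Loader (`cloudFiles`) picks up new Parquet files as Snowflake exports them and appends them to a table. This is only a minimal sketch, not a tested pipeline. The function names, the `_schema` subfolder, and the one-minute trigger are my own assumptions, and the paths and table name in the usage note are hypothetical placeholders.

```python
def autoloader_options(schema_location: str) -> dict:
    """Auto Loader options for incrementally ingesting new Parquet files."""
    return {
        # Format of the files that the Snowflake export writes to cloud storage.
        "cloudFiles.format": "parquet",
        # Where Auto Loader persists the inferred schema between runs.
        "cloudFiles.schemaLocation": schema_location,
    }


def start_ingest(spark, source_path: str, checkpoint_path: str, target_table: str):
    """Start a streaming query that appends newly arrived files to a table.

    `spark` is the SparkSession that Databricks provides in notebooks and jobs.
    """
    stream = (
        spark.readStream.format("cloudFiles")
        # Assumption: keep the schema files next to the checkpoint.
        .options(**autoloader_options(f"{checkpoint_path}/_schema"))
        .load(source_path)
    )
    return (
        stream.writeStream
        .option("checkpointLocation", checkpoint_path)
        # Assumption: a one-minute micro-batch is "near real time" enough here.
        .trigger(processingTime="1 minute")
        .toTable(target_table)
    )
```

On a cluster you would call something like `start_ingest(spark, "s3://my-bucket/snowflake-export/", "s3://my-bucket/_checkpoints/snowflake", "bronze.snowflake_export")`. The trigger interval is where the latency vs. cost trade-off shows up: a shorter interval lowers latency but keeps compute busier.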


Near real time processing with CDC from snowflake to databricks
