Building a Spark Declarative Pipeline OSS with Apa...

Pranav_1699 · 2 hours ago

Hey everyone,

I recently worked on building a modern financial data lakehouse using Spark Declarative Pipeline OSS (SDP OSS), Apache Iceberg, and AWS Glue Catalog.

The blog covers:

- Building declarative data pipelines with Spark

- Using Apache Iceberg as the table format

- Managing metadata with AWS Glue Catalog

- Streaming + batch style processing patterns

- End-to-end lakehouse architecture ideas

Looking forward to feedback and discussions 🙂

Building a Spark Declarative Pipeline OSS with Apache Iceberg and AWS Glue Catalog