Open Lakehouse Meetup I Mountain View
Join us for an Open Lakehouse Meetup on Tuesday, March 11 from 5:00 PM - 9:00 PM PST at the Databricks’ Mountain View office!
This meetup is designed for OSS data practitioners and developers who build production data pipelines. Immerse yourself in learning, collaborating, and community building as we explore cutting-edge topics in open source data engineering focusing on the open lakehouse paradigm.
This event showcases the latest advancements in open lakehouse technologies, with a focus on the open lakehouse including Delta Lake, Apache Iceberg™, Apache Spark™, Rust, Apache Arrow™, and more. Learn how to build and optimize modern lakehouse architectures, deploy real-world applications, and lakehouse best practices.
Don't miss this opportunity to accelerate your data journey and contribute to shaping the future of data and AI!
🔗🌟 RSVP to secure your spot at the event ➡️ https://lu.ma/okxq0bt1
AGENDA
-
5:00 PM: Registration & Mingling
-
6:00 PM: Welcome Remarks by :
-
Jules Damji (Developer Advocate)
-
-
6:15 PM: Session #1 – Lessons Learned on delta.rs with Delta Lake Definitive Guide Authors featuring:
-
Robert Tyler Croy (Director of Platform Engineering, Scribd)
-
Scott Haines (Distinguished Software Engineer, Nike)
-
Robert Pack (Staff Developer Advocate, Databricks)
-
Denny Lee (Principal Developer Advocate, Databricks)
-
-
6:45 PM: Session #2 – Incremental Iceberg Table Replication At Scale featuring:
-
Szehon Ho (Senior Staff Software Engineer, Databricks)
-
-
7:15 PM: Session #3 – Open Lakehouse Panel Discussion featuring:
-
Daniel Weeks (Co-founder of Tabular and co-creator of Apache Iceberg)
-
Xiao Li (Engineering Director, Databricks)
-
DB Tsai (Sr. Engineering Manager, Databricks)
-
-
7:45 PM: Closing Remarks by:
-
8:00 PM: Reception with bites and beverages
-
9:00 PM: Good night
SESSION DESCRIPTIONS
-
Lessons Learned on delta.rs with Delta Lake Definitive Guide Authors: Dive into the Rust and Arrow ecosystem with Delta Lake committers and authors of Delta Lake: The Definitive Guide. Understand some of our key learnings on why, ultimately, we created delta-kernel-rs to abstract lakehouse format metadata from the underlying data.
-
Incremental Iceberg Table Replication At Scale: Apache Iceberg™ is a popular table format for managing large analytical datasets. However, replicating iceberg tables at scale can be a daunting task—especially when dealing with its hierarchical metadata. In this talk, we will present an end-to-end workflow for replicating Apache Iceberg tables, leveraging Apache Spark™ to ensure that backup tables remain identical to their source counterparts, including snapshot history, schema, and partition specifications. More excitingly, we have contributed these libraries back to the open source community.
-
Open Lakehouse Panel discussion: Join Apache Spark™, Apache Iceberg™, and Delta Lake committers, maintainers, and contributors on a fun panel discussion. Come with your questions and we’ll hopefully have the answers!
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
can we attend from London ? virtual attendance possible?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
HI @basit_siddiqui! Unfortunately the event will not be livestreamed but we will have two London events end of April! Subscribe to our calendar: lu.ma/DevConnectDBX to be updated once those events are posted.

