In this post, we’ll explore how Magnite and Databricks collaborated to build:
In today’s data-driven ecosystem, Magnite relies on it’s customers to share clean, trusted datasets. This inbound data-sharing setup often comes with challenges:
To solve these challenges, Magnite and Databricks designed a solution that automates customer onboarding, removes manual data-sharing workflows, and enforces consistent validation and governance at scale, enabling standardized publishing by customers and streamlined consumption by Magnite using the Lakehouse and Delta Sharing.
Here’s how the end-to-end workflow is structured:
On the Magnite side, automation is driven by a Lakeflow job that continuously listens at fixed intervals for newly created Delta Shares from customers.
When a new share is detected, the workflow performs the following steps:
Each per-customer Lakeflow job then:
This two-tier design cleanly separates orchestration from data processing, enabling Magnite to onboard new customers automatically while maintaining isolation, observability, and consistent operational behavior at scale.
This solution is deeply enabled by native Databricks platform capabilities that simplify onboarding, enforce data quality, and make inbound data sharing scalable and operationally reliable.
Databricks Marketplace productizes onboarding into a self-service experience. Magnite’s customers get a standardized listing with schema requirements, guided runbooks, and a packaged Python wheel for validation and setup, dramatically reducing onboarding time and enabling consistent adoption across many customers.
Delta Sharing provides the core sharing abstraction. Magnite’s customers are able to programmatically create shares and recipients, enable sharing, and expose governed, read-only access to live Delta tables. This enables zero-copy sharing, where data remains in the provider’s environment and is accessed without duplicating datasets or building custom export pipelines, reducing the overhead and complexity of data movement while preserving secure, governed access to live data.
Delta Lake Change Data Feed (CDF) is what makes scalable ingestion practical for Magnite’s use case. While Delta Sharing enables live access to customer tables, Magnite requires its own managed copy of the data for transformation, enrichment, downstream processing, and operational isolation. CDF allows Magnite to maintain that copy efficiently by consuming only incremental row-level changes instead of performing full table refreshes. Without CDF, Magnite would need to build and operate a custom CDC mechanism to replicate partner data into its environment.
Databricks REST APIs are what make the workflow fully automated at scale. Magnite uses APIs to discover new shares, enumerate shared tables/metadata, create per-customer catalogs, provision per-customer Lakeflow jobs, and update operational queries/alerts, eliminating manual onboarding steps and lowering engineering overhead.
Lakeflow Jobs unify orchestration and streaming. A persistent “share listener” job detects new customer shares, validates schemas, provisions ingestion jobs, and runs transformation pipelines.
The Databricks Lakehouse stores operational metadata, CDF metrics, and audit logs in Delta tables, providing built-in observability and troubleshooting.
Together, these capabilities reduce onboarding friction, improve data quality and consistency, lower engineering overhead, and enable scalable adoption. Without Databricks, Magnite would have needed custom export pipelines, external orchestration, bespoke governance layers, a separate control plane for automation, and manual customer setup resulting in higher cost, slower onboarding, and a more fragile system.
Creating this new pipeline into Magnite opens up new pathways for data monetization, reinforcing our broader narrative of flexible, privacy-safe onboarding. By integrating directly with Databricks, we make it easier for partners to activate their data within Magnite, expanding how and where data can drive value.
Inbound data sharing is becoming critical in modern data ecosystems. By leveraging the Databricks Lakehouse, Delta Sharing, Change Data Feed, and a Marketplace-first approach, Magnite has built a solution that makes it easy for customers to publish trusted data, and easy for Magnite to consume it at scale. Together, this enables faster insights, stronger collaboration, and greater value from shared data.
This solution is also a strong example of how Databricks Professional Services partners with customers to accelerate time to value. By combining deep platform expertise with Magnite’s domain knowledge, the joint team delivered a scalable, production-grade inbound data-sharing architecture, turning what would have been a complex, bespoke integration into a repeatable, productized workflow.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.