@ryojikn and @irtizak, you're right. Databricks Model Serving lets you split traffic between model versions, but it doesn't offer true shadow deployment, where live production traffic is mirrored to a new model for monitoring without affecting user responses.
For now, you can try a couple of custom approaches:
1) Deploy one endpoint with your production model and another with the shadow model. On the client side, duplicate each incoming request to both endpoints, but return only the production model's response to the user. Capture both responses in the endpoints' inference tables and compare them later for analysis.
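As a minimal sketch of the client-side duplication, the helper below fans a request out to both endpoints and returns only the production answer. The callables `call_prod`, `call_shadow`, and `log_pair` are hypothetical placeholders; in practice each would be a `requests.post` to the corresponding serving endpoint's invocations URL, and the logger might write to a Delta table.

```python
import concurrent.futures

def score_with_shadow(payload, call_prod, call_shadow, log_pair):
    """Send the same payload to both endpoints; the user only ever
    sees the production response. All three callables are
    placeholders for your own endpoint clients and logging."""
    with concurrent.futures.ThreadPoolExecutor(max_workers=2) as pool:
        prod_future = pool.submit(call_prod, payload)
        shadow_future = pool.submit(call_shadow, payload)
        prod_resp = prod_future.result()
        try:
            # A slow or failing shadow model must never affect users.
            shadow_resp = shadow_future.result(timeout=5)
        except Exception:
            shadow_resp = None
    log_pair(payload, prod_resp, shadow_resp)  # persist for later comparison
    return prod_resp
```

Running both calls in parallel keeps the added latency close to that of the production call alone.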
2) Wrap your models inside a PyFunc model and handle the routing within the wrapper itself. Reference the models dynamically through registry aliases (like `champion` and `challenger`), so whenever a model version changes you don't need to update the wrapper code; it automatically resolves the correct model version from the alias when the endpoint is updated.
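A sketch of the routing logic you'd put inside such a wrapper, with the model loader injected so it is easy to test. In a real `mlflow.pyfunc.PythonModel` you'd do the loading in `load_context()` with `mlflow.pyfunc.load_model(...)`; the model name `my_model` here is a hypothetical placeholder.

```python
class ChampionChallengerRouter:
    """Alias-based routing sketch. `load_model` stands in for
    mlflow.pyfunc.load_model; "models:/<name>@<alias>" URIs resolve
    to whichever version currently holds the alias at load time,
    so no version number is ever hard-coded here."""

    def __init__(self, model_name, load_model):
        self.champion = load_model(f"models:/{model_name}@champion")
        self.challenger = load_model(f"models:/{model_name}@challenger")

    def predict(self, model_input):
        # Score the challenger for logging/comparison only...
        shadow_pred = self.challenger.predict(model_input)
        # ...but return only the champion's prediction to the caller.
        return self.champion.predict(model_input)
```

When you promote a new version, you just move the `champion` alias and update the endpoint; the wrapper code itself stays unchanged.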