Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.

Provisioned concurrency of serving endpoints scales to zero

chidifrank
New Contributor II

Hi,

 
We provisioned the endpoint with 4 DBUs and disabled the scale_to_zero option. Even so, its provisioned concurrency randomly drops to 0. The logs available in the serving endpoint service give no insight into why.

We are currently provisioning the endpoint with 8 DBUs, but it still randomly drops to 4. What might be the issue?
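
One way to rule out a configuration mismatch is to check what the endpoint configuration actually reports for scale_to_zero on each served model. The snippet below is only a minimal sketch: it works on a hypothetical JSON body shaped like a serving-endpoint description, and the field names (`config`, `served_models`, `served_entities`, `scale_to_zero_enabled`, `workload_size`) as well as the endpoint and model names are assumptions that should be checked against the response your workspace returns.

```python
import json
from typing import Any, Dict


def scale_to_zero_report(endpoint: Dict[str, Any]) -> Dict[str, bool]:
    """Map each served model/entity name to its scale_to_zero_enabled flag."""
    config = endpoint.get("config", {})
    # Assumption: the list may appear under either key depending on API version.
    served = config.get("served_entities") or config.get("served_models") or []
    return {
        item.get("name", "<unnamed>"): bool(item.get("scale_to_zero_enabled", False))
        for item in served
    }


# Hypothetical response body; names and values are illustrative only.
sample = json.loads("""
{
  "name": "my-endpoint",
  "config": {
    "served_models": [
      {"name": "model-1", "workload_size": "Small", "scale_to_zero_enabled": false}
    ]
  }
}
""")

report = scale_to_zero_report(sample)
print(report)
```

If every flag comes back as `False` and the provisioned concurrency still drops, the cause is probably not this setting itself, which narrows down what to raise with support.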

chidifrank_0-1690968368091.png

 

 
1 REPLY

chidifrank
New Contributor II

Hi,

I apologize if my question wasn't clear; let me clarify it.
We are not using the scale_to_zero option, and we are not sending any warmup requests, so the endpoint should never scale to zero regardless of whether it receives traffic, right?

 

chidifrank_0-1691048861787.png

 
