cancel
Showing results for 
Search instead for 
Did you mean: 
Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.
cancel
Showing results for 
Search instead for 
Did you mean: 

Provisioned concurrency of serving endpoints scales to zero

chidifrank
New Contributor II

Hi,

 
We provisioned the endpoint with 4 DBUs and also disabled the scale_to_zero option. For some reason, it randomly drops to 0 provisioned concurrency. Logs available in the serving endpoint service are not insightful.
 
Currently, we are provisioning the endpoint with 8 DBUs but still, it randomly drops 4. What might be the issue?

chidifrank_0-1690968368091.png

 

 
1 REPLY 1

chidifrank
New Contributor II

Hi,

I apologize if my question wasn't clear; let me clarify it.
We are not using the scale_to_zero option and we are not doing any warmup requests so it should never scale to zero despite traffic or zero traffic right? 

 

chidifrank_0-1691048861787.png

 

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group