cancel
Showing results for 
Search instead for 
Did you mean: 
Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.
cancel
Showing results for 
Search instead for 
Did you mean: 

Endpoint performance questions

Kaizen
Valued Contributor

Hi! 
Had really interesting results from some endpoint performance tests I did. I set up the non-optimized endpoint with zero-cluster scaling and optimized had this feature disabled.

1) Why does the non-optimized endpoint have variable response time for 3600, 1800, and 600 seconds tests? If the serving cluster node scaled to 0 (due to no traffic) I would expect it to also require 240 seconds to start up and start serving again. 

- what is going on behind the scenes that results in this?

2) It was also interesting to see that the endpoint metrcs showed request error rates (top right graph). The endpoint didnt have any bad responses. Also the logs didnt show anything that would allude to this. Any idea why this would be the case? See blow for the metrics image.

3) I didnt find much information on this on the databricks documentation. Any additional documentation would be appreicated! Happy to sync with the team

non-optimized endpoint results

Kaizen_1-1710196442817.png


optimized endpoint results 

Kaizen_0-1710196408535.png

metrics log:

Kaizen_2-1710196880601.png

 

1 ACCEPTED SOLUTION

Accepted Solutions

Kaizen
Valued Contributor

Independently found the solution to item 2. Currently you cannot modify the 30 min time for scale to zero. 

Hope this helps someone in the future!

View solution in original post

5 REPLIES 5

Kaizen
Valued Contributor

Kaizen
Valued Contributor

Answering Q1: 
1) The variable response time is due to the first endpoint response time requiring ~180 seconds to scale to 1 cluster from 0

2) Can i change zero scale time from the preset 30 min?

Kaizen
Valued Contributor

Thanks for this. 

1) The odd values i got for 3600/1800/ etc was due to an outlier in my data so in general a response time of ~183 sec should be expected 

2) @Retired_mod can we adjust the scaling of the cluster from 30 min to something else?

Kaizen
Valued Contributor

@s_park / @Sujitha / @Debayan  could one of you address item 2?

Kaizen
Valued Contributor

Independently found the solution to item 2. Currently you cannot modify the 30 min time for scale to zero. 

Hope this helps someone in the future!

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group