
Full list of serving endpoint metrics returned by api/2.0/serving-endpoints/[ENDPOINT_NAME]/metrics

KyraHinnegan
Visitor

Hello! I'm looking at the documentation for this metrics endpoint: https://docs.databricks.com/aws/en/machine-learning/model-serving/metrics-export-serving-endpoint
It does not include a sample API response, and the code examples given don't show the full list of metric keys that can be returned.
These are the keys that I was able to find:
cpu_usage_percentage
mem_usage_percentage
provisioned_concurrent_requests_total
request_4xx_count_total
request_5xx_count_total
request_count_total
request_latency_ms - histogram (request_latency_ms_bucket, request_latency_ms_count, request_latency_ms_sum)
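
For reference, here is a minimal sketch of how keys like the ones above could be collected from a response. It assumes the endpoint returns Prometheus-style text exposition format (one `name{labels} value` sample per line, with `#` comment lines), which should be checked against a real response. The `host` and `token` parameters, the `example_label` label, and all sample values are placeholders made up for illustration, not taken from an actual endpoint.

```python
import re
import urllib.request

# One sample line in Prometheus text format: metric name, optional {labels}, value.
SAMPLE_LINE = re.compile(r"^([A-Za-z_:][A-Za-z0-9_:]*)(\{.*\})?\s+(\S+)")


def fetch_metrics_text(host: str, endpoint_name: str, token: str) -> str:
    """Fetch the raw metrics text. host and token are caller-supplied placeholders."""
    url = f"{host}/api/2.0/serving-endpoints/{endpoint_name}/metrics"
    req = urllib.request.Request(url, headers={"Authorization": f"Bearer {token}"})
    with urllib.request.urlopen(req) as resp:
        return resp.read().decode("utf-8")


def metric_names(text: str) -> list[str]:
    """Return the sorted, de-duplicated metric names found in the text."""
    names = set()
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and '# HELP' / '# TYPE' comment lines.
        if not line or line.startswith("#"):
            continue
        match = SAMPLE_LINE.match(line)
        if match:
            names.add(match.group(1))
    return sorted(names)


# Illustrative input only: the label and the values are invented.
SAMPLE = """\
# TYPE cpu_usage_percentage gauge
cpu_usage_percentage{example_label="demo"} 12.5
# TYPE request_latency_ms histogram
request_latency_ms_bucket{example_label="demo",le="100"} 3
request_latency_ms_count{example_label="demo"} 3
request_latency_ms_sum{example_label="demo"} 120
"""

if __name__ == "__main__":
    for name in metric_names(SAMPLE):
        print(name)
```

Running `metric_names(fetch_metrics_text(...))` against an endpoint would list whatever keys that endpoint actually exposes, so the same approach might also surface additional keys if the assumption about the format holds.
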

However, this list is missing the following GPU metrics:

[Attached image: KyraHinnegan_0-1767388845438.png]

What would the keys and the response structure look like for those? An output example would be very helpful.
Thanks!
