Hello! I'm looking at the documentation for this metric endpoint: https://docs.databricks.com/aws/en/machine-learning/model-serving/metrics-export-serving-endpoint
It does not include a sample API response, and the code examples given don't show the full list of metric keys that can be returned.
These are the keys that I was able to find:
cpu_usage_percentage
mem_usage_percentage
provisioned_concurrent_requests_total
request_4xx_count_total
request_5xx_count_total
request_count_total
request_latency_ms - histogram (request_latency_ms_bucket, request_latency_ms_count, request_latency_ms_sum)
However, this is missing the following GPU metrics:

What would the keys and the response structure look like for those? An output example would be very helpful.
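For context, here is a minimal sketch of how metric keys could be pulled out of a response, assuming the endpoint returns Prometheus/OpenMetrics text exposition format. The `SAMPLE` payload, its label names, and its values are made up for illustration and are not real output from the endpoint; `metric_keys` is a hypothetical helper name.

```python
import re

# Hypothetical payload in Prometheus text exposition format.
# Label names and values are invented; this is NOT a real API response.
SAMPLE = """\
# TYPE cpu_usage_percentage gauge
cpu_usage_percentage{endpoint_name="example"} 12.5
# TYPE request_count_total counter
request_count_total{endpoint_name="example"} 42
# TYPE request_latency_ms histogram
request_latency_ms_bucket{endpoint_name="example",le="100"} 40
request_latency_ms_bucket{endpoint_name="example",le="+Inf"} 42
request_latency_ms_count{endpoint_name="example"} 42
request_latency_ms_sum{endpoint_name="example"} 1234.5
"""

# A metric name is the leading identifier on each sample line.
NAME_RE = re.compile(r"[A-Za-z_:][A-Za-z0-9_:]*")


def metric_keys(text):
    """Return the sorted, de-duplicated metric names found in the payload."""
    keys = set()
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and '# HELP' / '# TYPE' comment lines.
        if not line or line.startswith("#"):
            continue
        match = NAME_RE.match(line)
        if match:
            keys.add(match.group(0))
    return sorted(keys)


if __name__ == "__main__":
    for key in metric_keys(SAMPLE):
        print(key)
```

If the GPU metrics follow the same format, they should show up as additional names in this list, which is why a real sample response would settle the question.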
Thanks!