Exposing Prometheus metrics from an inference server in Azure Machine Learning

At the moment we are running an inference server in Azure Machine Learning. We would like to expose its existing metrics to Prometheus and also create our own custom metrics, as described below.

Expose existing metrics:

I would like a breakdown of the number of requests by status, the average latency, and so on. I assume Azure keeps track of this somewhere, but I can't find it anywhere. Even if I could, I would still want to expose it so that Prometheus can scrape it.
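
To make the request-level breakdown concrete, here is a minimal sketch of tracking it ourselves in the scoring code, using only the Python standard library. It assumes the request handler can be wrapped; the names `observe_request`, `render_metrics`, and `inference_request_latency_seconds` are made up for illustration, and the status values "200" and "500" are placeholders for whatever the server actually returns. The output follows the Prometheus text exposition format for a summary (`_sum` and `_count` per label set).

```python
import threading
import time
from collections import defaultdict

# Hypothetical in-process store, keyed by status code.
_LOCK = threading.Lock()
_COUNT = defaultdict(int)      # number of requests per status
_SECONDS = defaultdict(float)  # total latency in seconds per status


def observe_request(handler, payload):
    """Call handler(payload) and record its status and elapsed time."""
    start = time.perf_counter()
    status = "200"  # assumed label for a successful call
    try:
        return handler(payload)
    except Exception:
        status = "500"  # assumed label for any unhandled error
        raise
    finally:
        elapsed = time.perf_counter() - start
        with _LOCK:
            _COUNT[status] += 1
            _SECONDS[status] += elapsed


def render_metrics():
    """Render the store in the Prometheus text exposition format."""
    name = "inference_request_latency_seconds"
    lines = [f"# TYPE {name} summary"]
    with _LOCK:
        for status in sorted(_COUNT):
            lines.append(f'{name}_sum{{status="{status}"}} {_SECONDS[status]}')
            lines.append(f'{name}_count{{status="{status}"}} {_COUNT[status]}')
    return "\n".join(lines) + "\n"
```

With this shape, the request count by status is the `_count` series, and the average latency can be derived in PromQL as `rate(..._sum[5m]) / rate(..._count[5m])`. In practice a client library such as `prometheus_client` would likely replace the hand-rolled store.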

Expose custom metrics:

Our inference code has a number of slow steps. We want to expose metrics for each section of this code so that we can see which parts of our application are the bottleneck.
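
As a rough illustration of the per-section timing we have in mind, the sketch below wraps each step in a context manager and serves the totals on a `/metrics` path. It uses only the Python standard library. The names `timed`, `render_section_metrics`, `serve_metrics`, the metric name, and port 9100 are all assumptions for the example, and whether an Azure Machine Learning deployment lets Prometheus reach an extra port is exactly the part we are unsure about.

```python
import threading
import time
from contextlib import contextmanager
from http.server import BaseHTTPRequestHandler, HTTPServer

_LOCK = threading.Lock()
_SECTIONS = {}  # section name -> [call count, total seconds]


@contextmanager
def timed(section):
    """Time the enclosed block and add it to the section's totals."""
    start = time.perf_counter()
    try:
        yield
    finally:
        elapsed = time.perf_counter() - start
        with _LOCK:
            stats = _SECTIONS.setdefault(section, [0, 0.0])
            stats[0] += 1
            stats[1] += elapsed


def render_section_metrics():
    """Render section totals in the Prometheus text exposition format."""
    name = "inference_section_duration_seconds"  # hypothetical metric name
    lines = [f"# TYPE {name} summary"]
    with _LOCK:
        for section, (count, total) in sorted(_SECTIONS.items()):
            lines.append(f'{name}_sum{{section="{section}"}} {total}')
            lines.append(f'{name}_count{{section="{section}"}} {count}')
    return "\n".join(lines) + "\n"


class MetricsHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        if self.path != "/metrics":
            self.send_error(404)
            return
        body = render_section_metrics().encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "text/plain; version=0.0.4")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def serve_metrics(port=9100):
    """Start the metrics endpoint in a background thread (port is assumed)."""
    server = HTTPServer(("0.0.0.0", port), MetricsHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server
```

Inside the inference code, each slow step would then be wrapped as `with timed("preprocess"): ...`, with one label value per section, so the slowest section shows up as the largest `_sum` divided by `_count`.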