Databricks Community

Zoumana · ‎11-13-2021

I trained my model and was able to get the batch prediction from that model as specified below. But I want to also get the probability scores for each prediction. Do you have any idea?

Thank you!

logged_model = path_to_model

# Load model as a PyFuncModel.

loaded_model = mlflow.pyfunc.load_model(logged_model)

# Predict on a Pandas DataFrame.

import pandas as pd

loaded_model.predict(pd.DataFrame(data))

SyedGhouri · ‎11-08-2022

Hi @Kaniz Fatma

The error said 'PyFuncModel' object has no attribute 'predict_proba'.

As shows above, I was using the following to load the model

loaded_model = mlflow.pyfunc.load_model(logged_model) and got the error.

After going through mlflow documentation, I changed it to

loaded_model = mlflow.sklearn.load_model(logged_model) and it is working fine.

It's all good now. Thanks for your time.

Syed

View solution in original post

Zoumana · ‎11-13-2021

Hi Kaniz,

Great to meet you too!

Thank you for replying to my question.

Best!

SyedGhouri · ‎11-08-2022

Hi @Kaniz Fatma

Sorry for hijacking the post.

My question is - if I am reading a registered model from mlflow, I can only see the option of .predict method but not .predict_proba.

Do we have any straightforward solution to get the probabilities?

Thanks

Syed

SyedGhouri · ‎11-08-2022

Hi @Kaniz Fatma

The error said 'PyFuncModel' object has no attribute 'predict_proba'.

As shows above, I was using the following to load the model

loaded_model = mlflow.pyfunc.load_model(logged_model) and got the error.

After going through mlflow documentation, I changed it to

loaded_model = mlflow.sklearn.load_model(logged_model) and it is working fine.

It's all good now. Thanks for your time.

Syed

SyedGhouri · ‎11-08-2022

Hi @Kaniz Fatma

I do not see the option to select "Best Answer" but feel free to do anything that you think can help this community.

Thanks

Syed

OndrejHavlicek · ‎08-08-2023

Now you can log the model using this parameter:

mlflow.sklearn.log_model(
    ...,  # the usual params
    pyfunc_predict_fn="predict_proba"
)

which will return probabilities for the first class apparently when using the model for inference (e.g. when loading it using mlflow.pyfunc.spark_udf() ).

Databricks Community

How to get probability score for each prediction from mlflow

Connect with Databricks Users in Your Area

Databricks Learning Festival (Virtual): 10 October - 31 October

Databricks Hybrid Learning Day - New York City

Databricks Migration Strategy: Lessons Learned

What’s New With Databricks Assistant?

Introducing Simple, Fast, and Scalable Batch LLM Inference on Mosaic AI Model Serving