cancel
Showing results for 
Search instead for 
Did you mean: 
Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.
cancel
Showing results for 
Search instead for 
Did you mean: 

Creating an Input Schema for Multiple DataFrames in MLflow

ecram
New Contributor

Hi everyone,

I am working with MLflow version 2.5.0 and need to create an input_schema for my model. My data schema is divided into several DataFrames, for example:

{
"dataframe_split": {
     "columns": ["ClientGuid", "Instance", "TypeScore", ...],
     "data": [["26c5f035-cf57-4cb7-b67c-55da528ce3e0", "1", "POST", ...]]
},
"dataframe_loan_split": {
     "columns": ["System", "Entity", "Credit", ...],
     "data": [["R", "Entity1", "M1 - MIC. IND. DEB. GAR.", ...]]
},
...
}

Could someone provide an example of how to define an input_schema using MLflow for this type of structure? I am looking for a way to specify all the DataFrames within the input schema.

Thanks for your help.

1 REPLY 1

Kaniz_Fatma
Community Manager
Community Manager

Hi @ecramTo create an input schema for multiple DataFrames in MLflow, you can use the mlflow.types.schema module to define the schema for each DataFrame and then combine them. 

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group