Databricks Community

User16826992666 · ‎06-16-2021

sean_owen · ‎06-17-2021

There shouldn't be. Generally speaking, models will be serialized according to their 'native' format for well-known libraries like Tensorflow, xgboost, sklearn, etc. Custom model will be saved with pickle. The files exist on distributed storage as artifacts. MLflow can and does log additional metadata with the model, like its schema, sample input, environment requirements, but these are tiny additional files.

View solution in original post

sean_owen · ‎06-17-2021

There shouldn't be. Generally speaking, models will be serialized according to their 'native' format for well-known libraries like Tensorflow, xgboost, sklearn, etc. Custom model will be saved with pickle. The files exist on distributed storage as artifacts. MLflow can and does log additional metadata with the model, like its schema, sample input, environment requirements, but these are tiny additional files.

Databricks Community

Is there any file size overhead when I save models using MLflow?

Databricks AMER Learning Festival | Virtual Training

Introducing the Genie Hub: Ask Questions, Share Builds, and Master Conversational Analytics

🌟 Community Pulse: Your Weekly Roundup! July 13 – 19, 2026

Solution Accelerator Series | Social Determinants of Health

Upcoming Community BrickTalk | Sports Analytics: Turning Tracking Data into Real-Time AI Decisions