Hi @FutureLegend, To download the Dolly-v2-7b model on your local machine, you can use MLflow Export-Import to migrate MLflow models from one workspace to another. You can copy the model from one workspace to another, from a development to a production workspace.
However, it would be best if you had MLflow 2.3 installed, and any cluster with the Hugging Face transformers library installed can be used for batch inference. The transformers library comes preinstalled on 10.4 LTS ML and above.
Here are the steps you can follow:
1. Export the Dolly-v2-7b model from your Databricks workspace using MLflow Export-Import.
2. Download the exported model to your local machine.
3. Install the Hugging Face transformers library on your local machine.
4. Load the exported model to your local machine using the transformers library.
Sources:
- https://docs.databricks.com/machine-learning/manage-model-lifecycle/workspace-model-registry.html
- https://docs.databricks.com/machine-learning/train-model/huggingface/model-inference-nlp.html