hi, hope you are fine
I deployed a model 3 or 2 months ago using Databricks Serving and MLFlow. The model worked good using GPU from model serving.
I stopped using it for some months and when I tried again deploying it, it has some errors.
1. [FIXED] A dependency error because it was changed since that time, I added this dependency and fixed it:
huggingface_hub==0.23.2
2. When i'm deploying it i get this error and it stops the deploy and update:
An error occurred while loading the model: Exceeded maximum wait time of 300 seconds to load the model
I ask, is there an environment variable to change the timeout of 300 seconds?, I tried adding these variables:
Thanks!