
Error accessing file from dbfs inside mlflow serve endpoint

sanjay
Valued Contributor II

Hi,

I have an MLflow model served on a serverless GPU endpoint. It takes an audio file name as input, and that file is then passed as a parameter to a Hugging Face model inside the predict method. However, I am getting the following error:

HFValidationError(\nhuggingface_hub.utils._validators.HFValidationError: Repo id must be in the form 'repo_name' or 'namespace/repo_name': '/dbfs/tmp'. Use `repo_type` argument if needed.\n"}

Appreciate any help.

Regards,

Sanjay

 

2 REPLIES

sanjay
Valued Contributor II

Thank you @Retired_mod for the prompt response. I am able to load the model from Hugging Face correctly. The issue is with loading the input audio file, which is stored in a directory on Databricks DBFS. The error occurs only after creating the serverless endpoint. Without the serverless endpoint, I can load the model from the registry, read the incoming audio file from the same location, and process it.
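
A possible reading of the error above: Hugging Face loaders generally treat a string that is not an existing local path as a Hub repo id, which would explain why `/dbfs/tmp` shows up in an `HFValidationError` if the DBFS mount is not visible inside the serving container. Below is a minimal sketch of a guard that fails early with a clearer message, assuming that is the cause. `require_local_file` is a hypothetical helper name, not part of MLflow or Hugging Face.

```python
import os


def require_local_file(path: str) -> str:
    """Return `path` unchanged if it is an existing file, else raise a clear error.

    Hypothetical helper: call it inside predict() before handing the path
    to any loader that might otherwise reinterpret the string.
    """
    if not os.path.isfile(path):
        parent = os.path.dirname(path) or "."
        raise FileNotFoundError(
            f"Input file {path!r} is not visible in this environment "
            f"(parent directory exists: {os.path.isdir(parent)})"
        )
    return path
```

If the parent directory is reported as missing inside the endpoint but present in a notebook, the storage mount itself is likely unavailable to the serving container, rather than the file name being wrong.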

txti
New Contributor III

I have the same issue.
I have a large file that I cannot access from an MLFlow service.
Things I have tried (none of these work):

  • Read-only from DBFS
    • `dbfs:/myfolder/myfile.chroma` does not work
    • `/dbfs/myfolder/myfile.chroma` does not work
  • Read-only from Unity Catalog Volume
    • `/Volumes/mycatalog/mydb/myfolder/myfile.chroma` does not work
  • Read-only from S3 storage
    • `s3://mybucket/mydb/myfolder/myfile.chroma` does not work

So far, the only thing that works is packaging the huge file as an MLflow artifact and accessing it locally in the service (awful).
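
For anyone who needs the artifact workaround in the meantime, here is a rough sketch of what it can look like with a custom pyfunc model. The class name `FileBackedModel` and the artifact key `embed_db` are made up for illustration, and the `predict` body is only a placeholder. The `try`/`except` around the import is there solely so the snippet can be run without MLflow installed.

```python
import os

try:
    from mlflow.pyfunc import PythonModel
except ImportError:  # fallback so the sketch can run without mlflow installed
    PythonModel = object


class FileBackedModel(PythonModel):
    """Hypothetical model that reads a large file bundled as an MLflow artifact."""

    def load_context(self, context):
        # context.artifacts maps artifact names to local paths in the container.
        # "embed_db" is an assumed key chosen when the model is logged.
        self.db_path = context.artifacts["embed_db"]
        if not os.path.exists(self.db_path):
            raise FileNotFoundError(f"Artifact not found at {self.db_path!r}")

    def predict(self, context, model_input, params=None):
        # Placeholder: a real model would open and query self.db_path here.
        return {"db_path": self.db_path, "n_inputs": len(model_input)}
```

When logging the model, the file would be supplied through the `artifacts` argument of `mlflow.pyfunc.log_model`, for example `artifacts={"embed_db": "/Volumes/mycatalog/mydb/myfolder/myfile.chroma"}`, so that it is copied alongside the model and resolved to a local path at load time.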

The errors are all similar to:

`ValueError: Dataset at path Volumes/non_prod/metrics/files/recommender/embed-db.chroma was not found`

Notice that the path strips the leading `/` from `/Volumes/` and does the same with `s3://` protocol, etc.

I can't use MLFlow endpoints without this very basic functionality.
