Databricks Community

AChang · 01-25-2024

So, I didn't quite set up my model training output directory correctly, and it saved all my model files to the workspace in the git repo I was working in. I am trying to move these files to DBFS, but when I try using dbutils.fs.mv, I get this error: ...

AChang · 10-17-2023

I am attempting to log, register, and deploy a finetuned GPT2 model in Databricks. While I have been able to get my logging code to run, when I try to run my registration code, I get an MlflowException error.Here is my model logging code.mlflow.set_r...

AChang · 09-15-2023

I am trying to deploy a model in the serving endpoints section, but it keeps failing after attempting to create for an hour. Here are the service logs:Container failed with: 9 +0000] [115] [INFO] Booting worker with pid: 115[2023-09-15 19:15:35 +0000...

AChang · 08-22-2023

I am following along with this notebook found from this article. I am attempting to fine tune the model with a single node and multiple GPUs, so I run everything up to the "Run Local Training" section, but from there I skip to "Run distributed traini...

AChang · 08-08-2023

I have a pyspark dataframe, 61k rows, 3 columns, one of which is a string column which has a max length of 4k. I'm doing about 100 different regexp_replace functions on this dataframe, so, very resource intensive. I'm trying to write this to a delta ...

AChang · 04-15-2024

Hey @KYX , I don't believe I ever did. You can try to configure the CLI in the ephemeral terminal in the notebook, but it really shouldn't be necessary to do so, so I think something else has to be up.

AChang · 01-25-2024

Figured it out, just had to use the !cp command, here is what I did, worked perfectly.!cp -r /Workspace/Repos/$RESTOFPATH /dbfs/folder and it put the entire folder i was trying to move, into that dbfs folder.

AChang · 09-15-2023

I am having the same issue on the large compute! Except my error looks like[rkxn8] [2023-09-15 19:49:24 +0000] [2] [INFO] Starting gunicorn 21.2.0[rkxn8] [2023-09-15 19:49:24 +0000] [2] [INFO] Listening at: http://0.0.0.0:8080 (2)[rkxn8] [2023-09-15 ...

Databricks Community

User Stats

User Activity

Move a folder from Workspace to DBFS

MlflowException: Unable to download model artifacts in Databricks while registering model with MLflo

Model Serving Endpoint keeps failing with SIGKILL error

How to fix this runtime error in this Databricks distributed training tutorial workbook

Best Cluster Setup for intensive transformation workload

Re: How to fix this runtime error in this Databricks distributed training tutorial workbook

Re: Move a folder from Workspace to DBFS

Re: Serving API endpoint failing