Machine Learning

by cbossi • New Contributor III

4 weeks ago

171 Views
1 reply
1 kudos

Resolved! Options for sporadic (and cost-efficient) Model Serving on Databricks?

Hi all, I'm new to Databricks, so I would appreciate some advice. I have an ML model deployed using Databricks Model Serving. My use case is very sporadic: I only need to make 5–15 prediction requests per day (industrial application), and there can be long...

Latest Reply

KaushalVachhani
Databricks Employee

4 weeks ago

1 kudos

Hi @cbossi, you are right! The endpoint scales down only after a 30-minute idle period. You are billed for the compute resources used during this period, plus the actual serving time when requests are made. This is the current expected behaviour. Y...
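To make the billing behaviour described in this reply concrete, here is a rough sketch of the arithmetic. It assumes each request keeps the endpoint up for its serving time plus the 30-minute idle window, and that overlapping windows merge. The function name `billed_uptime` and the `serve_time` parameter are hypothetical, and cold-start time is ignored.

```python
from datetime import datetime, timedelta

# Idle period before scale-down, taken from the reply above.
IDLE_WINDOW = timedelta(minutes=30)


def billed_uptime(request_times, serve_time=timedelta(0), idle_window=IDLE_WINDOW):
    """Estimate how long the endpoint stays up for a set of request timestamps.

    Assumption: every request extends uptime to (request time + serve_time +
    idle_window); a request arriving while the endpoint is still up merges
    into the current uptime window instead of starting a new one.
    """
    total = timedelta(0)
    window_start = None
    window_end = None
    for t in sorted(request_times):
        end = t + serve_time + idle_window
        if window_end is None or t > window_end:
            # Endpoint had already scaled down: close the previous window.
            if window_end is not None:
                total += window_end - window_start
            window_start, window_end = t, end
        else:
            # Endpoint still up: extend the current window.
            window_end = max(window_end, end)
    if window_end is not None:
        total += window_end - window_start
    return total


day = datetime(2025, 1, 1)
requests = [day.replace(hour=9), day.replace(hour=9, minute=10), day.replace(hour=15)]
print(billed_uptime(requests))  # 1:10:00 (40 min for the merged morning window + 30 min)
```

Under these assumptions, a handful of widely spaced requests is billed mostly for idle time rather than serving time, which is the effect the original question is about.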

by spearitchmeta • Contributor

10-23-2025 1:57:58 AM

267 Views
1 reply
1 kudos

Resolved! How does Databricks AutoML handle null imputation for categorical features by default?

Hi everyone, I’m using Databricks AutoML (classification workflow) on Databricks Runtime 10.4 LTS ML+, and I’d like to clarify how missing (null) values are handled for categorical (string) columns by default. From the AutoML documentation, I see that:...

Latest Reply

Louis_Frolio
Databricks Employee

10-23-2025 12:07:39 PM

1 kudos

Hello @spearitchmeta, I looked internally to see if I could help with this and found some information that will shed light on your question. Here’s how missing (null) values in categorical (string) columns are handled in Databricks AutoML on Dat...

by tarunnagar • New Contributor III

10-15-2025 4:48:36 AM

515 Views
1 reply
1 kudos

Best Practices for Collaborative Notebook Development in Databricks

Hi everyone! I’m looking to learn more about effective strategies for collaborative development in Databricks notebooks. Since notebooks are often used by multiple data scientists, analysts, and engineers, managing collaboration efficiently is critic...

Latest Reply

AbhaySingh
Databricks Employee

10-16-2025 7:57:29 AM

1 kudos

For version control, use this approach.

Git Integration with Databricks Repos

Core Features:
- Databricks Git Folders (Repos) provides native Git integration with visual UI and REST API access
- Supports all major providers: GitHub, GitLab, Azure DevOps, Bi...

by spicysheep • New Contributor II

05-28-2025 10:46:48 PM

1552 Views
3 replies
1 kudos

Distributed SparkXGBRanker training: failed barrier ResultStage

I'm following a variation of the tutorial [here](https://assets.docs.databricks.com/_extras/notebooks/source/xgboost-pyspark-new.html) to train a `SparkXGBRanker` in distributed mode. However, the line `pipeline_model = pipeline.fit(data)` is throwing...

Latest Reply

NandiniN
Databricks Employee

10-03-2025 9:14:01 PM

1 kudos

You have already mentioned that you turned off autoscaling; please try setting num_workers too.

Step 1: Disable Dynamic Resource Allocation: Use spark.dynamicAllocation.enabled = false

Step 2: Configure num_workers to Match Fixed Resources: After disabling dy...
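As a rough illustration of why `num_workers` should match fixed resources: a barrier stage needs all of its tasks to start at the same time, so the number of workers must not exceed the cluster's task slots. The sketch below is plain Python with hypothetical helper names (`total_task_slots`, `check_num_workers`). It assumes one Spark task per XGBoost worker and that `task_cpus` mirrors the `spark.task.cpus` setting.

```python
def total_task_slots(num_executors, cores_per_executor, task_cpus=1):
    """Number of tasks a fixed-size cluster can run concurrently.

    Assumption: each executor offers cores_per_executor // task_cpus slots,
    where task_cpus corresponds to spark.task.cpus.
    """
    return num_executors * (cores_per_executor // task_cpus)


def check_num_workers(num_workers, num_executors, cores_per_executor, task_cpus=1):
    """Return num_workers if every worker can be scheduled at once, else raise."""
    slots = total_task_slots(num_executors, cores_per_executor, task_cpus)
    if num_workers > slots:
        raise ValueError(
            f"num_workers={num_workers} exceeds the {slots} available task slots; "
            "a barrier stage cannot launch all of its tasks together"
        )
    return num_workers


# Example: 4 fixed executors with 8 cores each, 2 CPUs per task -> 16 slots.
print(check_num_workers(16, num_executors=4, cores_per_executor=8, task_cpus=2))  # 16
```

The value that passes this check is what would be given as `num_workers` to the estimator. With autoscaling or dynamic allocation enabled, the slot count is not fixed, which is why the reply asks for both to be turned off first.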

by the_p_l • New Contributor

07-14-2025 4:25:15 AM

927 Views
1 reply
0 kudos

Lakehouse monitoring generates broken queries

Hi everyone, I’m setting up Databricks Lakehouse Monitoring to track my model’s performance using an inference-regression monitor. I’ve completed all the required configuration and successfully launched my first monitoring run. The quality tables are g...

Latest Reply

Louis_Frolio
Databricks Employee

09-24-2025 9:42:28 AM

0 kudos

Hi @the_p_l, I want to confirm that I understand your situation correctly. You mentioned that you are not adding any custom code to the deployed Lakehouse Monitoring setup, and you believe the issue is related to the inline comments generated during ...

by AlkaSaliss • New Contributor II

08-25-2025 2:12:06 AM

784 Views
3 replies
2 kudos

Unable to register Scikit-learn or XGBoost model to Unity Catalog

Hello, I'm following the tutorial provided here https://docs.databricks.com/aws/en/notebooks/source/mlflow/mlflow-classic-ml-e2e-mlflow-3.html for the ML model management process using MLflow, in a Unity Catalog-enabled workspace. However, I'm facing an ...

Latest Reply

gbhatia
New Contributor II

09-22-2025 8:13:29 AM

2 kudos

Maybe add the missing calls:
mlflow.set_tracking_uri("databricks")
mlflow.set_registry_uri("databricks")
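The two calls suggested in this reply could be wrapped as in the sketch below. `configure_mlflow` is a hypothetical helper, not part of MLflow. The `use_unity_catalog` switch reflects an assumption that a Unity Catalog workspace may need the registry URI `"databricks-uc"` instead of `"databricks"`; the thread excerpt does not confirm which one resolved the issue.

```python
TRACKING_URI = "databricks"


def configure_mlflow(mlflow_module, use_unity_catalog=False):
    """Point MLflow tracking and the model registry at Databricks.

    mlflow_module is the imported `mlflow` package, passed in so the helper
    can be exercised without a workspace. Returns the registry URI it set.
    """
    # Assumption: "databricks-uc" targets Unity Catalog, "databricks" targets
    # the workspace model registry.
    registry_uri = "databricks-uc" if use_unity_catalog else "databricks"
    mlflow_module.set_tracking_uri(TRACKING_URI)
    mlflow_module.set_registry_uri(registry_uri)
    return registry_uri
```

In a notebook this might be called as `import mlflow; configure_mlflow(mlflow, use_unity_catalog=True)` before logging and registering the model.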

by gbhatia • New Contributor II

09-15-2025 10:41:00 AM

1048 Views
3 replies
1 kudos

Endpoint deployment is very slow

Hi team, I am testing some changes in a UAT/DEV environment and noticed that model endpoints are very slow to deploy. Since the environment is just for testing and not serving any production traffic, I was wondering if there was a way to expedite this ...

Latest Reply

gbhatia
New Contributor II

09-22-2025 7:53:38 AM

1 kudos

Hi @WiliamRosa, thanks for your response on this. I have been using the settings you described above, with the exception of `scale_to_zero`. PFA a screenshot of the endpoint settings. My deployment is a simple PyTorch deep learning model wrapped in a `s...

by Edwin1 • New Contributor III

09-06-2025 10:28:19 AM

1666 Views
4 replies
4 kudos

Resolved! Distributed Optuna and MLflow

Hello all, I just tried running the following notebook (https://docs.databricks.com/aws/en/notebooks/source/machine-learning/optuna-mlflow.html) on the Databricks Free Edition platform, through Microsoft Account Authentication. It takes 15 minutes ...

Latest Reply

Edwin1
New Contributor III

09-06-2025 11:22:40 AM

4 kudos

Great. Thank you. That worked. I still need more compute and networking resources to make it justifiable, but this confirms that it works!

by Junqueira • New Contributor II

07-29-2025 6:29:48 AM

889 Views
1 reply
1 kudos

[ERROR] Worker (pid:11) was sent code 132 When deploying a Custom Model in serving

Hi, I've been developing a custom model with mlflow.pyfunc.PythonModel. Among other libs, I use ANNOY. I have been trying to serve the model as an endpoint in "Serving". After a few fixes, my model worked fine, as did the endpoint call. Although I tried updat...

Latest Reply

WiliamRosa
Contributor III

08-17-2025 6:48:16 AM

1 kudos

Great observation! The difference between "Using worker: sync" and "Using worker: gevent" typically refers to the worker class used by Gunicorn, the web server behind many MLflow model deployments (like in Databricks model serving or other MLflow-compati...

by Dnirmania • Contributor

07-11-2025 5:03:27 AM

1599 Views
2 replies
3 kudos

Resolved! Serving Endpoint: Container image creation

Hi Team, whenever I try to create an endpoint from a model in Databricks, the process often gets stuck at the 'Container Image Creation' step. I've tried to understand what happens during this step, but couldn't find any detailed or helpful information...

Latest Reply

Dnirmania
Contributor

07-28-2025 6:45:29 AM

3 kudos

Thank you @Vidhi_Khaitan for sharing the detailed process.

by CelGuillau • New Contributor III

07-22-2025 6:29:57 AM

3181 Views
5 replies
3 kudos

Resolved! This API is disabled for users without the databricks-sql-access

Running a deploy on GitHub:
Run databricks bundle deploy
databricks bundle deploy
shell: /usr/bin/bash -e {0}
env:
  DATABRICKS_HOST: {{HOST}}
  DATABRICKS_CLIENT_ID: {{ID}}
  DATABRICKS_CLIENT_SECRET: ***
  DATABRICKS_BUNDLE_ENV: prod
Error: This API is disabled for...

Latest Reply

CelGuillau
New Contributor III

07-24-2025 11:01:56 AM

3 kudos

Got it working. Yes, it was a little confusing at first: the workspace displayed at the top right is the account information, whereas the profile icon is where you can access the workspace settings. For anyone that got as confused as I did. Thank...

by Sachin_Amin • New Contributor II

07-09-2025 12:58:30 PM

1002 Views
1 reply
1 kudos

Resolved! Model Inferencing

Any links or pointers on hosting a model in real time (similar to a SageMaker endpoint in AWS)? How can we host a model in DBX in real time? Any documentation, please?

Latest Reply

jamesl
Databricks Employee

07-10-2025 9:43:24 AM

1 kudos

@Sachin_Amin you can find an example in our docs here: https://docs.databricks.com/aws/en/machine-learning/model-serving/model-serving-intro We also have free training courses on realtime model deployment for both classical ML (https://www.databricks...

by Dharma25 • New Contributor III

05-01-2025 6:47:34 AM

3493 Views
2 replies
2 kudos

Workflow not picking up correct host value (while working with MLflow model registry URI)

Exception: mlflow.exceptions.MlflowException: An API request to https://canada.cloud.databricks.com/api/2.0/mlflow/model-versions/list-artifacts failed due to a timeout. The error message was: HTTPSConnectionPool(host='canada.cloud.databricks.com', p...

Latest Reply

Dharma25
New Contributor III

05-16-2025 6:33:47 AM

2 kudos

Thanks for the answer. I will try this solution

by DaPo • New Contributor III

05-07-2025 6:23:29 AM

2150 Views
2 replies
0 kudos

Model Serving Endpoint: Cuda-OOM for Custom Model

Hello all, I am tasked with evaluating a new LLM for some use cases. In particular, I need to build a POC for a chatbot based on that model. To that end, I want to create a custom Serving Endpoint for an LLM pulled from Hugging Face. The model itself is...

Latest Reply

sarahbhord
Databricks Employee

05-07-2025 9:52:42 AM

0 kudos

Here are some suggestions:

1. Update conda.yaml. Replace the current config with this optimized version:

channels:
  - conda-forge
dependencies:
  - python=3.10  # 3.12 may cause compatibility issues
  - pip
  - pip:
    - mlflow==2.21.3
    - torch...

by Sri2025 • New Contributor

04-16-2025 1:35:34 PM

982 Views
1 reply
0 kudos

Not able to run end to end ML project on Databricks Trial

I started using the Databricks trial version today. I want to explore the full end-to-end ML lifecycle on Databricks. I observed that only the 'serverless' compute option is available. I was trying to execute the notebook posted on https://docs.datab...

Latest Reply

Louis_Frolio
Databricks Employee

04-17-2025 8:15:56 AM

0 kudos

It can take up to 15 minutes for the serving endpoint to be created. Once you initiate the "create endpoint" chunk of code, go and grab a cup of coffee and wait 15 minutes. Then, before you use it, verify it is running (bottom left menu "Serving") by g...
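Instead of waiting a fixed 15 minutes, the check described in this reply could be automated with a small polling loop. This is a sketch under assumptions: `wait_until_ready` and `get_state` are hypothetical names, and `get_state` stands in for whatever call returns the endpoint's readiness. One possibility is a request to the serving-endpoints REST API that reads a readiness field such as `"READY"`, but that mapping is an assumption here.

```python
import time


def wait_until_ready(get_state, timeout_s=15 * 60, poll_s=30,
                     sleep=time.sleep, clock=time.monotonic):
    """Poll get_state() until it returns "READY" or the timeout elapses.

    get_state: zero-argument callable returning the endpoint's readiness
               string (hypothetical; supply your own implementation).
    Returns True if the endpoint became ready, False on timeout.
    """
    deadline = clock() + timeout_s
    while True:
        if get_state() == "READY":
            return True
        if clock() >= deadline:
            return False
        sleep(poll_s)


# Example with a stand-in state source that becomes ready on the third check.
states = iter(["NOT_READY", "NOT_READY", "READY"])
print(wait_until_ready(lambda: next(states), poll_s=0))  # True
```

The default 15-minute timeout mirrors the upper bound mentioned in the reply. Passing `sleep` and `clock` as parameters is only there to make the loop easy to test.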
