Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.

Forum Posts

Vlad96
by Databricks Partner
  • 1934 Views
  • 3 replies
  • 0 kudos

My model serving endpoint is never getting created

Hello, I'm trying to serve a PyFunc model on a Databricks endpoint, but for some reason it's getting stuck in a pending status. It's been 4 hours since the endpoint deployment started. If I check the build logs, no error appears whatsoever. #23 0.133 chan...

(screenshot attached)
Latest Reply
ThijsBertramCZ
New Contributor II
  • 0 kudos

Have you ever found a fix? I am experiencing the same issue.

2 More Replies
excavator-matt
by Contributor III
  • 3879 Views
  • 9 replies
  • 3 kudos

Resolved! What is the most efficient way of running sentence-transformers on a Spark DataFrame column?

We're trying to run the bundled sentence-transformers library from SBert in a notebook running Databricks ML 16.4 on an AWS g4dn.2xlarge [T4] instance. However, we're experiencing out-of-memory crashes and are wondering what the optimal way to run sentenc...

Machine Learning
memory issues
sentence-transformers
vector embeddings
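One memory-bounding pattern that applies to questions like this, regardless of how the Spark integration ends up, is encoding the column in fixed-size batches instead of handing every row to the model at once. A minimal sketch, where `encode` is a stand-in for a sentence-transformers `model.encode` call (the stub and names are illustrative, not taken from the thread):

```python
def encode_in_batches(texts, encode, batch_size=64):
    """Encode texts in fixed-size batches to bound peak memory.

    `encode` stands in for something like model.encode from
    sentence-transformers; only one batch is in flight at a time.
    """
    out = []
    for i in range(0, len(texts), batch_size):
        out.extend(encode(texts[i:i + batch_size]))
    return out
```

On a GPU instance, a smaller `batch_size` trades throughput for headroom; the same loop shape also fits inside a Spark pandas UDF applied per partition.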
Latest Reply
excavator-matt
Contributor III
  • 3 kudos

Also, I forgot to mention the workaround solution for the first approach. If you write to Parquet in a volume, you can then convert it back to a Delta table in a later cell. Instead of this: projects_pdf.to_delta("europe_prod_catalog.ad_hoc.project_reco...

8 More Replies
Dali1
by New Contributor III
  • 507 Views
  • 2 replies
  • 2 kudos

Resolved! Python environment DAB

Hello, I am building a pipeline using DAB. The first step of the DAB is to deploy my library as a wheel. The pipeline runs on a shared Databricks cluster. When I run the job, I see that the job is not using exactly the requirements I specified but it us...

Latest Reply
stbjelcevic
Databricks Employee
  • 2 kudos

Hi @Dali1, +1 to @pradeep_singh: on shared clusters, tasks inherit cluster-installed libraries, so you won’t get a clean, versioned environment. Use a job cluster (new_cluster) or switch to serverless jobs with an environment per task for isolation. ...

1 More Replies
Dali1
by New Contributor III
  • 386 Views
  • 1 replies
  • 0 kudos

Resolved! Install library in notebook

Hello, I tried installing a custom library in my Databricks notebook; the library is in a Git folder of my workspace. The installation looks successful and I saw the library in the list of libraries, but when I want to import it I get: ModuleNotFoundError: No mod...

Latest Reply
Dali1
New Contributor III
  • 0 kudos

Just found the issue: installing in editable mode doesn't work; you have to install it as a library. I don't know why.

Deep_Blue_Whale
by New Contributor
  • 314 Views
  • 1 replies
  • 0 kudos

Error starting or creating custom model serving endpoints - 'For input string: ""'

Hi Databricks Community, I'm having issues starting or creating custom model serving endpoints. When going to Serving endpoints > selecting the endpoint > Start, I get the error message 'For input string:'. This endpoint had worked correctly yesterda...

(screenshots attached)
Latest Reply
emma_s
Databricks Employee
  • 0 kudos

Hi, sorry you're having this issue. You mentioned you've tried to recreate the endpoint with this model and with other custom models but are still seeing the same error. Have you tried serving one of the foundation models to see if that works, or a really s...

Dali1
by New Contributor III
  • 456 Views
  • 1 replies
  • 0 kudos

Resolved! Databricks SDK vs bundles

Hello, in this article: https://www.databricks.com/blog/from-airflow-to-lakeflow-data-first-orchestration I understand that if I want to create and deploy an ML pipeline in production, the recommendation is to use Databricks Asset Bundles. But by using it ...

Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 0 kudos

Hi @Dali1, when you deploy with Asset Bundles, DAB keeps track of what’s already been deployed and what has changed. That means it only updates what needs updating, detects drift between your desired state and the workspace, and lets you generate plans/di...

AlkaSaliss
by New Contributor II
  • 1340 Views
  • 4 replies
  • 2 kudos

Unable to register Scikit-learn or XGBoost model to unity catalog

Hello, I'm following the tutorial provided here https://docs.databricks.com/aws/en/notebooks/source/mlflow/mlflow-classic-ml-e2e-mlflow-3.html for the ML model management process using MLflow, in a Unity Catalog-enabled workspace; however, I'm facing an ...

Latest Reply
joelramirezai
Databricks Employee
  • 2 kudos

You need to ensure that your Unity Catalog catalog and schema already exist, that you have the necessary permissions to use them, and that you update the code to reference your own catalog and schema names. You must also run on a classic cluster with...

3 More Replies
p4pratikjain
by Contributor
  • 4272 Views
  • 3 replies
  • 0 kudos

DAB - Add/remove task depending on workspace.

I use DAB for deploying jobs. I want to add a specific task in dev only, but not in staging or prod. Is there any way to achieve this using DAB?

Latest Reply
Pat
Esteemed Contributor
  • 0 kudos

I know it's a bit old, but if someone is looking for a solution: I was able to resolve the issue where I needed to deploy some jobs only into the DEV target: https://github.com/databricks/bundle-examples/tree/main/knowledge_base/target_includes. Us...

2 More Replies
Danik
by Databricks Partner
  • 1213 Views
  • 2 replies
  • 3 kudos

Resolved! Population stability index (PSI) calculation in Lakehouse monitor

Hi! We are using Lakehouse Monitoring for detecting data drift in our metrics. However, the exact calculation of the metrics is not documented anywhere (I couldn't find it), and that raises questions about how they are done, in our case especially PSI. I woul...

Latest Reply
iyashk-DB
Databricks Employee
  • 3 kudos

Hi @Danik, I have reviewed this. 1) Is there documentation for PSI and other metrics? Public docs list PSI in the drift table and give thresholds, but don’t detail the exact algorithm. Internally, numeric PSI uses ~1000 quantiles, equal-height binning...
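Based on the description above (quantile edges computed on the baseline, equal-height binning), a plain-Python PSI sketch looks roughly like this. It illustrates the standard PSI formula under those assumptions, not Databricks' exact internal implementation; bin count and the epsilon for empty bins are illustrative choices:

```python
import math

def psi(expected, actual, bins=10):
    """Population Stability Index with equal-height (quantile) binning
    derived from the baseline ('expected') distribution."""
    expected_sorted = sorted(expected)
    n = len(expected_sorted)
    # Bin edges are quantiles of the baseline, giving equal-height bins.
    edges = [expected_sorted[int(n * i / bins)] for i in range(1, bins)]

    def bin_fractions(values):
        counts = [0] * bins
        for v in values:
            counts[sum(v > e for e in edges)] += 1  # locate v's bin
        # Floor at a small epsilon so log() never sees a zero fraction.
        return [max(c / len(values), 1e-6) for c in counts]

    e_frac = bin_fractions(expected)
    a_frac = bin_fractions(actual)
    return sum((a - e) * math.log(a / e) for e, a in zip(e_frac, a_frac))
```

Identical distributions score 0; the further the actual distribution drifts from the baseline, the larger the index.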

1 More Replies
naveen_marthala
by Contributor
  • 14846 Views
  • 5 replies
  • 3 kudos

Resolved! How to PREVENT mlflow's autologging from logging ALL runs?

I am logging runs from a Jupyter notebook. The cells which have `mlflow.sklearn.autolog()` behave as expected, but the cells which call .fit() on sklearn's estimators are also being logged as runs without explicitly mentioning `mlflo...

Latest Reply
alexsheer9003
New Contributor II
  • 3 kudos

NICE TIP! 

4 More Replies
thomasm
by New Contributor II
  • 537 Views
  • 4 replies
  • 1 kudos

MLflow Detailed Trace view doesn't work in some workspaces

I've created a Databricks Model Serving endpoint which serves an MLflow PyFunc model. The model uses LangChain and I'm using mlflow.langchain.autolog(). At my company we have some production(-like) workspaces where users cannot e.g. run notebooks and ...

(screenshots attached)
Latest Reply
thomasm
New Contributor II
  • 1 kudos

Hi Jahnavi, thanks for your reply. I think the issues you mentioned are not the cause of the discrepancy, though. I have attached a screenshot of the same trace ID when displayed in the Experiments UI (where I cannot get a detailed trace view) and in t...

3 More Replies
tonybenzu99
by New Contributor II
  • 1444 Views
  • 2 replies
  • 3 kudos

Resolved! Is Delta Lake deeply tested in Professional Data Engineer Exam?

I wanted to ask people who have already taken the Databricks Certified Professional Data Engineer exam whether Delta Lake is tested in depth or not. While preparing, I’m currently using the Databricks Certified Professional Data Engineer sample quest...

Latest Reply
lucafredo
New Contributor III
  • 3 kudos

Yes, Delta Lake concepts are an important part of the Databricks Professional Data Engineer exam, but they aren’t tested in extreme depth compared to core Spark transformations and data pipeline design. The exam mainly focuses on practical understand...

1 More Replies
ryojikn
by New Contributor III
  • 2061 Views
  • 3 replies
  • 2 kudos

Model Serving - Shadow Deployment - Azure

Hey, I'm designing an architecture around Model Serving endpoints, and one of the needs we're aiming to address is shadow deployment. Currently, it seems that the traffic configurations available in model serving do not allow this type...

Latest Reply
KaushalVachhani
Databricks Employee
  • 2 kudos

@ryojikn and @irtizak , you’re right. Databricks Model Serving allows splitting traffic between model versions, but it doesn’t have a true shadow deployment where live production traffic is mirrored to a new model for monitoring without affecting use...
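Until native shadow deployment exists, the usual workaround is client-side mirroring: the caller duplicates each request to the shadow model, and only the primary response ever reaches the user. A minimal sketch, with hypothetical `primary`/`shadow` callables standing in for the two endpoint invocations:

```python
import threading

def serve_with_shadow(request, primary, shadow, record):
    """Send `request` to both models; return only the primary response.

    The shadow call runs on a side thread and its result goes to
    `record` (e.g. an inference-table writer) for offline comparison.
    Shadow failures are swallowed so they never affect users.
    """
    def _mirror():
        try:
            record(shadow(request))
        except Exception:
            pass  # shadow errors must not surface to the caller

    t = threading.Thread(target=_mirror, daemon=True)
    t.start()
    response = primary(request)
    t.join()  # joined here for determinism; production would fire-and-forget
    return response
```

In practice the recorded shadow predictions would land in an inference table for later drift and quality comparison against the live model.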

2 More Replies
jitenjha11
by Databricks Partner
  • 387 Views
  • 2 replies
  • 3 kudos

Getting error when running databricks deploy bundle command

Hi all, I am trying to implement an MLOps project using the https://github.com/databricks/mlops-stacks repo. I have created an Azure Databricks workspace on the Premium tier (+ role-based access controls) and am following bundle creation and deployment using the URL: http...

Latest Reply
iyashk-DB
Databricks Employee
  • 3 kudos

This is expected behavior with mlops-stacks and not an issue with your Terraform version or the CLI. The main problem is that your Azure Databricks workspace does not have Unity Catalog enabled or assigned. The mlops-stacks templates assume Unity Cat...

1 More Replies
Suheb
by Contributor
  • 539 Views
  • 2 replies
  • 2 kudos

Why does my MLflow model training job fail on Databricks with an out-of-memory error for large datas

I am trying to train a machine learning model using MLflow on Databricks. When my dataset is very large, the training stops and gives an ‘out-of-memory’ error. Why does this happen and how can I fix it?
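A common first mitigation, before moving to distributed training, is to stream the data in mini-batches so that only one chunk is materialized in memory at a time. A framework-agnostic sketch (names are illustrative):

```python
def minibatches(rows, batch_size):
    """Yield fixed-size chunks from any iterable of rows.

    Only the current batch lives in memory, so the full dataset is
    never materialized at once; the trailing partial batch is kept.
    """
    batch = []
    for row in rows:
        batch.append(row)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        yield batch
```

Each yielded batch can be fed to an incremental training step (e.g. an estimator that supports partial fitting) instead of calling fit on the entire dataset.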

Latest Reply
iyashk-DB
Databricks Employee
  • 2 kudos

+1 to what @mukul1409 said. Please follow the guides below to distribute the training: https://docs.databricks.com/aws/en/machine-learning/train-model/distributed-training/spark-pytorch-d... https://docs.databricks.com/aws/en/notebooks/source/dee...

1 More Replies