- 74 Views
- 1 replies
- 0 kudos
Job compute fails due to BQ permissions
Hello, my Databricks workspace is associated with the GCP project analytics, but my team and I mostly work in the GCP project data-science, which contains the only BQ dataset we have write access to. I'm trying to automate a pipeline to run on job compute a...
What identity is the job running as? Do you have any settings on the all-purpose cluster that you are not setting on the job cluster? You may need to grant roles/bigquery.jobUser on project analytics to the job compute's service account.
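If the billing project turns out to be the issue, one hedged sketch of the fix on the Spark side is to point the BigQuery connector's parentProject option at the project where the service account holds roles/bigquery.jobUser. The helper and all names below are illustrative, not from the thread:

```python
def bq_read_options(bq_project, dataset, table, billing_project):
    """Options for spark.read.format("bigquery") (hypothetical helper).

    'parentProject' is the GCP project billed for the BigQuery job; the
    job compute's service account needs roles/bigquery.jobUser there.
    """
    return {
        "table": f"{bq_project}.{dataset}.{table}",
        "parentProject": billing_project,
    }

# e.g. read a dataset living in project data-science, billing the BQ job
# to project analytics (where the workspace's service account has jobUser):
opts = bq_read_options("data-science", "my_dataset", "my_table", "analytics")
# df = spark.read.format("bigquery").options(**opts).load()
```

The key point is that the dataset's project and the billing project can differ; permissions are checked against both.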
- 365 Views
- 3 replies
- 1 kudos
Resolved! Unable to Access Azure Blob Storage from Databricks Community Edition Notebook
Hi everyone, I’m currently using Databricks Community Edition and trying to access data stored in Azure Blob Storage from my .ipynb notebook. The storage account is part of my free student Azure subscription. However, I’m not able to establish a co...
Hi, I think you are referring to the Databricks Free Edition, which doesn't support connections to external storage such as Azure Blob Storage. Thanks, Emma
- 152 Views
- 1 replies
- 1 kudos
Resolved! Issue Running Job on Serverless GPU
I have a job that runs a notebook; the notebook uses serverless GPU (A10) and keeps failing with "Run failed with error message Cluster 'xxxxxxxxxxx' was terminated. Reason: UNKNOWN (SUCCESS)". The base environment is 'Standard v4' and I have tr...
Hi @rtglorenabasul, Thanks for sharing the details. The behaviour you’re seeing is consistent with an issue in how the job is bringing up Serverless GPU compute, rather than with the notebook code itself. Having done some checks, that error usually m...
- 199 Views
- 1 replies
- 1 kudos
Resolved! Which types of model serving endpoints have health metrics available?
I am retrieving a list of model serving endpoints for my workspace via this API: https://docs.databricks.com/api/workspace/servingendpoints/list and then retrieving health metrics for each one with: https://[DATABRICKS_HOST]/api/2.0/serving-end...
Hey @KyraHinnegan, I did some digging and here is what I found. Hopefully it helps you understand a bit more about what is going on. At a high level, not every endpoint type exposes infrastructure health metrics via /metrics. What you’re seeing with ...
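For reference, a small sketch of how the two calls fit together. The list path matches the linked API docs; the per-endpoint metrics path follows the pattern quoted in the question, so treat it as an assumption rather than a confirmed contract:

```python
def serving_endpoint_urls(host, endpoint_name):
    """Build the list and per-endpoint metrics URLs (hypothetical helper,
    not an official client; metrics path as quoted in the question)."""
    base = host.rstrip("/")
    return {
        "list": f"{base}/api/2.0/serving-endpoints",
        "metrics": f"{base}/api/2.0/serving-endpoints/{endpoint_name}/metrics",
    }

urls = serving_endpoint_urls("https://example.cloud.databricks.com", "my-endpoint")
```

Each URL would then be fetched with a bearer token; endpoints that don't expose infrastructure metrics can be expected to return an error rather than a metrics payload.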
- 728 Views
- 4 replies
- 2 kudos
Resolved! Generic Spark Connect ML error. The fitted or loaded model size is too big.
When I train models in the serverless environment V4 (Premium Plan), the system occasionally returns the error message listed below, especially after running the model training code multiple times. We have tried creating new serverless sessions, whic...
Hi @jayshan, I'm sorry for the delayed response, and thanks for the extra details and for sharing your workaround. This behaviour is tied to how Spark Connect ML works in serverless mode, rather than a traditional JVM/GC leak. On se...
- 1053 Views
- 5 replies
- 3 kudos
Resolved! Vector search index initialization very slow
Hello, I am creating a vector search index and selected Compute embeddings for a Delta table with 19M records. The Delta table has only two columns: ID (selected as index) and Name (selected for embedding). The embedding model is databricks-gte-large-en. Ind...
Why doesn't deltaSync compute the embeddings in parallel instead of sequentially? That's a major gap in the architecture, no?
- 296 Views
- 1 replies
- 0 kudos
Databricks Model Serving Scaling Logic
Hi everyone, I’m seeking technical clarification on how Databricks Model Serving handles request queuing and autoscaling for CPU-intensive tasks. I am deploying a custom model for text and image extraction from PDFs (using Tesseract), and I’m struggli...
TLDR: Pre-provision min_provisioned_concurrency ≥ your peak parallel requests (in multiples of 4) with scale-to-zero disabled, and chunk large PDFs in your model code to bound per-request latency — reactive autoscaling can't help CPU-bound workloads ...
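The sizing rule in that TLDR (round peak parallel requests up to a multiple of 4) can be sketched as a small helper. The granularity of 4 comes from the reply itself, so verify it against current Databricks provisioning units before relying on it:

```python
import math

def provisioned_concurrency(peak_parallel_requests, unit=4):
    """Round peak concurrency up to the provisioning granularity.

    The unit of 4 is taken from the reply above; treat it as an assumption.
    """
    return unit * math.ceil(peak_parallel_requests / unit)

provisioned_concurrency(10)  # 12: covers 10 parallel requests in units of 4
```

This is the value you would pre-provision as the minimum, with scale-to-zero disabled, so CPU-bound requests never wait on a cold or reactive scale-up.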
- 636 Views
- 4 replies
- 1 kudos
Resolved! mlflow spark load_model fails with FMRegressor Model error on Unity Catalog
We trained a Spark ML FMRegressor model and registered it to Unity Catalog via MLflow. When attempting to load it back using mlflow.spark.load_model, we get an OSError: [Errno 5] Input/output error: '/dbfs/tmp' regardless of what dfs_tmpdir path is pa...
Hi, This is a well-documented issue that comes down to cluster access mode and how mlflow.spark.load_model handles temporary file storage. Let me break down both problems you are hitting and provide solutions. PROBLEM 1: OSError: [Errno 5] Input/outp...
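One hedged workaround sketch for the '/dbfs/tmp' error is to pass a dfs_tmpdir that the cluster's access mode can actually write to. The helper and volume path below are illustrative assumptions, not the official fix:

```python
def pick_dfs_tmpdir(access_mode, volume_tmp="/Volumes/main/default/tmp"):
    """Choose a temp dir for mlflow.spark.load_model(..., dfs_tmpdir=...).

    Assumption: on dedicated (single-user) clusters the /dbfs FUSE mount
    is available, so dbfs:/tmp works; on standard/shared access mode it
    is not, so a Unity Catalog volume path (illustrative name) is used.
    """
    return "dbfs:/tmp" if access_mode == "single_user" else volume_tmp

# model = mlflow.spark.load_model(model_uri, dfs_tmpdir=pick_dfs_tmpdir("shared"))
```

mlflow.spark.load_model does accept a dfs_tmpdir argument; the point of the sketch is that the path must be writable under the cluster's access mode, not just syntactically valid.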
- 771 Views
- 3 replies
- 3 kudos
Why ENABLE_MLFLOW_TRACING does not work for serving endpoint?
I would like to ask whether you have experienced a similar issue recently. I trained a sklearn model and logged it with fe.log_model for automatic feature lookup. The online feature tables were published with the currently recommended approach, whi...
Hi @d_szepietowska, Thank you for the detailed investigation, especially the side-by-side comparison between the legacy online table (MySQL) and the Lakebase-backed Online Feature Store. That is very helpful for narrowing down the behavior. UNDERSTAN...
- 390 Views
- 4 replies
- 1 kudos
Params with databricks Asset bundles
Hello, I am using Databricks Asset Bundles to create jobs for machine learning pipelines. My problem is that I am using Spark Python tasks and defining params inside them. When the job is created, it is created with some params. When I want to run the same jo...
Hi @Dali1, Great questions -- parameterizing ML pipelines in DABs is something a lot of people wrestle with, so let me break down the options. THE SHORT ANSWER No, you should not have to update the job definition every time you want different paramet...
- 448 Views
- 1 replies
- 1 kudos
Replacing a Monolithic MLflow Serving Pipeline with Composed Models in Databricks
Hi everyone, I’m a senior MLE and recently joined a company where all data science and ML workloads run on Databricks. My background is mostly MLOps on Kubernetes, so I’m currently ramping up on Databricks and trying to improve the architecture of som...
Hi @lschneid, This is a common architectural evolution for ML serving on Databricks, and the platform gives you several good options for decomposing a monolithic serving pipeline into cleaner, more maintainable components. Here is a breakdown of the ...
- 336 Views
- 2 replies
- 1 kudos
Copilot Studio to AWS Databricks Genie
Is it possible?
Hi @askaditya, Yes, this is possible. You can connect Microsoft Copilot Studio to a Databricks Genie space on AWS by using the Genie Conversation API and Copilot Studio's HTTP Request node (or a Power Automate cloud flow). Here is how the pieces fit ...
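As a sketch of the Genie side of that integration: the request below follows the start-conversation pattern from the public Genie Conversation API docs, but treat the exact path and field names as assumptions to verify; the helper itself is hypothetical:

```python
def genie_start_conversation(host, space_id, question):
    """Shape of the HTTP request Copilot Studio's HTTP Request node (or a
    Power Automate flow) would send to start a Genie conversation."""
    return {
        "method": "POST",
        "url": f"{host.rstrip('/')}/api/2.0/genie/spaces/{space_id}/start-conversation",
        "headers": {"Authorization": "Bearer <DATABRICKS_TOKEN>"},
        "json": {"content": question},
    }

req = genie_start_conversation(
    "https://example.cloud.databricks.com", "abc123", "What were sales last week?"
)
```

Copilot Studio would then poll the returned conversation/message for the answer; authentication is a workspace PAT or OAuth token in the Authorization header.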
- 450 Views
- 1 replies
- 1 kudos
Resolved! Model Serving Only Shows WARNING/ERROR Logs
Hi everyone, I’m deploying a custom model using mlflow.pyfunc.PythonModel in Databricks Model Serving. Inside my wrapper code, I configured logging as follows: logging.basicConfig(stream=sys.stdout, level=logging.INFO, format='%(asctime)s ...
@fede_bia This is worth walking through carefully, as it is a common source of confusion when deploying custom models on Databricks Model Serving. SHORT ANSWER The default root logging level for Model Serving endpoints is set to WARNING. That is why y...
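A minimal sketch of the workaround, assuming the serving container has already pinned the root logger to WARNING: configure a named logger directly with its own handler and level, instead of relying on logging.basicConfig (which is a no-op once the root logger has handlers):

```python
import logging
import sys

# Configure a named logger explicitly; the logger name is illustrative.
logger = logging.getLogger("my_model")
handler = logging.StreamHandler(sys.stdout)
handler.setFormatter(logging.Formatter("%(asctime)s %(levelname)s %(message)s"))
logger.addHandler(handler)
logger.setLevel(logging.INFO)
logger.propagate = False  # don't re-filter records through the WARNING-level root

logger.info("now visible in the service logs at INFO")
```

Because the level is set on this logger rather than the root, INFO records reach stdout regardless of how the container configured the root logger.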
- 723 Views
- 1 replies
- 1 kudos
Resolved! Import CV2 results in Fatal Error
Hey there. The setup for reproducing the error can be very basic: start a runtime (e.g. 17.3 LTS ML with a Standard_NV36ads_A10_v5 [A10], 440 GB memory, 1 GPU); in a notebook, install the cv2 package like this: %pip install opencv-python. This seems to install...
Hi @Bodevan, This is a common scenario when installing the standard opencv-python package on Databricks (or any headless server environment). The root cause is that opencv-python ships with GUI dependencies (Qt and X11 libraries) that are not availab...
- 407 Views
- 1 replies
- 0 kudos
Facing 132 error in model serving while using faiss
Hi, I have been trying to deploy a REST endpoint for my application using the model serving feature. I have registered my model in Unity Catalog; serving succeeds when I remove the faiss-related code, and when ...
Hi @venkatkittu, I can help with this. The error code 132 you are seeing is actually a Unix signal, and understanding it will point you directly to the fix. WHAT ERROR CODE 132 MEANS In Unix systems, when a worker process exits with code 132, that is...
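The "exit code above 128 means 128 + signal number" convention the reply refers to can be checked directly with the standard library:

```python
import signal

def exit_code_signal(code):
    """Decode a worker exit code: values above 128 encode 128 + signal number."""
    return signal.Signals(code - 128).name if code > 128 else None

exit_code_signal(132)  # "SIGILL": illegal instruction, typical when a wheel
                       # (e.g. a faiss build) uses CPU instructions such as
                       # AVX2/AVX-512 that the serving host does not support
```

So code 132 is 128 + 4 (SIGILL), which points toward the faiss binary rather than your model logic.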