Machine Learning

by VELU1122 • New Contributor II

10-08-2024 3:39:15 PM

6047 Views
4 replies
2 kudos

Accessing Databricks Volumes from a Serving Endpoint Using a Custom Model Class in Unity Catalog

Hi everyone, I’m looking for a way to access Unity Catalog (UC) Volumes from a Databricks Serving Endpoint. Here’s my current setup: I have a custom AI model class for inference, which I logged into Unity Catalog using mlflow.pyfunc.log_model. I’ve created a ...


Latest Reply

iyashk-DB
Databricks Employee

Monday

2 kudos

Serverless Model Serving does not mount the UC Volumes FUSE path (/Volumes), so references to “/Volumes/…” inside a custom pyfunc’s model code will fail at container build or runtime. The correct pattern is to package any required files (like your ...
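A minimal sketch of the packaging pattern the reply describes, under stated assumptions: the model reads its file from the artifacts mapping handed to `load_context` instead of a hard-coded "/Volumes/…" path. `LookupModel`, `fake_context`, and the `"lookup"` artifact name are hypothetical and used only for illustration. To keep the sketch runnable without MLflow installed, the class does not inherit from `mlflow.pyfunc.PythonModel`; a real model would.

```python
import json
from types import SimpleNamespace


class LookupModel:
    """Stand-in for a class that would subclass mlflow.pyfunc.PythonModel."""

    def load_context(self, context):
        # Read the file from the artifacts packaged with the model,
        # not from a hard-coded "/Volumes/..." path.
        with open(context.artifacts["lookup"], encoding="utf-8") as fh:
            self.lookup = json.load(fh)

    def predict(self, context, model_input):
        # Map each input key to its stored value; unknown keys give None.
        return [self.lookup.get(key) for key in model_input]


def fake_context(lookup_path):
    """Hypothetical helper: mimics the `artifacts` mapping MLflow passes in."""
    return SimpleNamespace(artifacts={"lookup": lookup_path})
```

When logging such a model, the file would be supplied through the `artifacts` argument of `mlflow.pyfunc.log_model`, pointing at a path that is readable at logging time, so that it is copied into the model package and is available inside the serving container.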


by liu • Contributor

a week ago

136 Views
2 replies
1 kudos

Can serverless environments not use SynapseML's LightGBM?

When I use LightGBM, I get the following error on the line below: 'str' object has no attribute 'getParam'. Is this because serverless cannot run the JAR files that SynapseML depends on? File /local_disk0/.ephemeral_nfs/envs/pythonEnv-b0d5f8ce-8426-443...


Latest Reply

liu
Contributor

a week ago

1 kudos

Sorry, I just found out while checking the official documentation that starting from November, dependency JAR files are supported. I’ll give it a try and see how well it works. If anyone has any insights, I would greatly appreciate your guidance.


by peternagy • New Contributor II

2 weeks ago

256 Views
2 replies
3 kudos

Resolved! Databricks Model Serving Endpoint Fails: “_USER not found for feature table”

Hi Databricks Community, I’m trying to deploy a model serving endpoint that uses Databricks Feature Store (Unity Catalog, online tables). My offline and online feature tables are created and visible in Databricks. The model is logged with FeatureEnginee...


Latest Reply

peternagy
New Contributor II

2 weeks ago

3 kudos

Thanks for the reply. It is very useful and comprehensive. I managed to find another solution to the problem, so I wanted to share some additional details on this topic: I was using 15.4 LTS ML Runtime, which could have caused the problem - I did not swit...


by Spencer_Kent • New Contributor III

03-21-2023 9:39:06 PM

3886 Views
5 replies
1 kudos

Resolved! Lacking support for column-level select grants or attribute-based access control

In the Unity Catalog launch and its accompanying blog post, one of the primary selling points was a set of granular access control features that would at least partially eliminate the need to create a multitude of separate table views and the attenda...


Latest Reply

Charuvil
New Contributor III

2 weeks ago

1 kudos

@at-khatri Databricks ABAC is in Public Preview now: https://docs.databricks.com/aws/en/data-governance/unity-catalog/abac/


by hasnat_unifeye • New Contributor II

3 weeks ago

255 Views
1 reply
4 kudos

Hackathon Project: Recipe Recommendation Engine with Traditional ML + Genie on Databricks Free Edit

Hi everyone, For the Databricks Free Edition Hackathon, I wanted to show that traditional ML still has a big role today, and how it can work hand-in-hand with Databricks’ newer AI tooling. As a concrete use case, I created a recipe recommendation eng...


Latest Reply

Raman_Unifeye
Contributor III

3 weeks ago

4 kudos

This is amazing @hasnat_unifeye. Well done and good luck for the hackathon.


by kevin11 • Valued Contributor

3 weeks ago

187 Views
1 reply
0 kudos

AutoML Deprecation?

Hi all, it looks like AutoML is set to be deprecated with the next major version (although the note doesn't specify whether that's 18). I haven't seen any announcement or alert about this impending change. Did I just miss it? I know we have teams using t...


Latest Reply

szymon_dybczak
Esteemed Contributor III

3 weeks ago

0 kudos

Hi @kevin11, I guess this is their standard library deprecation policy. Their docs mention that when a library is planned for removal, Databricks takes the following steps to notify customers: So they've added that note to the AutoML docs: And y...


by ashfire • New Contributor II

3 weeks ago

252 Views
1 reply
0 kudos

Resolved! How to store & update a FAISS Index in Databricks

I’m currently using FAISS in a Databricks notebook to perform semantic search in text data. My current workflow looks like this: encode ~10k text entries using an embedding model; build a FAISS index in memory; run similarity searches using index.search...


Latest Reply

Louis_Frolio
Databricks Employee

3 weeks ago

0 kudos

Hello @ashfire, here’s a practical path to scale your FAISS workflow on Databricks, along with patterns to persist indexes, incrementally add embeddings, and keep metadata aligned. Best practice to persist/load FAISS indexes on Databricks: Use faiss...


by p4pratikjain • Contributor

08-19-2024 7:53:54 AM

3597 Views
2 replies
0 kudos

DAB - Add/remove task depending on workspace.

I use DAB to deploy jobs. I want to add a specific task in dev only, not in staging or prod. Is there any way to achieve this using DAB?


Latest Reply

Coffee77
Contributor III

3 weeks ago

0 kudos

You can define specific resources by target in DAB as shown here. This is valid for jobs and/or tasks: For instance, in my case: I think the best option (but not available as far as I know) would be to be able to define "include" sections by target, inste...


by aswinkks • New Contributor III

05-28-2025 1:13:27 AM

1167 Views
2 replies
0 kudos

Distributed Training quits if any worker node fails

Hi, I'm training a PyTorch model in a distributed environment using PyTorch's DistributedDataParallel (DDP) library. I have spun up 10 worker nodes. The issue I'm facing is that during training, if any worker node fails and exits, the ent...


Latest Reply

mark_ott
Databricks Employee

3 weeks ago

0 kudos

Distributed training with PyTorch’s DistributedDataParallel (DDP) is not inherently fault-tolerant—if any node fails, the whole job crashes, and, as you noted, checkpointing cannot auto-recover the process without hypervisor or application-level orch...
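As a rough illustration of what application-level orchestration could look like, here is a sketch that relaunches the whole training run from the last checkpoint when a step fails. It is not taken from the reply, and it uses only the standard library so that it runs anywhere: the checkpoint is a small JSON file holding a step counter, and `step_fn`, `run_with_restarts`, and `max_restarts` are hypothetical names. A real DDP job would save model and optimizer state (for example with `torch.save`) and restart every rank together.

```python
import json
import os


def load_step(path):
    """Return the last completed step, or 0 if no checkpoint exists."""
    if not os.path.exists(path):
        return 0
    with open(path, encoding="utf-8") as fh:
        return json.load(fh)["step"]


def save_step(path, step):
    """Write the checkpoint atomically so a crash cannot leave it half-written."""
    tmp = path + ".tmp"
    with open(tmp, "w", encoding="utf-8") as fh:
        json.dump({"step": step}, fh)
    os.replace(tmp, path)


def train(path, total_steps, step_fn):
    """Resume from the checkpoint and run the remaining steps."""
    for step in range(load_step(path) + 1, total_steps + 1):
        step_fn(step)  # may raise if a worker dies
        save_step(path, step)


def run_with_restarts(path, total_steps, step_fn, max_restarts=3):
    """Relaunch training after a failure, up to max_restarts times."""
    for attempt in range(max_restarts + 1):
        try:
            train(path, total_steps, step_fn)
            return attempt  # number of restarts that were needed
        except RuntimeError:
            if attempt == max_restarts:
                raise
```

The design choice worth noting is that only completed steps are checkpointed, so a restart repeats the failed step rather than skipping it.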


by Kjetil • Contributor

08-26-2024 7:09:27 AM

3952 Views
1 reply
0 kudos

FeatureEngineeringClient and Unity Catalog

When testing this code (fe.score_batch(df=dataset.drop("Target").limit(10), model_uri=f"models:/{model_name}/{mv.version}").select("prediction").limit(10).display()) I get the error: “MlflowException: The...


Latest Reply

mark_ott
Databricks Employee

3 weeks ago

0 kudos

Your issues are tied to authentication and network/configuration differences between Unity Catalog and Workspace models in Databricks, specifically when using the FeatureEngineeringClient. Key Issues FeatureEngineeringClient + Unity Catalog: You get...


by stochastic • New Contributor

08-26-2024 12:30:33 PM

3763 Views
1 reply
0 kudos

Why is Spark MLlib not encouraged on the platform? / Why is ML dependent on .toPandas() on Databricks?

I'm new to Spark and Databricks and am surprised that the Databricks tutorials for ML favor pandas DataFrames over Spark DataFrames. In the tutorials I've seen, most data processing is done in a distributed manner, but then it's just cast to a pandas DataFrame. From...


Latest Reply

mark_ott
Databricks Employee

3 weeks ago

0 kudos

You are noticing a common pattern in Databricks ML tutorials: data is often processed with Spark for scalability, but training and modeling are frequently done on pandas DataFrames using single-node libraries like scikit-learn. This workflow can be c...


by nitinjain26 • New Contributor II

3 weeks ago

209 Views
1 reply
0 kudos

Resolved! No option for create compute in trial version

Hi, I don't see an option for "Create Compute". I have a trial version. I am trying to build a machine learning model on Databricks for the first time. Please check the attached screenshot.


Latest Reply

Advika
Databricks Employee

3 weeks ago

0 kudos

Hello @nitinjain26! Free trials only offer serverless/SQL compute clusters (due to resource and cost controls). Please check out this post for more details: [FREE TRIAL] Missing All-Purpose Clusters Access - New Account


by __paolo_c__ • Contributor II

08-23-2024 8:57:41 AM

4549 Views
1 reply
0 kudos

Feature tables & Null Values

Hi! I was wondering if any of you have ever dealt with feature tables and null values (more specifically, via feature engineering objects rather than feature store, although I don't think it really matters). In brief, null values are allowed to be stor...


Latest Reply

mark_ott
Databricks Employee

3 weeks ago

0 kudos

When dealing with feature tables and null values—especially via Databricks Feature Engineering objects (but also more broadly in Spark or feature platforms)—there are some nuanced behaviors when schema inference is required. Here are clear answers to...


by ryojikn • New Contributor III

05-03-2024 8:59:23 AM

1413 Views
2 replies
2 kudos

Model Serving - Shadow Deployment - Azure

Hey, I'm designing an architecture around Model Serving Endpoints, and one of the needs we're aiming to address is shadow deployment. Currently, it seems that the traffic configurations available in model serving do not allow this type...


Latest Reply

KaushalVachhani
Databricks Employee

3 weeks ago

2 kudos

@ryojikn and @irtizak , you’re right. Databricks Model Serving allows splitting traffic between model versions, but it doesn’t have a true shadow deployment where live production traffic is mirrored to a new model for monitoring without affecting use...
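One way to approximate shadow traffic outside the endpoint, which may or may not be what the reply goes on to suggest, is to mirror each request in the calling application. The sketch below assumes `primary` and `shadow` are plain callables standing in for calls to two serving endpoints; `serve_with_shadow`, `log`, and the thread pool size are hypothetical. Only the primary's answer is returned to the caller, and a shadow failure is recorded rather than raised.

```python
from concurrent.futures import ThreadPoolExecutor

# Small shared pool for background shadow calls (size is an arbitrary choice).
_pool = ThreadPoolExecutor(max_workers=4)


def serve_with_shadow(request, primary, shadow, log):
    """Return the primary's answer; mirror the request to the shadow in the background."""

    def mirror():
        try:
            log.append((request, shadow(request)))
        except Exception as exc:  # a shadow failure must never reach the caller
            log.append((request, exc))

    future = _pool.submit(mirror)
    # The future is returned only so callers (or tests) can wait on the mirror.
    return primary(request), future
```

The shadow's outputs collected in `log` can then be compared offline with the primary's, without users ever seeing them.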


by tarunnagar • Contributor

3 weeks ago

373 Views
4 replies
1 kudos

What Are the Key Challenges in Developing ETL Pipelines Using Databricks?

I’m looking to understand the practical challenges that professionals face when building ETL (Extract, Transform, Load) pipelines on Databricks. Specifically, I’m curious about issues related to scalability, performance, data quality, integration wit...


Latest Reply

Suheb
New Contributor III

3 weeks ago

1 kudos

Developing ETL pipelines in Databricks comes with challenges like managing diverse data sources, optimizing Spark performance, and controlling cloud costs. Ensuring data quality, handling errors, and maintaining security and compliance add complexity...

