Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.

Forum Posts

ac10
by New Contributor II
  • 3408 Views
  • 1 replies
  • 1 kudos

Machine Learning Model Deployment on Databricks with Unity Catalog

Hi everyone! I thought it would be helpful to document and share my experience navigating model deployment on Databricks, given the recent change to deploy models to Unity Catalog instead of the Workspace Model Registry. https://medium.com/p/7d04e85395...

Latest Reply
Louis_Frolio
Databricks Employee
  • 1 kudos

Thanks for sharing this in-depth piece, @ac10. Your walkthrough of model deployment via Unity Catalog is clear and practical, especially the insight about handling model signatures when working with Spark DataFrames. This will definitely help practit...

drjb1010
by New Contributor
  • 3395 Views
  • 2 replies
  • 0 kudos

Error when creating model env using 'virtualenv' with DBR 14.3

We were trying to run inference from a logged model but got the following error. Previously, we had been using `conda` as the environment manager, but that is no longer supported. I tried to update pyenv, as some suggested, but didn't get anywhere. Any insig...

Screen Shot 2025-02-05 at 10.05.12 AM.png
Latest Reply
Louis_Frolio
Databricks Employee
  • 0 kudos

Hello @drjb1010, this is a known issue with DBR 14.3 where the `virtualenv` environment manager fails because it depends on `pyenv` to install specific Python versions, but `pyenv` is either not installed or not properly configured in the runtime e...
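
The dependency described in this reply can be checked before loading the model. The following is a minimal sketch, assuming that falling back to MLflow's `local` environment manager is acceptable when `pyenv` is missing; that fallback is an illustration, not necessarily the fix the reply goes on to recommend. `choose_env_manager` is a hypothetical helper, while `virtualenv` and `local` are existing values of MLflow's `env_manager` option.

```python
import shutil


def choose_env_manager(which=shutil.which):
    """Return an MLflow env_manager value based on whether pyenv is on PATH.

    `which` is injectable so the check can be exercised without touching
    the real PATH. Hypothetical helper, not part of MLflow.
    """
    # The virtualenv manager relies on pyenv to install the Python version
    # recorded with the model, so only choose it when pyenv can be found.
    if which("pyenv") is not None:
        return "virtualenv"
    # Assumption: reusing the current interpreter is an acceptable fallback.
    return "local"


env_manager = choose_env_manager()
print(f"env_manager={env_manager}")
```

The result could then be passed as the `env_manager` argument of `mlflow.pyfunc.spark_udf`. With `local`, the model runs in the cluster's own Python environment, so its dependencies must already be installed there.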

1 More Replies
SreeRam
by New Contributor
  • 3277 Views
  • 1 replies
  • 0 kudos

Patient Risk Score based on health history: Unable to create data folder for artifacts in S3 bucket

Hi all, we're using the Git project below to build a PoC on the concept of "Patient-Level Risk Scoring Based on Condition History": https://github.com/databricks-industry-solutions/hls-patient-risk
I was able to import the solution into Databricks and ru...

Latest Reply
Louis_Frolio
Databricks Employee
  • 0 kudos

Greetings @SreeRam, here are some suggestions for you. Based on the error you're encountering with the hls-patient-risk solution accelerator, this is a common issue related to MLflow artifact access and storage configuration in Databricks. The probl...

ssequ
by New Contributor II
  • 3590 Views
  • 1 replies
  • 1 kudos

Table-Model Lineage for models without online Feature Lookups

Hi community, I am looking for the recommended way to achieve table-model lineage in Unity Catalog for models that don't use Feature Lookups but only offline features. When I use FeatureEngineeringClient.create_training_set with feature_lookups + mlfl...

Latest Reply
Louis_Frolio
Databricks Employee
  • 1 kudos

Hey @ssequ, sorry this fell through the cracks, but I have some ideas for you to consider. You can get Unity Catalog table→model lineage without Feature Lookups by logging the training datasets to MLflow and registering the model in Unity Catalog. ...

cleversuresh
by New Contributor III
  • 4214 Views
  • 1 replies
  • 0 kudos

AutoGluon MLflow integration

I am working on a personalized price package recommendation and implemented AutoGluon code that integrates with MLflow. The code has been written in a modular fashion so that other team members can use it. They just need to pass the data, target column a...

Latest Reply
stbjelcevic
Databricks Employee
  • 0 kudos

Hi @cleversuresh, thanks for sharing the code and the context. Here are the core issues I see and how to fix them so MLflow logging works reliably on Databricks. What’s breaking MLflow logging in your code: Your PyFunc wrapper loads the AutoGluon mod...

dkxxx-rc
by Contributor
  • 3494 Views
  • 1 replies
  • 0 kudos

AutoML master notebook failing

I was recently able to run AutoML successfully on a certain dataset. But it has just failed on a second dataset of similar construction, before producing any machine learning training runs or output. The Experiments page says ```Mo...

dkxxxrc_0-1740403690249.png
Latest Reply
stbjelcevic
Databricks Employee
  • 0 kudos

Hi @dkxxx-rc, thanks for the detailed context. This error is almost certainly coming from AutoML’s internal handling of imbalanced data and sampling, not your dataset itself. The internal column _automl_sample_weight_0000 is created by AutoML when i...

excavator-matt
by Contributor
  • 1206 Views
  • 7 replies
  • 2 kudos

Resolved! What is the most efficient way of running sentence-transformers on a Spark DataFrame column?

We're trying to run the bundled sentence-transformers library from SBert in a notebook running Databricks ML 16.4 on an AWS g4dn.2xlarge [T4] instance. However, we're experiencing out-of-memory crashes and are wondering what the optimal way is to run sentenc...

Machine Learning
memory issues
sentence-transformers
vector embeddings
Latest Reply
Louis_Frolio
Databricks Employee
  • 2 kudos

@excavator-matt I’d recommend a quick refresher on the Pandas API on Spark to understand the implementation details. This video breaks it down clearly: https://youtu.be/tdZDotqKtps?si=pcIzCUYs2s_TeQKx Hope this helps. — Louis

6 More Replies
excavator-matt
by Contributor
  • 1522 Views
  • 8 replies
  • 1 kudos

Resolved! Importing sentence-transformers no longer works on Databricks runtime 17.2 ML

In Databricks Runtime 16.4 LTS for Machine Learning, I am used to being able to import sentence-transformers without any installation, as it is part of the runtime, with `from sentence_transformers import SentenceTransformer`. In this case I am running on a ...

Machine Learning
sentence-transformers
Latest Reply
excavator-matt
Contributor
  • 1 kudos

I have now upgraded to the new 17.3 LTS ML and it works. I didn't try 17.2 ML, but with 17.3 ML available, I don't see any reason to use it anymore.

7 More Replies
pablobd
by Contributor II
  • 4931 Views
  • 6 replies
  • 1 kudos

Resolved! Model Serving Endpoint Creation through API

Hello, I am trying to create a model serving endpoint via the API as explained here: https://docs.databricks.com/api/workspace/servingendpoints/create
I created a trusted IAM role with access to DynamoDB for the feature store. I try to use this field, "...

Latest Reply
EugeneBad
New Contributor
  • 1 kudos

If you're using the Databricks Terraform provider, make sure the role's name matches the instance-profile name. If not, use the `iam_role_arn` attribute to explicitly set the role's ARN when creating the Databricks instance profile: resource "databricks...
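
The name comparison this reply describes can be checked quickly before touching the Terraform configuration. This is a minimal sketch, assuming standard IAM ARN layouts; both function names are hypothetical and the ARNs are placeholders, not values from the thread.

```python
def arn_resource_name(arn):
    """Return the final path segment of an IAM ARN's resource part."""
    # ARN layout: arn:partition:service:region:account-id:resource
    resource = arn.split(":", 5)[5]
    # The resource looks like "role/name" or "instance-profile/path/name".
    return resource.rsplit("/", 1)[-1]


def needs_explicit_role_arn(instance_profile_arn, role_arn):
    """True when the two names differ, the case where the reply suggests
    setting `iam_role_arn` explicitly."""
    return arn_resource_name(instance_profile_arn) != arn_resource_name(role_arn)


# Placeholder ARNs for illustration only.
profile = "arn:aws:iam::123456789012:instance-profile/serving-profile"
role = "arn:aws:iam::123456789012:role/serving-role"
print(needs_explicit_role_arn(profile, role))
```

When the check returns a true value, the names do not line up, so the role ARN would have to be supplied explicitly rather than inferred.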

5 More Replies
sangramraje
by New Contributor
  • 3665 Views
  • 1 replies
  • 0 kudos

AutoML "need to sample" not working as expected

tl;dr: When the AutoML run realizes it needs to do sampling because the driver / worker node memory is not enough to load / process the entire dataset, it fails. A sample weight column is NOT provided by me, but I believe somewhere in the process the...

sangramraje_0-1732300084616.png sangramraje_1-1732300133987.png
Latest Reply
Louis_Frolio
Databricks Employee
  • 0 kudos

Hey @sangramraje, sorry for the late response. I wanted to check in to see if this is still an issue with the latest release. Please let me know. Cheers, Louis.

adoodsonruby
by New Contributor II
  • 3777 Views
  • 1 replies
  • 1 kudos

AutoML Doesn't Work Due to Not being able to generate the EDA notebook

Hi, I'm trying to run an AutoML classification experiment with a dataset that I have made, and I am experiencing this issue even after purposely downsampling my dataset before feeding it into the AutoML experiment. It appears that there is no way for me ...

Latest Reply
Louis_Frolio
Databricks Employee
  • 1 kudos

Hey @adoodsonruby, sorry this got lost in the shuffle. Have you tried again recently? I believe the limits have been increased, which would remove this impediment. Let us know. Louis.

lchicoma
by New Contributor
  • 3460 Views
  • 1 replies
  • 0 kudos

Error creating a Databricks endpoint with an online table that has 2 primary keys

I have a Delta table whose primary key is composed of 2 fields (accountId, ruleModelVersionDesc), and I have also created an online table that has the same primary key, but when I create a feature spec to create an endpoint I get the following error...

Machine Learning
enpoints
featurespec
fetureserving
MachineLearning
onlinetabl
Latest Reply
Louis_Frolio
Databricks Employee
  • 0 kudos

Hey @lchicoma, sorry for the delayed response. Thanks for sharing the error and context. This looks like a parsing issue in the feature specification rather than a problem with Delta or the runtime versions. What changed recently: There was an inci...

amanjethani
by New Contributor
  • 997 Views
  • 1 replies
  • 0 kudos

🐞 Stuck on LightGBM Distributed Training in PySpark – Hanging After Socket Communication

My Setup:
I'm trying to run distributed LightGBM training using synapseml.lightgbm.LightGBMRegressor in PySpark.
Cluster Details:
Spark version: 3.5.1 (compatible with PySpark 3.5.6)
PySpark version: 3.5.6
synapseml: v0.11.1 (latest)
Spark Cluster: 3 Hetzn...

Latest Reply
stbjelcevic
Databricks Employee
  • 0 kudos

Hi @amanjethani, thanks for laying out the setup and symptoms so clearly. The hang likely occurs either because LightGBM’s distributed network doesn’t fully form between executors or because the expected task count doesn’t match the actual task count, leadin...

semsim
by Contributor
  • 3444 Views
  • 1 replies
  • 0 kudos

Can't query Legacy Serving Endpoint

Hi, I was able to deploy an endpoint using legacy serving (it's the only option we have to deploy endpoints in DB). Now I am having trouble querying the endpoint itself. When I try to query it I get the following error: Here is the code I am using ...

semsim_0-1726245119742.png
Latest Reply
Louis_Frolio
Databricks Employee
  • 0 kudos

Hey @semsim, sorry for the delayed response. Thanks for the screenshot; it pinpoints the problem. Root cause from the error: Your model’s predict path is trying to create or write to /Workspace/Shared, and the serving container does not permit t...
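
The diagnosis in this reply is that prediction code writes to a path the serving container does not allow. One common workaround, sketched here under the assumption that the files are only scratch data needed during a single call, is to write to a temporary directory instead. `write_scratch_file` is a hypothetical helper, and the fix the reply goes on to describe may differ.

```python
import os
import tempfile


def write_scratch_file(filename, payload):
    """Write payload into a fresh temporary directory and return the file path."""
    # mkdtemp creates a directory under the system temp location, which is
    # assumed here to be writable inside the serving container.
    scratch_dir = tempfile.mkdtemp(prefix="model_scratch_")
    path = os.path.join(scratch_dir, filename)
    with open(path, "w", encoding="utf-8") as handle:
        handle.write(payload)
    return path


path = write_scratch_file("features.json", '{"rows": 3}')
print(path)
```

Anything that must outlive the request would still need a storage location the endpoint is explicitly allowed to write to.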

Kasen
by New Contributor III
  • 4118 Views
  • 1 replies
  • 1 kudos

Multi-tenant recommendation system (Machine learning)

Hello, I am looking to build a multi-tenant machine learning recommender system in Azure Databricks. The idea is to have a single shared model, where each tenant can use the same model to train on their own unique dataset. Essentially, while the model...

Machine Learning
machine learning
multi-tenant
recommendation
Latest Reply
Louis_Frolio
Databricks Employee
  • 1 kudos

@Kasen, sorry for the delayed response. Here are some things to consider regarding your question. Azure Databricks is well-suited for a shared-architecture, tenant-isolated recommender system. Below is a pragmatic blueprint, the isolation model o...

