cancel
Showing results for 
Search instead for 
Did you mean: 
Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

ScyLukb
by New Contributor
  • 3808 Views
  • 1 replies
  • 0 kudos

Model serving with custom pip index URL

An mlflow model was logged with a custom pip requirements file which contains package versions (mlflow==2.11.3), as well as a custom --index-url. However model serving during the "Initializing model enviroment" step tries to pip install mlflow==2.2.2...

  • 3808 Views
  • 1 replies
  • 0 kudos
Latest Reply
stbjelcevic
Databricks Employee
  • 0 kudos

Hi @ScyLukb , This is a common and frustrating problem that occurs when the Model Serving environment's built-in dependencies conflict with your model's specific requirements. The root cause is that the Model Serving environment tries to install its ...

  • 0 kudos
Mario_D
by New Contributor III
  • 3389 Views
  • 1 replies
  • 2 kudos

Bug: MLflow recipe

I'm not sure whether this is the right place, but we've encountered a bug in the datasets.py(https://github.com/mlflow/mlflow/blob/master/mlflow/recipes/steps/ingest/datasets.py.). Anyone using recipes beware of forementioned.def _convert_spark_df_to...

  • 3389 Views
  • 1 replies
  • 2 kudos
Latest Reply
stbjelcevic
Databricks Employee
  • 2 kudos

Hi @Mario_D , Thanks for bringing this to our attention, I will pass this information along to the appropriate team!

  • 2 kudos
danielvdc
by New Contributor II
  • 3801 Views
  • 1 replies
  • 2 kudos

Rolling predictions with FeatureEngineeringClient

I am performing a time series analysis, using a XGBoostRegressor with rolling predictions. I am doing so using the FeatureEngineeringClient (in combination with Unity Catalog), where I create and load in my features during training and inference, as ...

  • 3801 Views
  • 1 replies
  • 2 kudos
Latest Reply
stbjelcevic
Databricks Employee
  • 2 kudos

You’re running into a fundamental limitation: score_batch does point‑in‑time feature lookups and batch scoring, but it doesn’t support recursive multi‑step forecasting where predictions update features for subsequent timesteps. Feature Store looks up...

  • 2 kudos
tooooods
by New Contributor
  • 3530 Views
  • 1 replies
  • 0 kudos

TorchDistributor: installation of custom python package via wheel across all nodes in cluster

I am trying to set up a training pipeline of a distributed PyTorch model using TorchDistributor. I have defined a train_object (in my case it is a Callable) that runs my training code. However, this method requires custom code from modules that I hav...

  • 3530 Views
  • 1 replies
  • 0 kudos
Latest Reply
stbjelcevic
Databricks Employee
  • 0 kudos

hi @tooooods , This is a classic challenge in distributed computing, and your observation is spot on. The ModuleNotFoundError on the workers, despite the UI and API showing the library as "Installed," is the key symptom. This happens because TorchDis...

  • 0 kudos
hawa
by New Contributor II
  • 6331 Views
  • 5 replies
  • 2 kudos

Problem serving a langchain model on Databricks

Hi, I've encountered a problem of serving a langchain model I just created successfully on Databricks.I was using the following code to set up a model in unity catalog:from mlflow.models import infer_signatureimport mlflowimport langchainmlflow.set_r...

  • 6331 Views
  • 5 replies
  • 2 kudos
Latest Reply
Louis_Frolio
Databricks Employee
  • 2 kudos

Greetings @hawa ,  Thanks for sharing the details—this looks like a combination of registration and configuration issues that commonly surface with the MLflow LangChain flavor on Databricks. What’s going wrong The registered model name should be a fu...

  • 2 kudos
4 More Replies
gsalazar
by New Contributor
  • 3463 Views
  • 1 replies
  • 0 kudos

How to load a synapse/maven package in Dbricks Model Serving Endpoint

Hi!A lot similar to this 2021's post: https://community.databricks.com/t5/data-engineering/how-to-include-a-third-party-maven-package-in-mlflow-model/td-p/17060I'm attempting to serve a synapseml model (maven dependencies) using Databricks Model Serv...

Machine Learning
Endpoint
mlflow
Model serving
SynapseML
  • 3463 Views
  • 1 replies
  • 0 kudos
Latest Reply
mark_ott
Databricks Employee
  • 0 kudos

You are encountering issues serving a SynapseML model (with Maven dependencies) via Databricks Model Serving Endpoints, and the deployment works fine on general-purpose clusters but fails for the serving endpoint. This is a well-known issue with Data...

  • 0 kudos
rtreves
by Contributor
  • 3601 Views
  • 1 replies
  • 1 kudos

Proper mlflow run logging with SparkTrials and Hyperopt

Hello!I'm attempting to run a hyperparameter search using hyperopt and SparkTrials(), and log the resulting runs to an existing experiment (experiment A). I can see on this page that databricks suggests wrapping the `fmin()` call within a `mlflow.sta...

  • 3601 Views
  • 1 replies
  • 1 kudos
Latest Reply
mark_ott
Databricks Employee
  • 1 kudos

Both the parent and child runs of a Hyperopt sweep in Databricks are, by default, influenced by the experiment associated with the notebook context rather than the explicit experiment passed to mlflow.start_run(). As you noticed, child runs remain in...

  • 1 kudos
rjain
by New Contributor
  • 3508 Views
  • 1 replies
  • 0 kudos

Vector Index Creation for external embedding model takes a lot of time

I have embedding model endpoint created and served. It is huggingface model which databricks doesnt provide. I am using this model to create vector search index however this takes a lot of time to get created. I observed that when I use databricks of...

  • 3508 Views
  • 1 replies
  • 0 kudos
Latest Reply
mark_ott
Databricks Employee
  • 0 kudos

The main reason your Hugging Face embedding model endpoint is taking much longer than Databricks’ own large_bge_en model to build a vector search index is likely due to differences in operational architecture and performance optimizations between ext...

  • 0 kudos
aswanson
by New Contributor
  • 3589 Views
  • 1 replies
  • 0 kudos

Pickle/joblib.dump a pre-processing function defined in a notebook

I've built a custom MLFlow model class which I know functions. As part of a given run the model class uses `joblib.dump` to store necessary parameters on the databricks DBFS before logging them as artifacts in the MLFlow run. This works fine when usi...

  • 3589 Views
  • 1 replies
  • 0 kudos
Latest Reply
mark_ott
Databricks Employee
  • 0 kudos

The error you’re seeing—SPARK-5063 CONTEXT_ONLY_VALID_ON_DRIVER—arises when trying to serialize or use objects (such as functions) defined in Databricks notebooks from workers rather than the driver. This issue is especially common with Python functi...

  • 0 kudos
cbossi
by New Contributor II
  • 40 Views
  • 1 replies
  • 1 kudos

Resolved! Options sporadic (and cost-efficient) Model Serving on Databricks?

Hi all,I'm new to Databricks so would appreciate some advice.I have a ML model deployed using Databricks Model Serving. My use case is very sporadic: I only need to make 5–15 prediction requests per day (industrial application), and there can be long...

  • 40 Views
  • 1 replies
  • 1 kudos
Latest Reply
KaushalVachhani
Databricks Employee
  • 1 kudos

Hi @cbossi , You are right! A 30-minute idle period precedes the endpoint's scaling down. You are billed for the compute resources used during this period, plus the actual serving time when requests are made. This is the current expected behaviour. Y...

  • 1 kudos
intelliconnectq
by New Contributor
  • 64 Views
  • 1 replies
  • 2 kudos

Resolved! Model Registration and hosting

I have train & tested a model in databricks, now I want to register it and host it. But I am unable too do so. Please find attach snapshot of code & error 

intelliconnectq_0-1762230437372.png
  • 64 Views
  • 1 replies
  • 2 kudos
Latest Reply
joelrobin
Databricks Employee
  • 2 kudos

Hi @intelliconnectq The above code will fail with AttributeError: 'NoneType' object has no attribute 'info' on the line: model_uri = f"runs:/{mlflow.active_run().info.run_id}/xgboost-model"  This happens because once the with mlflow.start_run(): bloc...

  • 2 kudos
steve2
by New Contributor
  • 3434 Views
  • 1 replies
  • 0 kudos

Surprisingly sparse_logs and tensorboard logfiles in Databricks-Workspace

Hi, surprisingly we have found 2 new folders with some short logfiles in our Databricks workspace:ls -lFr sparse_logs/ tensorboard/tensorboard/:-rwxrwxrwx 1 root root 88 Sep  2 11:26 events.out.tfevents.1725275744.0830-063833-n68nsxoq-10-139-64-10.20...

  • 3434 Views
  • 1 replies
  • 0 kudos
Latest Reply
Louis_Frolio
Databricks Employee
  • 0 kudos

Hey @steve2 ,  short answer: these look like TensorBoard event files, likely created by a library that briefly initialized a TensorBoard logger or writer during one of your training/serving runs; the sparse_logs folder naming and “manager stage: Mode...

  • 0 kudos
VELU1122
by New Contributor II
  • 5617 Views
  • 3 replies
  • 0 kudos

Accessing Databricks Volumes from a Serving Endpoint Using a Custom Model Class in Unity Catalog

Hi everyone,I’m looking for accessing Unity Catalog (UC) Volumes from a Databricks Serving Endpoint. Here’s my current setup:I have a custom AI model class for inference, which I logged into Unity Catalog using mlflow.pyfunc.log_model.I’ve created a ...

  • 5617 Views
  • 3 replies
  • 0 kudos
Latest Reply
Louis_Frolio
Databricks Employee
  • 0 kudos

Greetings @VELU1122 ,  you’re correct that the Databricks Model Serving container is isolated, so you can’t rely on cluster-only affordances like mounts or executor-distributed file utilities. The reliable way to read from Unity Catalog (UC) Volumes ...

  • 0 kudos
2 More Replies
grajee
by New Contributor II
  • 3584 Views
  • 1 replies
  • 1 kudos

Lakehouse Monitoring of Inference Table

All,I'm trying to setup a lakehouse monitoring process for the WineQuality model that is widely available. While setting up the Serving Endpoint, I enabled "Inference Table" option for which the inference table was created automatically. The columns ...

Machine Learning
Inference Table
Lakehouse-Monitoring
  • 3584 Views
  • 1 replies
  • 1 kudos
Latest Reply
Louis_Frolio
Databricks Employee
  • 1 kudos

Hello @grajee ,  I can see you're dealing with two separate issues here. Let me address both: Issue 1: The model_id column (request_metadata MAP type) You're correct that request_metadata is a MAP type and can't be directly used as the model_id colum...

  • 1 kudos
sharpbetty
by New Contributor II
  • 3653 Views
  • 1 replies
  • 0 kudos

Custom AutoML pipeline: Beyond StandardScaler().

The automated notebook pipeline in an AutoML experiment applies StandardScaler to all numerical features in the training dataset as part of the PreProcessor. See below.But I want a more nuanced and varied treatment of my numeric values (e.g. I have l...

sharpbetty_0-1728884608851.png
  • 3653 Views
  • 1 replies
  • 0 kudos
Latest Reply
Louis_Frolio
Databricks Employee
  • 0 kudos

Greetings @sharpbetty  Great question! Databricks AutoML's "glass box" approach actually gives you several options to customize preprocessing beyond the default StandardScaler. Here are two practical approaches: Option A: Pre-process Features Before ...

  • 0 kudos

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now
Labels