Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.
Data + AI Summit 2024 - Data Science & Machine Learning

Forum Posts

roman_belkin
by New Contributor
  • 59 Views
  • 0 replies
  • 0 kudos

Gemini through Mosaic Gateway

I am trying to configure the Gemini Vertex API in Databricks. In simple Python code, everything works fine, which indicates that I have correctly set up the API and credentials. Error message: {"error_code":"INVALID_PARAMETER_VALUE","message":"INVALI...

kishanmaladkar
by New Contributor II
  • 748 Views
  • 3 replies
  • 1 kudos

Not able to change edit_mode from UI_LOCKED to EDITABLE in bundle deployment for development mode

The edit_mode for Databricks jobs cannot be overridden using the bundle. Based on the jobs REST API docs, there is functionality to set this parameter, but it's not available in the bundle docs. How can I use this in the bundle to override the para...

Latest Reply
pietern
Databricks Employee
  • 1 kudos

The `edit_mode` property cannot be set by design. It is set to `UI_LOCKED` on bundle deployment to let viewers of the job in the UI know that any changes they make to the job instance are going to be clobbered the next time someone runs a bundle depl...

2 More Replies
GGG_P
by New Contributor III
  • 2793 Views
  • 3 replies
  • 2 kudos

How to properly use Databricks MLFlow Managed tracker/register with Databricks Workflow

Hey. I'm building a DevOps/MLOps pipeline to train/register a simple scikit-learn model. I created a simple Databricks Workflow to execute the training and register tasks on a specific .git branch. (The Workflow is set up with a Databricks Repo on a specific branch, with...

Latest Reply
kdatt
New Contributor
  • 2 kudos

I had the same issue while trying to call a notebook from a workflow. I was able to do what you did, but it needs a new experiment name for each run, so I had to do this: # Set the experiment experiment_name = f"/Workspace/MLOps/{env}/experiment/{experiment}_{ti...

2 More Replies
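
The reply above is cut off, so the exact suffix it appends is unknown. As a rough sketch of the general idea (one experiment name per run), the helper below assumes the suffix is a UTC timestamp; `build_experiment_name`, the timestamp format, and the sample values `dev` and `churn` are hypothetical and not taken from the post.

```python
from datetime import datetime, timezone


def build_experiment_name(env, experiment, now=None):
    """Return a per-run experiment path with a timestamp suffix.

    Assumption: the truncated suffix in the reply is some kind of
    per-run timestamp. The format used here is illustrative only.
    """
    now = now or datetime.now(timezone.utc)
    stamp = now.strftime("%Y%m%d_%H%M%S")
    return f"/Workspace/MLOps/{env}/experiment/{experiment}_{stamp}"


# Fixed timestamp so the result is reproducible in this example.
fixed = datetime(2024, 10, 14, 9, 30, 0, tzinfo=timezone.utc)
name = build_experiment_name("dev", "churn", now=fixed)
print(name)
```

In a notebook, the resulting string would typically be passed to `mlflow.set_experiment(name)` before starting the run.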
Orianh
by Valued Contributor II
  • 1885 Views
  • 1 replies
  • 2 kudos

Run mlflow project from a Job.

Hey guys, I'm trying to build an automated process to run ML training sessions using mlflow and Databricks jobs. I develop the model on my local machine in an IDE, and when finished I have a template notebook that gets as parameters the mlflow project p...

Latest Reply
kdatt
New Contributor
  • 2 kudos

Hi, were you able to figure this one out? I have the same issue trying to call a training notebook from a workflow. Each run needs a new experiment name, which I can do, but then it creates a new experiment ID/name for each workflow run. Whereas when you run f...

Pablo_Camacho
by New Contributor
  • 129 Views
  • 1 replies
  • 0 kudos

Serving Endpoint Deployment

Hello Community, I am seeking assistance with an issue related to serving a custom Hugging Face model (M2M100). I successfully registered the model in Unity Catalog using the MLflow Python libraries without any problems. However, when attempting to se...

Latest Reply
helios
New Contributor
  • 0 kudos

see https://community.databricks.com/t5/machine-learning/serving-endpoint-container-image-creation-fails/td-p/83458

girl_chan
by New Contributor II
  • 915 Views
  • 2 replies
  • 0 kudos

Azure Databricks API and DLT Databricks

How can I pass a parameter from the Azure Data Factory REST web API to Delta Live Tables in Databricks? I get this error: "Py4JJavaError: An error occurred while calling o382.getArgument.: com.databricks.dbutils_v1.InputWidgetNotDefined: No input widget named *** def...

Latest Reply
Panda
Valued Contributor
  • 0 kudos

@girl_chan The error you are encountering usually occurs when a widget referenced in a Databricks notebook is not defined. In your case, the issue is likely related to how you pass parameters from Azure Data Factory (ADF) to Delta Live Tables (DLT) i...

1 More Replies
rtreves
by New Contributor III
  • 2251 Views
  • 7 replies
  • 0 kudos

One-hot encoding of strong cardinality features failing, causes downstream issues

Hi Databricks support, I'm training an ML model using mlflow on DBR 13.3 LTS ML, Spark 3.4.1 using databricks.automl_runtime 0.2.17 and databricks.automl 1.20.3, with shap 0.45.1. My training data has two float-type columns with three or fewer unique ...

Latest Reply
rtreves
New Contributor III
  • 0 kudos

@NandiniN Were you able to use my reprex above to investigate this issue at all? Thank you.

6 More Replies
User16826990884
by New Contributor III
  • 1291 Views
  • 2 replies
  • 0 kudos

Rollback cluster changes

Is it possible to roll back changes made to a cluster? The problem I'm trying to solve is recovering from an accidental change made by a user on a cluster that affects interactive and job runs. Cluster policies help, but the policy still provides the ...

Latest Reply
Panda
Valued Contributor
  • 0 kudos

@User16826990884 Along with what @sajith_appukutt mentioned, we can achieve this via version control for cluster configurations: store cluster configurations in JSON files in GitHub or another version control system. In case of accidental changes, you c...

1 More Replies
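
To illustrate the version-control suggestion in the reply above, here is a minimal sketch that compares a cluster configuration stored as JSON against the live one and reports what drifted. The helper `diff_config` and all sample values are hypothetical; in practice the "current" dictionary would come from the cluster itself (for example via the Clusters API), which is not shown here.

```python
import json


def diff_config(baseline, current, prefix=""):
    """Return {dotted_key: (baseline_value, current_value)} for each differing leaf."""
    changes = {}
    for key in sorted(set(baseline) | set(current)):
        path = f"{prefix}{key}"
        old, new = baseline.get(key), current.get(key)
        if isinstance(old, dict) and isinstance(new, dict):
            # Recurse into nested sections such as spark_conf.
            changes.update(diff_config(old, new, prefix=f"{path}."))
        elif old != new:
            changes[path] = (old, new)
    return changes


# Hypothetical baseline, as it might be committed to version control.
baseline = json.loads("""
{
  "num_workers": 2,
  "autotermination_minutes": 30,
  "spark_conf": {"spark.sql.shuffle.partitions": "200"}
}
""")

# Hypothetical live configuration after an accidental edit.
current = {
    "num_workers": 8,
    "autotermination_minutes": 30,
    "spark_conf": {"spark.sql.shuffle.partitions": "64"},
}

changes = diff_config(baseline, current)
for key, (old, new) in changes.items():
    print(f"{key}: {old!r} -> {new!r}")
```

If the diff is non-empty, re-applying the committed baseline is one way to restore the cluster to its last known-good settings.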
sharpbetty
by New Contributor II
  • 135 Views
  • 0 replies
  • 0 kudos

Custom AutoML pipeline: Beyond StandardScaler().

The automated notebook pipeline in an AutoML experiment applies StandardScaler to all numerical features in the training dataset as part of the PreProcessor. See below. But I want a more nuanced and varied treatment of my numeric values (e.g. I have l...

amirA
by New Contributor II
  • 416 Views
  • 2 replies
  • 1 kudos

Resolved! Extracting Topics From Text Data Using PySpark

Hi everyone, I tried to follow the same steps in Topic from Text on similar data as the example. However, when I try to fit the model with the data, I get this error. IllegalArgumentException: requirement failed: Column features must be of type equal to one of t...

Latest Reply
filipniziol
Contributor
  • 1 kudos

Hi @amirA, the LDA model expects the features column to be of type Vector from the pyspark.ml.linalg module, specifically either a SparseVector or DenseVector, whereas you have provided a Row type. You need to convert your Row object to SparseVector. Che...

1 More Replies
VELU1122
by New Contributor II
  • 206 Views
  • 1 replies
  • 0 kudos

Accessing Databricks Volumes from a Serving Endpoint Using a Custom Model Class in Unity Catalog

Hi everyone, I’m looking to access Unity Catalog (UC) Volumes from a Databricks Serving Endpoint. Here’s my current setup: I have a custom AI model class for inference, which I logged into Unity Catalog using mlflow.pyfunc.log_model. I’ve created a ...

Latest Reply
VELU1122
New Contributor II
  • 0 kudos

Additionally, I log the model as shown below, with MicrosoftResnet50Model being my custom inference class with load_context and predict methods: with mlflow.start_run(): model_info = mlflow.pyfunc.log_model(REGISTERED_MODEL_NAME, python_model=MicrosoftR...

User16826993440
by Databricks Employee
  • 2795 Views
  • 2 replies
  • 1 kudos

What is the best practice for applying MLFlow to clustering algorithms?

What is the best practice for applying MLFlow to clustering algorithms? What are the kinds of metrics customers track?

Latest Reply
Joseph_B
Databricks Employee
  • 1 kudos

Good question! I'll divide my suggestions into 2 parts: (1) In terms of MLflow Tracking, clustering is pretty similar to other ML workflows, so not much changes. (2) In terms of specific parameters, metrics, etc. to track, clustering is very different...

1 More Replies
