Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.

Forum Posts

KyraHinnegan
by New Contributor II
  • 513 Views
  • 1 reply
  • 1 kudos

Resolved! Which types of model serving endpoints have health metrics available?

I am retrieving a list of model serving endpoints for my workspace via this API: https://docs.databricks.com/api/workspace/servingendpoints/list and then going to retrieve health metrics for each one with: https://[DATABRICKS_HOST]/api/2.0/serving-end...

Latest Reply
Louis_Frolio
Databricks Employee

Hey @KyraHinnegan, I did some digging and here is what I found. Hopefully it helps you understand a bit more about what is going on. At a high level, not every endpoint type exposes infrastructure health metrics via /metrics. What you’re seeing with ...

  • 1 kudos
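The two calls the question describes can be sketched end to end. This is a minimal illustration, not official sample code: the `/metrics` suffix is an assumption based on the truncated URL in the post, the env-var names are placeholders, and endpoint types that expose no infrastructure health metrics will simply return an error from that path.

```python
import json
import os
import urllib.request


def metrics_url(host: str, endpoint_name: str) -> str:
    """Build the per-endpoint metrics URL (assumed path, per the post)."""
    return f"{host}/api/2.0/serving-endpoints/{endpoint_name}/metrics"


def list_endpoints(host: str, token: str) -> list:
    """Return the serving endpoints visible in the workspace via the List API."""
    req = urllib.request.Request(
        f"{host}/api/2.0/serving-endpoints",
        headers={"Authorization": f"Bearer {token}"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp).get("endpoints", [])


# Demo only runs when credentials are present in the environment.
host = os.environ.get("DATABRICKS_HOST")
token = os.environ.get("DATABRICKS_TOKEN")
if host and token:
    for ep in list_endpoints(host, token):
        print(ep["name"], "->", metrics_url(host, ep["name"]))
```

As the accepted answer notes, a failure on the metrics URL for some endpoints is expected behavior, not necessarily a bug in the loop above.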
jayshan
by New Contributor III
  • 1050 Views
  • 4 replies
  • 3 kudos

Resolved! Generic Spark Connect ML error. The fitted or loaded model size is too big.

When I train models in the serverless environment V4 (Premium Plan), the system occasionally returns the error message listed below, especially after running the model training code multiple times. We have tried creating new serverless sessions, whic...

Latest Reply
Ashwin_DSA
Databricks Employee

Hi @jayshan, I'm sorry for the delayed response to your question. And, thanks for the extra details and for sharing your workaround. This behaviour is tied to how Spark Connect ML works in serverless mode, rather than a traditional JVM/GC leak. On se...

  • 3 kudos
3 More Replies
tonybenzu99
by New Contributor II
  • 1646 Views
  • 2 replies
  • 3 kudos

Resolved! Is Delta Lake deeply tested in Professional Data Engineer Exam?

I wanted to ask people who have already taken the Databricks Certified Professional Data Engineer exam whether Delta Lake is tested in depth or not. While preparing, I’m currently using the Databricks Certified Professional Data Engineer sample quest...

Latest Reply
lucafredo
New Contributor III

Yes, Delta Lake concepts are an important part of the Databricks Professional Data Engineer exam, but they aren’t tested in extreme depth compared to core Spark transformations and data pipeline design. The exam mainly focuses on practical understand...

  • 3 kudos
1 More Replies
jitenjha11
by Databricks Partner
  • 436 Views
  • 2 replies
  • 3 kudos

Getting error when running databricks deploy bundle command

Hi all, I am trying to implement an MLOps project using the https://github.com/databricks/mlops-stacks repo. I have created Azure Databricks with Premium (+ Role-based access controls) and am following bundle creation and deploy using the URL: http...

Latest Reply
iyashk-DB
Databricks Employee

This is expected behavior with mlops-stacks and not an issue with your Terraform version or the CLI. The main problem is that your Azure Databricks workspace does not have Unity Catalog enabled or assigned. The mlops-stacks templates assume Unity Cat...

  • 3 kudos
1 More Replies
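Once the workspace has a Unity Catalog metastore attached, as the reply requires, a minimal bundle target looks roughly like the fragment below. The bundle name and host are placeholders, and the exact keys your mlops-stacks template generates may differ:

```yaml
# Sketch of a minimal databricks.yml target -- names and host are placeholders.
bundle:
  name: my_mlops_project

targets:
  dev:
    default: true
    workspace:
      host: https://adb-1234567890123456.7.azuredatabricks.net
```

Running `databricks bundle validate -t dev` before `databricks bundle deploy -t dev` surfaces configuration problems like the missing metastore assignment early.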
kevin11
by Valued Contributor
  • 783 Views
  • 1 reply
  • 0 kudos

AutoML Deprecation?

Hi All, It looks like AutoML is set to be deprecated with the next major version (although the note isn't specific on whether that's 18). I haven't seen any announcement or alert about this impending change. Did I just miss it? I know we have teams using t...

Latest Reply
szymon_dybczak
Esteemed Contributor III

Hi @kevin11, I guess it's their standard library deprecation policy. In their docs they mention that when a library is planned for removal, Databricks takes the following steps to notify customers. So they've added those notes to the AutoML docs. And y...

  • 0 kudos
sharpbetty
by New Contributor II
  • 4540 Views
  • 1 reply
  • 0 kudos

Custom AutoML pipeline: Beyond StandardScaler().

The automated notebook pipeline in an AutoML experiment applies StandardScaler to all numerical features in the training dataset as part of the PreProcessor. See below. But I want a more nuanced and varied treatment of my numeric values (e.g. I have l...

Latest Reply
Louis_Frolio
Databricks Employee

Greetings @sharpbetty, great question! Databricks AutoML's "glass box" approach actually gives you several options to customize preprocessing beyond the default StandardScaler. Here are two practical approaches: Option A: Pre-process Features Before ...

  • 0 kudos
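The "edit the generated notebook" route the reply describes boils down to replacing the blanket StandardScaler with per-column treatment. Here is a dependency-free sketch of that idea; in the real generated notebook you would express the same split with scikit-learn's ColumnTransformer, and the column groupings below are hypothetical:

```python
import math
from statistics import mean, pstdev


def preprocess(rows, log_idx, zscore_idx):
    """Per-column preprocessing: log1p for skewed columns, z-score for the
    rest -- instead of one StandardScaler applied to every numeric column."""
    cols = [list(c) for c in zip(*rows)]          # column-major view
    for i in log_idx:
        cols[i] = [math.log1p(v) for v in cols[i]]
    for i in zscore_idx:
        m, s = mean(cols[i]), pstdev(cols[i])
        cols[i] = [(v - m) / s for v in cols[i]]
    return [list(r) for r in zip(*cols)]          # back to row-major
```

The same function shape drops into the generated notebook's preprocessing cell, which is exactly what the glass-box workflow is designed to allow.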
dkxxx-rc
by Contributor
  • 4687 Views
  • 2 replies
  • 4 kudos

Resolved! AutoML master notebook failing

I have recently been able to run AutoML successfully on a certain dataset. But it has just failed on a second dataset of similar construction, before being able to produce any machine learning training runs or output. The Experiments page says: ```Mo...

Latest Reply
stbjelcevic
Databricks Employee

Hi @dkxxx-rc , Thanks for the detailed context. This error is almost certainly coming from AutoML’s internal handling of imbalanced data and sampling, not your dataset itself. The internal column _automl_sample_weight_0000 is created by AutoML when i...

  • 4 kudos
1 More Replies
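Since the failure traces to AutoML's internal `_automl_sample_weight_0000` column, one defensive step before launching an experiment is to scan the input schema for anything that could collide with that internal prefix. This is my suggestion, not an official API:

```python
def automl_column_conflicts(columns):
    """Flag columns that collide with AutoML's internal naming scheme.

    AutoML injects helper columns such as `_automl_sample_weight_0000`
    when it samples or balances data; a pre-existing column with that
    prefix can confuse the run.  (Defensive heuristic, not an official check.)
    """
    return [c for c in columns if c.startswith("_automl_")]
```

Renaming any flagged columns before calling AutoML removes one easy-to-miss cause of this class of failure, though per the reply the root cause here was in AutoML's own sampling path.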
SreeRam
by New Contributor
  • 4166 Views
  • 1 reply
  • 0 kudos

Patient Risk Score based on health history: Unable to create data folder for artifacts in S3 bucket

Hi All, we're using the git project below to build a PoC on the concept of "Patient-Level Risk Scoring Based on Condition History": https://github.com/databricks-industry-solutions/hls-patient-risk. I was able to import the solution into Databricks and ru...

Latest Reply
Louis_Frolio
Databricks Employee

Greetings @SreeRam , here are some suggestions for you. Based on the error you're encountering with the hls-patient-risk solution accelerator, this is a common issue related to MLflow artifact access and storage configuration in Databricks. The probl...

  • 0 kudos
sangramraje
by New Contributor
  • 4615 Views
  • 1 reply
  • 1 kudos

AutoML "need to sample" not working as expected

tl;dr: When the AutoML run realizes it needs to do sampling because the driver/worker node memory is not enough to load/process the entire dataset, it fails. A sample weight column is NOT provided by me, but I believe somewhere in the process the...

Latest Reply
Louis_Frolio
Databricks Employee

Hey @sangramraje, sorry for the late response. I wanted to check in to see if this is still an issue with the latest release. Please let me know. Cheers, Louis.

  • 1 kudos
spearitchmeta
by Contributor
  • 650 Views
  • 1 reply
  • 1 kudos

Resolved! How does Databricks AutoML handle null imputation for categorical features by default?

Hi everyone, I'm using Databricks AutoML (classification workflow) on Databricks Runtime 10.4 LTS ML+, and I'd like to clarify how missing (null) values are handled for categorical (string) columns by default. From the AutoML documentation, I see that:...

Latest Reply
Louis_Frolio
Databricks Employee

Hello @spearitchmeta, I looked internally to see if I could help with this and found some information that will shed light on your question. Here's how missing (null) values in categorical (string) columns are handled in Databricks AutoML on Dat...

  • 1 kudos
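The constant-fill behaviour for string columns can be sketched in isolation. The `"missing"` sentinel below is illustrative only; the notebook AutoML generates on your runtime shows the exact imputer and fill value it actually chose:

```python
def impute_categorical(values, fill="missing"):
    """Replace None/NaN in a string column with a constant sentinel,
    mirroring a constant-fill strategy applied before one-hot encoding.
    (Illustrative sketch -- inspect the generated notebook for the real
    imputer on your runtime.)"""
    def is_missing(v):
        # NaN is the only float that is not equal to itself.
        return v is None or (isinstance(v, float) and v != v)
    return [fill if is_missing(v) else v for v in values]
```

Treating nulls as their own category this way means the downstream encoder sees them as one more level rather than dropping the rows.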
MightyMasdo
by New Contributor III
  • 3849 Views
  • 3 replies
  • 7 kudos

Spark context not implemented Error when using Databricks connect

I am developing an application using Databricks Connect, and when I try to use VectorAssembler I get the error `sc is not None` AssertionError. Is there a workaround for this?

Latest Reply
pibe1
New Contributor II

Ran into exactly the same issue as @Łukasz1. After some googling, I found this SO post explaining the issue: later versions of Databricks Connect no longer support the SparkContext API. Our code is failing because the underlying library is trying to f...

  • 7 kudos
2 More Replies
spearitchmeta
by Contributor
  • 3724 Views
  • 4 replies
  • 3 kudos

Resolved! Data Drift & Model Comparison in Production MLOps: Handling Scale Changes with AutoML

Background: I'm implementing a production MLOps pipeline for part classification using Databricks AutoML. My pipeline automatically retrains models when new data arrives and compares performance with existing production models. The Challenge: I've encount...

Latest Reply
Louis_Frolio
Databricks Employee

Here are my thoughts on the questions you pose. However, it is important that you dig into the documentation to fully understand the capabilities of Lakehouse Monitoring. It will also be helpful if you deploy it to understand the mechanics of how it wo...

  • 3 kudos
3 More Replies
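On the drift side of this thread, one common score you can compute yourself alongside Lakehouse Monitoring's built-in metrics is the Population Stability Index. A dependency-free sketch follows; the bin count and the 1e-6 floor are conventional choices, not Databricks defaults:

```python
import math


def psi(expected, actual, bins=5):
    """Population Stability Index between two numeric samples.
    Larger values indicate a bigger distribution shift; ~0 means no drift."""
    lo = min(min(expected), min(actual))
    hi = max(max(expected), max(actual))
    width = (hi - lo) / bins or 1.0  # guard against a zero-width range

    def frac(sample, b):
        left = lo + b * width
        right = lo + (b + 1) * width
        # The last bin is closed on the right so hi itself is counted.
        n = sum(left <= v < right or (b == bins - 1 and v == hi) for v in sample)
        return max(n / len(sample), 1e-6)  # floor avoids log(0)

    return sum(
        (frac(actual, b) - frac(expected, b))
        * math.log(frac(actual, b) / frac(expected, b))
        for b in range(bins)
    )
```

A fixed, pre-agreed score like this also sidesteps the scale-change comparison problem the post raises, since it compares distributions directly rather than model metrics.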
staskh
by Contributor
  • 2002 Views
  • 3 replies
  • 3 kudos

Resolved! Error in automl.regress

Hi, I'm running the example notebook from https://docs.databricks.com/aws/en/machine-learning/automl/regression-train-api on a node with ML cluster 17.0 (includes Apache Spark 4.0.0, Scala 2.13) and getting an error at `from databricks import automl`; summary = ...

Latest Reply
staskh
Contributor

Ilir, greetings! Thank you for the prompt response. Unfortunately, none of the suggested solutions works. I checked with Genie: "The error occurs because databricks-automl is not available for Databricks Runtime 17.0.x. Databricks AutoML is not supported...

  • 3 kudos
2 More Replies
Aravinda
by New Contributor III
  • 1364 Views
  • 5 replies
  • 2 kudos

Resolved! Databricks Machine Learning Practitioner Plan - DBC section unavailability

Hi Everyone, I am not able to locate any DBC folders for the courses in the Machine Learning Practitioner plan. Earlier, we used to have DBC sections where we could access the course and lab materials. Do we have any solution to this? Or can som...

Latest Reply
Aravinda
New Contributor III

Thanks @szymon_dybczak !!

  • 2 kudos
4 More Replies
drii_cavalcanti
by New Contributor III
  • 1912 Views
  • 2 replies
  • 1 kudos

Resolved! Installing opencv-python on DBX

Hi everyone, I was wondering how I can install such a basic Python package on Databricks without running into conflict issues or downgrading to a runtime version lower than 15. Specs: the worker type is g4dn.xlarge [T4]; the runtime is 16.4 LTS (includes...

Latest Reply
szymon_dybczak
Esteemed Contributor III

Hi @drii_cavalcanti, you encountered this issue because opencv-python depends on packages that still require numpy in a version lower than 2. You need to reinstall numpy at a supported version and then try installing the library once again. You can do it usi...

  • 1 kudos
1 More Replies
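A tiny version gate for the fix described in the reply might look like this. The `numpy<2` pin and the restart call mentioned in the docstring mirror the reply's suggestion, but the exact pin should be verified against your runtime's compatibility matrix:

```python
def needs_numpy_downgrade(installed, required_below="2"):
    """Return True when the installed numpy major version is at or above
    the ceiling opencv-python's dependencies can handle.

    In a notebook, the remediation the reply describes is roughly:
        %pip install "numpy<2" opencv-python
        dbutils.library.restartPython()
    (Exact pins depend on your runtime -- verify before relying on them.)
    """
    return int(installed.split(".")[0]) >= int(required_below)
```

Checking `numpy.__version__` with this gate before installing avoids blindly downgrading on runtimes that still ship a compatible numpy.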