Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

by RiyazAli, Valued Contributor
  • 2447 Views
  • 1 reply
  • 3 kudos

Errors in notebooks of the Scalable Machine Learning with Apache Spark course in Databricks Academy

Hi there, I'm following the course mentioned from Databricks Academy. I downloaded the .dbc archive and am working alongside the videos from the Academy. In the ML-08 - Hyperopt notebook, I see the following error in cmd 13: best_hyperparam = fmin(fn=objectiv...

Tags: hyperopt_implementation, hyperopt, problem with "max_features"
Latest Reply
RiyazAli
Valued Contributor
  • 3 kudos

Tagging @Kaniz Fatma as there was no response whatsoever! By any chance, do you know how to resolve these errors in the notebook? Thanks!
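
The notebook's stack trace isn't reproduced in the thread, but a frequent cause of fmin errors around parameters like max_features is that hp.quniform returns floats while scikit-learn expects integers. A minimal sketch of the pattern, with an illustrative objective and search space (the actual ML-08 code may differ):

from hyperopt import STATUS_OK, Trials, fmin, hp, tpe
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import cross_val_score

# Toy data standing in for the course dataset.
X, y = make_regression(n_samples=200, n_features=10, random_state=42)

def objective(params):
    # hp.quniform yields floats; cast to int where sklearn requires integers.
    # Passing a raw float for max_features is a common source of errors here.
    model = RandomForestRegressor(
        max_features=int(params["max_features"]),
        max_depth=int(params["max_depth"]),
        random_state=42,
    )
    loss = -cross_val_score(model, X, y, cv=3).mean()
    return {"loss": loss, "status": STATUS_OK}

search_space = {
    "max_features": hp.quniform("max_features", 1, 10, 1),
    "max_depth": hp.quniform("max_depth", 2, 10, 1),
}

best_hyperparam = fmin(
    fn=objective,
    space=search_space,
    algo=tpe.suggest,
    max_evals=16,
    trials=Trials(),
)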

by Joseph_B, Databricks Employee
  • 1801 Views
  • 1 reply
  • 0 kudos

How should I tune hyperparameters when fitting models for every item?

My dataset has an "item" column which groups the rows into many groups. (Think of these groups as items in a store.) I want to fit 1 ML model per group. Should I tune hyperparameters for each group separately? Or should I tune them for the entire...

Latest Reply
Joseph_B
Databricks Employee
  • 0 kudos

For the first question ("which option is better?"), you need to answer that via your understanding of the problem domain. Do you expect similar behavior across the groups (items)? If so, that's a +1 in favor of sharing hyperparameters. And vice versa....
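
If per-group tuning wins out, one way to scale it is to run an independent Hyperopt search inside each group with applyInPandas. A sketch under assumed column names (item, x, y) and an illustrative search space:

import pandas as pd
from hyperopt import Trials, fmin, hp, tpe
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import cross_val_score

def tune_one_group(pdf: pd.DataFrame) -> pd.DataFrame:
    # Each call receives one group's rows as a pandas DataFrame.
    X, y = pdf[["x"]], pdf["y"]

    def objective(params):
        model = RandomForestRegressor(max_depth=int(params["max_depth"]), random_state=0)
        return -cross_val_score(model, X, y, cv=3).mean()

    best = fmin(
        fn=objective,
        space={"max_depth": hp.quniform("max_depth", 2, 10, 1)},
        algo=tpe.suggest,
        max_evals=10,
        trials=Trials(),
    )
    return pd.DataFrame(
        {"item": [pdf["item"].iloc[0]], "best_max_depth": [best["max_depth"]]}
    )

# df is a Spark DataFrame with columns item, x, y; groups are tuned in parallel.
best_params = df.groupBy("item").applyInPandas(
    tune_one_group, schema="item string, best_max_depth double"
)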

by User16826994223, Honored Contributor III
  • 860 Views
  • 0 replies
  • 0 kudos

Best practices: Hyperparameter tuning with Hyperopt

Bayesian approaches can be much more efficient than grid search and random search. Hence, with the Hyperopt Tree of Parzen Estimators (TPE) algorithm, you can explore more hyperparameters and larger ...
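
To make the comparison concrete: in Hyperopt, switching from random search to TPE is just a change of the algo argument. A minimal sketch with a placeholder objective:

from hyperopt import Trials, fmin, hp, rand, tpe

space = {"lr": hp.loguniform("lr", -7, 0)}  # e.g., a learning rate on a log scale

def objective(params):
    # Placeholder loss; in practice, train and evaluate a model here.
    return (params["lr"] - 0.01) ** 2

# Random search baseline: samples the space uniformly at random.
best_rand = fmin(objective, space, algo=rand.suggest, max_evals=50, trials=Trials())

# TPE adapts to earlier results, so the same budget explores the space more efficiently.
best_tpe = fmin(objective, space, algo=tpe.suggest, max_evals=50, trials=Trials())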

by User16752240150, New Contributor II
  • 1243 Views
  • 1 reply
  • 0 kudos

What's the best way to use hyperopt to train a spark.ml model and track automatically with mlflow?

I've read this article, which covers:
  • Using CrossValidator or TrainValidationSplit to track hyperparameter tuning (no hyperopt); only random/grid search
  • Parallel "single-machine" model training with hyperopt using hyperopt.SparkTrials (not spark.ml)
  • "Di...

Latest Reply
sean_owen
Databricks Employee
  • 0 kudos

It's actually pretty simple: use hyperopt, but use "Trials" not "SparkTrials". You get parallelism from Spark, not from the tuning process.
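
A sketch of that pattern: Hyperopt runs the search sequentially with plain Trials, each trial fits a distributed spark.ml model, and nested MLflow runs record the trials. The DataFrames (train_df, val_df), column names, and search space here are illustrative:

import mlflow
from hyperopt import STATUS_OK, Trials, fmin, hp, tpe
from pyspark.ml.evaluation import RegressionEvaluator
from pyspark.ml.regression import RandomForestRegressor

evaluator = RegressionEvaluator(labelCol="label", metricName="rmse")

def objective(params):
    with mlflow.start_run(nested=True):
        rf = RandomForestRegressor(
            labelCol="label",
            featuresCol="features",
            maxDepth=int(params["maxDepth"]),
            numTrees=int(params["numTrees"]),
        )
        model = rf.fit(train_df)  # distributed training on the cluster
        rmse = evaluator.evaluate(model.transform(val_df))
        mlflow.log_params(params)
        mlflow.log_metric("rmse", rmse)
        return {"loss": rmse, "status": STATUS_OK}

search_space = {
    "maxDepth": hp.quniform("maxDepth", 2, 10, 1),
    "numTrees": hp.quniform("numTrees", 10, 100, 10),
}

with mlflow.start_run():
    best = fmin(
        fn=objective,
        space=search_space,
        algo=tpe.suggest,
        max_evals=16,
        trials=Trials(),  # NOT SparkTrials: parallelism comes from Spark itself
    )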

by Joseph_B, Databricks Employee
  • 900 Views
  • 1 reply
  • 0 kudos

When doing hyperparameter tuning with Hyperopt, when should I use SparkTrials? Does it work with both single-machine ML (like sklearn) and distributed ML (like Apache Spark ML)?

I want to know how to use Hyperopt in different situations:
  • Tuning a single-machine algorithm from scikit-learn or single-node TensorFlow
  • Tuning a distributed algorithm from Spark ML or distributed TensorFlow / Horovod

Latest Reply
Joseph_B
Databricks Employee
  • 0 kudos

The right question to ask is indeed: Is the algorithm you want to tune single-machine or distributed? If it's a single-machine algorithm like any from scikit-learn, then you can use SparkTrials with Hyperopt to distribute hyperparameter tuning. If it's...
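
For the single-machine case, a minimal sketch of SparkTrials distributing a scikit-learn search across the cluster (dataset and search space are illustrative):

from hyperopt import SparkTrials, fmin, hp, tpe
from sklearn.datasets import load_diabetes
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_score

X, y = load_diabetes(return_X_y=True)

def objective(params):
    model = Ridge(alpha=params["alpha"])
    # Negate R^2 so fmin minimizes.
    return -cross_val_score(model, X, y, cv=5).mean()

best = fmin(
    fn=objective,
    space={"alpha": hp.loguniform("alpha", -5, 2)},
    algo=tpe.suggest,
    max_evals=32,
    trials=SparkTrials(parallelism=8),  # each trial runs as a Spark task
)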
