Topics with Label: Hyperopt

Forum Posts

by RiyazAli • Valued Contributor

06-21-2022 1:42:00 AM

1642 Views
2 replies
3 kudos

Errors in notebooks of the Scalable Machine Learning with Apache Spark course in Databricks Academy

Hi there, I'm following the course mentioned above from Databricks Academy. I downloaded the .dbc archive and am working alongside the Academy videos. In the ML-08 - Hyperopt notebook, I see the following error in cmd 13: best_hyperparam = fmin(fn=objectiv...

Data Engineering

Latest Reply

RiyazAli
Valued Contributor

06-22-2022 5:51:15 AM

3 kudos

Tagging @Kaniz Fatma as there has been no response whatsoever! By any chance, do you know how to resolve these errors in the notebook? Thanks!

by User16789201666 • Contributor II

06-23-2021 7:45:18 AM

797 Views
0 replies
0 kudos

Hyperopt: how do I set up categorical vs. numerical hyperparameters?

Use hp.quniform (“quantized uniform”) or hp.qloguniform to generate integers. hp.choice is the right choice when, for example, choosing among categorical options (which might in some situations even be integers, but not usually). https://databricks.com/b...

Data Engineering

by User16752240150 • New Contributor II

06-04-2021 12:34:03 PM

792 Views
1 reply
0 kudos

What's the best way to use hyperopt to train a spark.ml model and track automatically with mlflow?

I've read this article, which covers: using CrossValidator or TrainValidationSplit to track hyperparameter tuning (no hyperopt, only random/grid search); parallel "single-machine" model training with hyperopt using hyperopt.SparkTrials (not spark.ml); "Di...

Data Engineering

Latest Reply

sean_owen
Honored Contributor II

06-17-2021 5:00:45 PM

0 kudos

It's actually pretty simple: use hyperopt, but use "Trials" not "SparkTrials". You get parallelism from Spark, not from the tuning process.

by User16857281869 • New Contributor II

06-17-2021 1:34:22 AM

739 Views
1 reply
0 kudos

How do I benefit from parallelisation when doing machine learning?

There are in principle four distinct ways of using parallelisation when doing machine learning. Any combination of these can speed up the whole pipeline significantly. 1) Using Spark distributed processing in feature engineering 2) When the data set...

Data Engineering

Latest Reply

sean_owen
Honored Contributor II

06-17-2021 11:25:11 AM

0 kudos

Good summary! Yes, those are the main strategies I can think of.

by Joseph_B • New Contributor III

06-09-2021 5:51:24 PM

519 Views
1 reply
0 kudos

When doing hyperparameter tuning with Hyperopt, when should I use SparkTrials? Does it work with both single-machine ML (like sklearn) and distributed ML (like Apache Spark ML)?

I want to know how to use Hyperopt in different situations: tuning a single-machine algorithm from scikit-learn or single-node TensorFlow, and tuning a distributed algorithm from Spark ML or distributed TensorFlow / Horovod.

Data Engineering

Latest Reply

Joseph_B
New Contributor III

06-09-2021 5:56:20 PM

0 kudos

The right question to ask is indeed: Is the algorithm you want to tune single-machine or distributed? If it's a single-machine algorithm like any from scikit-learn, then you can use SparkTrials with Hyperopt to distribute hyperparameter tuning. If it's...
