by
umair
• New Contributor
- 2445 Views
- 1 replies
- 1 kudos
I'm running some machine learning experiments in Databricks. For the random forest algorithm, the training output changes each time I restart the cluster, even though the random state is set. Does anyone have a clue about this issue? Note: I tried the sam...
Latest Reply
RF is non-deterministic by its nature. However, as you mentioned, you can control this by using random_state. This will guarantee a deterministic result on a given system, but not necessarily across systems. SO has a topic about this, check it out, very ...
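To illustrate the point about random_state, here is a minimal sketch using scikit-learn. The synthetic dataset, the seed values, and the helper name `fit_and_predict` are assumptions made for illustration and are not taken from the original post. It only checks repeatability within one environment; it says nothing about results across different machines or library versions.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Synthetic data (assumption: stands in for the poster's dataset).
# The data generation is seeded too, so the inputs are reproducible.
X, y = make_classification(n_samples=300, n_features=10, random_state=0)


def fit_and_predict(seed):
    """Train a forest with a fixed seed and return class probabilities."""
    model = RandomForestClassifier(n_estimators=50, random_state=seed)
    model.fit(X, y)
    return model.predict_proba(X)


run_a = fit_and_predict(seed=42)
run_b = fit_and_predict(seed=42)

# Same seed, same data, same machine and library versions:
# the two runs should produce identical probabilities.
same_seed_matches = np.array_equal(run_a, run_b)
print("identical with same seed:", same_seed_matches)
```

If the outputs still differ after a cluster restart, it may be worth checking that the library versions and the order of the input rows are the same in both runs, since either can change the result even with a fixed seed.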
- 1947 Views
- 1 replies
- 0 kudos
Is there a way to automatically distribute training and model tuning across a Spark cluster, if I want to keep using scikit-learn?
Latest Reply
It depends on what you mean by "automagically." If you want to keep using scikit-learn, there are ways to distribute parts of training and tuning with minimal effort. However, there is no "magic" way to distribute the training of an individual model in scik...
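As a rough sketch of what distributing the tuning part can look like: a hyperparameter search fits many independent models, so the fits can run in parallel while each individual model still trains on a single worker. The reply is truncated, so the approach below is an assumption rather than the method it recommends. It uses joblib's backend mechanism with the local "threading" backend so that it runs anywhere; the commented lines show how the separate joblib-spark package (assumed to be installed on the cluster) registers a "spark" backend.

```python
from joblib import parallel_backend
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

# Small example dataset and grid (assumptions, chosen to keep the run short).
X, y = load_iris(return_X_y=True)
param_grid = {"n_estimators": [10, 20], "max_depth": [2, 4]}

search = GridSearchCV(RandomForestClassifier(random_state=0), param_grid, cv=3)

# "threading" runs the candidate fits locally. On a Spark cluster, the
# joblib-spark package (assumption: installed) offers a "spark" backend:
#   from joblibspark import register_spark
#   register_spark()
#   backend = "spark"
backend = "threading"

# Each (parameter set, CV fold) fit is one task handed to the backend.
# A single model is still trained on one worker, not split across workers.
with parallel_backend(backend, n_jobs=2):
    search.fit(X, y)

best_params = search.best_params_
best_score = search.best_score_
print(best_params, best_score)
```

This only parallelizes across candidate models and folds, which matches the caveat in the reply: the training of one model is not itself distributed.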
- 14427 Views
- 3 replies
- 0 kudos
Hi Team,
Can you please help me implement a KNN classifier in PySpark using a distributed architecture to process the dataset?
I also want to validate the KNN model with the testing dataset.
I tried to use scikit-learn but the program is runn...
Latest Reply
Hey, how about using the NEC Frovedis (https://github.com/frovedis/frovedis) framework for this?
Refer: https://github.com/frovedis/frovedis/blob/master/src/foreign_if/python/examples/unsupervised_knn_demo.py
It works on a distributed framework (...