Cannot Reproduce Result scikit-learn random forest

umair · ‎02-18-2022

I'm running some machine learning experiments in databricks. For random forest algorithm when i restart the cluster, each time the training output is changes even though random state is set. Anyone has any clue about this issue?

Note : I tried the same algorithm with same code in anacoda enviroment in my local machine, there is no different in the result even though I restart the machine.

-werners- · ‎02-21-2022

RF is non-deterministic by it´s nature.

However as you mentioned you can control this by using random_state.

This will guarantee a deterministic result ON A CERTAIN SYSTEM, but not necessarily over systems.

SO has a topic about this, check it out, very interesting.

View solution in original post