Hi Debaya,
Thanks for your reply; it runs without any issues now. However, each time I rerun the model I get different cluster outputs, even after setting the seed and tolerance as shown in my code snippet.
I would expect the results to be identical once a seed is set, since fixing the seed should remove the randomness. I also increased the number of iterations, but that didn't help either.
Is there a way to get reproducible results in Spark?
Thanks
Mala