Are UDFs necessary for applying models from ML libraries at scale?
Hello, I recently finished the "Scalable Machine Learning with Apache Spark" course and saw that scikit-learn models can be applied faster, in a distributed manner, when used in pandas UDFs or with the mapInPandas() method. Spark MLlib models don't need this k...
Latest Reply
MLlib is in maintenance mode, and in most cases a UDF is not used when creating a model.
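For anyone landing on this thread, here is a minimal sketch of the pattern the question describes: a scikit-learn model fitted on the driver and then applied to a Spark DataFrame through mapInPandas(). The toy training data, the column names `x1`, `x2` and `prediction`, and the helper names `predict_batches` and `score_with_spark` are all made up for illustration. It assumes pandas and scikit-learn are installed on the cluster, and it is not taken from the course.

```python
from typing import Iterator

import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression

# Hypothetical toy data: y = 2*x1 - x2, fitted on the driver with plain scikit-learn.
rng = np.random.default_rng(0)
train = pd.DataFrame({"x1": rng.normal(size=100), "x2": rng.normal(size=100)})
train["y"] = 2.0 * train["x1"] - 1.0 * train["x2"]

FEATURES = ["x1", "x2"]  # assumed feature column names
model = LinearRegression().fit(train[FEATURES], train["y"])


def predict_batches(batches: Iterator[pd.DataFrame]) -> Iterator[pd.DataFrame]:
    """Function in the shape mapInPandas expects:
    an iterator of pandas DataFrames in, an iterator of pandas DataFrames out."""
    for pdf in batches:
        out = pdf[FEATURES].copy()
        # Each batch is scored with ordinary scikit-learn code.
        out["prediction"] = model.predict(pdf[FEATURES])
        yield out


def score_with_spark(spark_df):
    """Apply the model to a Spark DataFrame that has columns x1 and x2.
    The fitted model is captured by predict_batches and serialized to the executors."""
    schema = "x1 double, x2 double, prediction double"
    return spark_df.mapInPandas(predict_batches, schema=schema)
```

With an existing Spark DataFrame `df` containing `x1` and `x2`, `score_with_spark(df).show()` would run the predictions batch by batch on the executors. A model trained with Spark MLlib does not need this wrapper, since its `transform()` method already operates on Spark DataFrames.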