What is the difference between a Narrow Transformation and Wide Transformation

aladda
Databricks Employee
Databricks Employee
 

aladda
Databricks Employee
Databricks Employee
  • Narrow Transformation: In Narrow transformation, all the elements that are required to compute the records in single partition live in the single partition of parent RDD. Ex:- Select, Filter, Union,
  • Wide Transformation: Wide transformation, all the elements that are required to compute the records in the single partition may live in many partitions of parent RDD. The partition may live in many partitions of parent RDD. Involves a network shuffle and are split between stages. Ex:- GroupBy, Repartition, Sorts

View solution in original post