Spark repartition vs coalesce

Srikanth_Gupta_
Databricks Employee
 

sajith_appukutt
Databricks Employee
  • coalesce avoids a full shuffle and can be used to decrease the number of partitions
  • repartition triggers a full shuffle and can be used to either increase or decrease the number of partitions
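
To make the difference concrete, here is a minimal sketch in plain Python. It is a toy model, not Spark's implementation: the functions `coalesce` and `repartition` below are hypothetical stand-ins that operate on lists of lists, and the round-robin placement is an assumption chosen for illustration. In Spark itself the corresponding calls are `df.coalesce(n)` and `df.repartition(n)`.

```python
from typing import List

Partition = List[int]


def coalesce(partitions: List[Partition], n: int) -> List[Partition]:
    """Toy coalesce: merge whole existing partitions into n groups.

    Each original partition is moved as a unit, so records are never
    redistributed individually (no full shuffle). Asking for more
    partitions than already exist leaves the layout unchanged.
    """
    if n >= len(partitions):
        return [list(p) for p in partitions]
    merged: List[Partition] = [[] for _ in range(n)]
    for i, part in enumerate(partitions):
        # Map neighbouring source partitions onto the same target.
        merged[i * n // len(partitions)].extend(part)
    return merged


def repartition(partitions: List[Partition], n: int) -> List[Partition]:
    """Toy repartition: redistribute every record across n partitions.

    Every record may land in a different partition than it started in,
    which is what makes this a full shuffle.
    """
    out: List[Partition] = [[] for _ in range(n)]
    records = [r for part in partitions for r in part]
    for i, record in enumerate(records):
        out[i % n].append(record)  # round-robin placement (assumed)
    return out


# Four unevenly sized partitions.
parts = [[1, 2, 3], [4, 5], [6], [7, 8, 9, 10]]

coalesced = coalesce(parts, 2)             # whole partitions merged
coalesce_up = coalesce(parts, 8)           # cannot increase: still 4
repartitioned_down = repartition(parts, 2)  # records spread evenly
repartitioned_up = repartition(parts, 5)    # can also increase

print([len(p) for p in coalesced])
print([len(p) for p in repartitioned_up])
```

In this model, coalesce keeps the records of each source partition together, so the result can stay skewed if the inputs were skewed, while repartition pays for moving every record but produces evenly sized partitions.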