Databricks Community

payalbhatia · 07-21-2024

What if I have lot of empty shuffled partitions due to data skewness Secondly , if the shuffle partition size is 128 MB and if the size of the key's partition is 700 MB

payalbhatia · 07-21-2024

I have follow up questions here :1) OP mentions about the 1 GB of data in each folder. So , the spark will read ~8 partitions on 8 cores(if there ) ?2)what if I get empty partitions after shuffle?

Databricks Community

User Stats

User Activity

Shuffle Partitions

Re: Partition in Spark