06-28-2022 10:03 PM
06-29-2022 05:33 AM
Hi @Abdullah Durrani,
Spark workers will spill the data on disk if the dataset is larger than the memory size.
I'd advise you to follow the best practices page https://docs.databricks.com/clusters/cluster-config-best-practices.html#cluster-sizing-consideration... to determine what cluster size you should configure for your use case.
06-29-2022 05:12 AM
@Kaniz Fatma @Cedric Law Hing Ping
06-29-2022 05:33 AM
Hi @Abdullah Durrani,
Spark workers will spill the data on disk if the dataset is larger than the memory size.
I'd advise you to follow the best practices page https://docs.databricks.com/clusters/cluster-config-best-practices.html#cluster-sizing-consideration... to determine what cluster size you should configure for your use case.
Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!
Sign Up Now