Hi, @Sivagangireddy Singam​ Singam. I see that the RDD programming guide does say that the default storage level is

MEMORY_ONLY

, but the latest PySpark docs (2.4.4) state "The default storage level has changed to

MEMORY_AND_DISK

." (The PySpark docs you linked to were 2.1.2.)