- DBFS is just an abstraction on cloud storage
By default when you create a workspace, you get an instance of DBFS - so-called DBFS Root. Plus you can mount additional storage accounts under the
/mnt
folder. Data written to mount point paths (
/mnt
) is stored outside of the DBFS root. Even though the DBFS root is writeable, It's recommended that you store data in mounted object storage rather than in the DBFS root. The mounted storage does't change the performance of read and write , spark stills read the data in form of chunks and parallel processing is still applicable so there is no question of production degradation