01-28-2022 05:54 AM
01-28-2022 07:13 AM
The Delta cache works per file, so if your dataset is split across 100 files and one file is updated, in theory only that one file should be evicted. This happens automatically.
There is also the Spark cache, which is entirely manual: you control the persist/cache operations yourself.
When you use a Databricks SQL endpoint, the Delta cache is handled automatically. On Delta cache optimized VMs it is enabled by default; on other instance types you enable it by setting it in the Spark config.
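To picture the per-file behaviour, here is a toy model in plain Python. It is only an illustration of the idea and not how Databricks implements the cache internally; the class `PerFileCache`, the version tags, and the file names are all made up for the example.

```python
from dataclasses import dataclass, field


@dataclass
class PerFileCache:
    """Toy cache keyed by file path (hypothetical, for illustration only)."""
    entries: dict = field(default_factory=dict)  # path -> (version, data)

    def read(self, path, version, load):
        """Return (data, was_cache_hit). A changed version evicts only that path."""
        cached = self.entries.get(path)
        if cached is not None and cached[0] == version:
            return cached[1], True
        data = load(path)                      # reload just this one file
        self.entries[path] = (version, data)   # replace the stale entry
        return data, False


def count_reloads_after_single_update(num_files=100):
    """Warm the cache, update one file, and count reloads on the next scan."""
    versions = {f"part-{i:03d}": 1 for i in range(num_files)}
    cache = PerFileCache()

    def load(path):
        return f"contents of {path}"

    # First scan: nothing is cached yet, so every file is loaded.
    for path, version in versions.items():
        cache.read(path, version, load)

    # One file gets updated (modelled here as a new version tag).
    versions["part-042"] = 2

    # Second scan: count how many files miss the cache.
    misses = 0
    for path, version in versions.items():
        _, hit = cache.read(path, version, load)
        if not hit:
            misses += 1
    return misses
```

With 100 files and one update, the second scan reloads a single file and serves the other 99 from the cache.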
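A minimal sketch of both options, assuming you are in a Databricks notebook where a `spark` session already exists. The config key `spark.databricks.io.cache.enabled` is the one I know from the Databricks docs for this cache; the helper function names are just mine for the example.

```python
def set_delta_cache(spark, enabled=True):
    """Turn the Delta cache on or off for the current session.

    Assumes `spark` is an existing SparkSession on a Databricks cluster.
    """
    spark.conf.set("spark.databricks.io.cache.enabled", str(enabled).lower())
    return spark.conf.get("spark.databricks.io.cache.enabled")


def cache_manually(df):
    """Spark cache: nothing is cached until you ask for it."""
    df.cache()   # marks the DataFrame for caching (lazy)
    df.count()   # an action is needed to actually materialize the cache
    return df


def release_manually(df):
    """Spark cache: you are also responsible for releasing it."""
    df.unpersist()
    return df
```

In a notebook this would be used roughly as `set_delta_cache(spark)` once per session, and `cache_manually(some_df)` / `release_manually(some_df)` around the DataFrames you want to keep in the Spark cache.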
01-31-2022 03:42 PM
@Mark Hummel - Does Hubert's answer resolve your question?
02-01-2022 05:22 AM
Yes
02-01-2022 07:57 AM
AWESOME!