cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Any recommendations on instance type for z-order / vacuum/ optimize ?

Anonymous
Not applicable
 
1 ACCEPTED SOLUTION

Accepted Solutions

aladda
Databricks Employee
Databricks Employee

For Delta in general having Delta cache accelerates data reads by creating copies of remote files in nodesโ€™ local storage using a fast intermediate data format. The data is cached automatically whenever a file has to be fetched from a remote location. Successive reads of the same data are then performed locally, which results in significantly improved reading speed. And this can be useful even when performing merges where data has to be read.

https://docs.databricks.com/delta/optimizations/delta-cache.html#use-delta-caching

The recommended (and easiest) way to use Delta caching is to choose a Delta Cache Acceleratedโ€”

i3 or i3en seriesโ€”worker type when you configure your cluster. Such workers are enabled and configured for Delta caching.

View solution in original post

1 REPLY 1

aladda
Databricks Employee
Databricks Employee

For Delta in general having Delta cache accelerates data reads by creating copies of remote files in nodesโ€™ local storage using a fast intermediate data format. The data is cached automatically whenever a file has to be fetched from a remote location. Successive reads of the same data are then performed locally, which results in significantly improved reading speed. And this can be useful even when performing merges where data has to be read.

https://docs.databricks.com/delta/optimizations/delta-cache.html#use-delta-caching

The recommended (and easiest) way to use Delta caching is to choose a Delta Cache Acceleratedโ€”

i3 or i3en seriesโ€”worker type when you configure your cluster. Such workers are enabled and configured for Delta caching.

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโ€™t want to miss the chance to attend and share knowledge.

If there isnโ€™t a group near you, start one and help create a community that brings people together.

Request a New Group