cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Optimize & Compaction

thushar
Contributor

Hi,

From which data bricks runtime will support Optimize and compaction

5 REPLIES 5

Debayan
Esteemed Contributor III
Esteemed Contributor III

Hi, DBR 9.1 LTS and above. Please refer: https://docs.databricks.com/optimizations/auto-optimize.html#enable-auto-optimize

Please let us know if this helps. 

Also, please tag @Debayanโ€‹ with your next response which will notify me. Thank you!

Thanks, only SQL version will support this feature in 9.1 LTS, PySpark scripts are availbe or not ?

-werners-
Esteemed Contributor III

yes, as the optimize is set in spark config options, this is language agnostic.

Anonymous
Not applicable

Hi @Thushar Rโ€‹ 

Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. 

We'd love to hear from you.

Thanks!

Joe_Suarez
New Contributor III

Optimize and compaction are operations commonly used in Apache Spark for optimizing and improving the performance of data storage and processing. Databricks, which is a cloud-based platform for Apache Spark, provides support for these operations on various runtime versions.

Here are some runtime versions in Databricks that support optimize and compaction:

  1. Databricks Runtime 7.2 and later versions: Databricks Runtime 7.2 introduced a new feature called Delta Lake Auto Optimize, which enables automatic optimization and compaction of Delta Lake tables based on a set of predefined rules. This feature is available in all later versions of Databricks Runtime as well.
  2. Databricks Runtime 6.4 and later versions: Databricks Runtime 6.4 introduced the OPTIMIZE command for Delta Lake tables, which enables manual optimization and compaction of Delta Lake tables.
  3. Earlier runtime versions: Optimize and compaction are not available in earlier versions of Databricks Runtime. If you're using an earlier version, you can consider upgrading to a later version to take advantage of these features.

Join 100K+ Data Experts: Register Now & Grow with Us!

Excited to expand your horizons with us? Click here to Register and begin your journey to success!

Already a member? Login and join your local regional user group! If there isn’t one near you, fill out this form and we’ll create one for you to join!