Out-of-the-box, self-tuning data layout that scales with your data
We’re excited to announce the General Availability of Delta Lake Liquid Clustering in the Databricks Data Intelligence Platform. Liquid Clustering is an innovative data management technique that replaces table partitioning and ZORDER so you no longer have to fine-tune your data layout to achieve optimal query performance.
Liquid clustering significantly simplifies data layout-related decisions and provides the flexibility to redefine clustering keys without data rewrites. It allows data layout to evolve alongside analytic needs over time – something you could never do with partitioning on Delta.
Since the Public Preview of Liquid Clustering at the Data and AI Summit last year, we’ve worked with hundreds of customers who benefited from better query performance with Liquid Clustering. During that time, we have 1000+ active customers, and have written 100+ petabytes to and readnearly 20 exabytes from Liquid clustered tables. Customers have seen Liquid improve read performance by 2-12x compared to traditional methods.
Continue to read.