Liquid Clustering replaces manual partitioning and Z-Ordering with adaptive file clustering.
It keeps your data physically organized for faster queries and merges, without forcing you to manage partition columns or compaction jobs.
Itโs powered by cluster-by keys, Deltaโs internal clustering metadata, and automatic reclustering handled by the Delta optimizer.
Table Size Rough Range LC Benefit Notes
| Small | < 10 GB or < 50 million rows | Limited | Metadata overhead may outweigh benefit. Stick with Delta defaults or small Z-ORDER. |
| Medium | 10 GB โ 1 TB or 50Mโ1B rows | Strong | Ideal range โ LC improves scan times, merges, and compaction efficiency. |
| Large | > 1 TB or billions of rows | Very high | Major gains in data skipping and read performance, especially for multi-year or multi-tenant data. |
More you can find in the documentation. If you have a specific case, not generic, I am more than happy to advise.
https://docs.databricks.com/aws/en/delta/clustering
https://docs.databricks.com/aws/en/delta/best-practices
https://docs.databricks.com/aws/en/delta/optimize