Databricks Community

data-engineer-d · ‎07-03-2024

We enabled liquid clustering on one of the large tables (380GBs). This table goes many operations daily, which improved many folds after liquid clustering. However, after enabling liquid clustering and optimizing it number of files are increased.

Previously it had around 4300 files and now it shows 7900 files. Though table size is almost the same before and after.

It is clustered using two columns which are both in first 32 columns. How can we justify this increase in number of file sizes i.e decrease in data per file.

data-engineer-d · ‎07-09-2024

Thank you for detailed explanation @Retired_mod .

Databricks Community

Liquid Clustering - Number of files are increasing

🌟 Community Pulse: Your Weekly Roundup! July 06 – 12, 2026

Upcoming Community BrickTalk | Sports Analytics: Turning Tracking Data into Real-Time AI Decisions

How to Optimize Your Content for GEO: Best Practices for Writing Discoverable Community Content

Solution Accelerator Series | Building Common Sense Product Recommendations With LLMs

Databricks Community Fellows – June 2026 Recap