Selective overwrite on Partition and Liquid clustered tables
Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Friday - last edited Friday
Hi,
I have created 2 identical tables but one is partitioned and the one is a Liquid Clustered with Auto Clustering.
I inserted 30M rows x 2 (60M) for two dates , date 1 = 2026-06-01 and date = 2026-06-02 , then I overwrite the date 2026-06-02 with a selective overwrite statement
REPLACE USING (BED)
For partition table , history shows ,
| operationParameters | |
object
| |
| operationMetrics | |
object
|
For Liquid Clustered table
| operationParameters | |
object
| |
| operationMetrics | |
object
|
Its overwriting 43 files (Total 86) ?, is this optimal ?
Is there a way to improve performance by reducing the # of files ?
I am using
INSERT INTO <target> REPLACE USING (col)
SELECT <cols> FROM <table>
Labels:
- Labels:
-
Delta Lake