Z-order or Partitioning? Which is better for Data skipping?

brickster_2018 — Tue, 22 Jun 2021 23:16:50 GMT

For Delta tables, among Z-order and Partioning which is recommended technique for efficient Data Skipping

Re: Z-order or Partitioning? Which is better for Data skipping?

brickster_2018 — Tue, 22 Jun 2021 23:19:13 GMT

Partition pruning is the most efficient way to ensure Data skipping. However, choosing the right column for partitioning is very important. It's common to see choosing the wrong column for partitioning can cause a large number of small file problems and in such scenarios, Z-order is the preferred option.

topic Re: Z-order or Partitioning? Which is better for Data skipping? in Data Engineering

Z-order or Partitioning? Which is better for Data skipping?

Re: Z-order or Partitioning? Which is better for Data skipping?