ManojkMohan
Honored Contributor II

@ck7007 brainstormed some solution approaches ., do you have some test data to test these hands on 

 

Approach                            Throughput Query Speed Complexity Notes

Partition-level zonemapsHighMediumLowScales with micro-batches; prune at partition/file level
File consolidation / OptimizeMediumHighMediumReduces metadata churn; needs tuning for latency vs file size
Deferred global indexHighMedium-HighMediumPreserves streaming throughput; query may hit unoptimized files
Bloom filters / secondary indexHighHighMediumLow false positives; good for selective queries
Delta predictive optimizationHighMedium-HighLowFully managed; minimal operational overhead

View solution in original post