cancel
Showing results for 
Search instead for 
Did you mean: 
ck7007
Contributor
since ‎08-19-2025
‎09-09-2025

User Stats

  • 17 Posts
  • 2 Solutions
  • 36 Kudos given
  • 9 Kudos received

User Activity

Zero-Downtime Unity Catalog Migration for 500TB Data LakeJust completed migrating 500TB to Unity Catalog without a single minute of downtime. Here's how:The Challenge500 TB across 12,000 tables200+ concurrent usersZero tolerance for downtimeMixed Hiv...
Maintain Zonemaps with Streaming Writes Challenge: Streaming breaks zonemaps due to constant micro-batches.Solution: Incremental Updatesdef write_streaming_with_zonemap(stream_df, table_path):def update_zonemap(batch_df, batch_id):# Write databatch_d...
Reduced Monthly Databricks Bill from $47K to $12.7KThe Problem: We were scanning 2.3TB for queries needing only 8GB of data.Three Quick Wins1. Multi-dimensional Partitioning (30% savings)# Beforedf.write.partitionBy("date").parquet(path)# After-parti...
Bloom Filters + Zonemaps: The Ultimate Query Optimization ComboAfter my zonemap post last week got great feedback, several of you asked about Bloom filter integration. Here's the complete implementation!Why Bloom Filters Changed EverythingZonemaps ar...
Reduced Monthly Databricks Bill from $47K to $12.7KThe Problem: We were scanning 2.3TB for queries needing only 8GB of data.Three Quick Wins1. Multi-dimensional Partitioning (30% savings)# Beforedf.write.partitionBy("date").parquet(path)# After-parti...