yesterday
What should you do when your dataset is uneven—some values appear too many times and others appear very few times—while working in Databricks?
2 hours ago
Hi @Suheb ,
Refer to really good guide prepared by Databricks team. When you have a skewed dataset the primary things you can do are following:
1. Filter skewed values
2. Apply Skew hints
3. AQE skew optimization
4. Salting
Much detailed description of above terms can be found in below guide:
Comprehensive Guide to Optimize Data Workloads | Databricks
never-displayed
Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!