08-22-2022 11:54 PM
Recommendations for performance tuning best practices on Databricks
We also recommend checking out this article from my colleague @Franco Patano on best practices for performance tuning on Databricks.
Performance tuning your workloads is an important step to take before putting your project into production. It ensures you get the best performance at the lowest cost, which helps you save money and meet your SLAs.
When tuning on Databricks, it is important to follow the framework illustrated in the diagram below:
Continued below
08-22-2022 11:54 PM
File Layout Optimization - tips for efficient file layout
08-22-2022 11:55 PM
Code Optimization - advice to avoid code bottlenecks
08-22-2022 11:55 PM
Cluster Optimization - how to choose the right cluster for your workload
08-22-2022 11:55 PM
Let us know in the comments if you have any other performance tuning tips and tricks.