โ09-18-2024 01:18 AM
Iโm currently working on a project using Delta Lake on Databricks and would like to optimize performance for both batch and streaming workloads.
โ09-18-2024 04:21 AM - edited โ09-18-2024 04:25 AM
Hi @joshbuttler ,
It's a big topic and it really depends on the use case. You can follow some guides/recommendations from databricks documentation:- Optimization recommendations on Databricks | Databricks on AWS
- Use liquid clustering for Delta tables | Databricks on AWS
- Comprehensive Guide to Optimize Data Workloads | Databricks
And if your company is databricks partner then you have access to a nice course on optimization in Databricks Academy for free.
never-displayed
Passionate about hosting events and connecting people? Help us grow a vibrant local communityโsign up today to get started!