Merge take too long
Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-01-2024 07:54 PM - edited 09-01-2024 07:56 PM
Hi all,
I performed a merge process on approximately 19 million rows using two i3.4xlarge workers. However, the process took around 20 minutes to complete. How can I further optimize this process? I have already implemented the OPTIMIZE command and used a liquid cluster setup.
Labels:
- Labels:
-
Delta Lake
-
Spark
1 REPLY 1
Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
11-11-2024 05:08 AM
@abduldjafar Use this general doc to optimize your workload based on your job analysis
https://www.databricks.com/discover/pages/optimize-data-workloads-guide

