Hi all,
I need to run weekly maintenance on approximately 7,000 tables in my Databricks environment, involving OPTIMIZE, VACUUM, and ANALYZE TABLE (for statistics calculation) on all tables.
My question is: between the Ev4, Edv4, and Fsv2 VM series, which would be best suited for the driver and worker nodes in a Databricks cluster handling this workload, especially considering time constraints?
I’m looking for recommendations on the VM series that would minimize task completion times while balancing cost and resource efficiency.