Hi everyone,
We’re currently reviewing the performance impact of enabling Photon acceleration on our Databricks jobs, particularly those involving table maintenance tasks. Our job includes three main operations: OPTIMIZE, VACUUM, and ANALYZE_COMPUTE_STATS. We’ve observed that enabling Photon significantly improves the performance of the ANALYZE_COMPUTE_STATS task—it runs much faster when Photon is enabled on the cluster.
Given that, I’m wondering if enabling Photon for the other two tasks (OPTIMIZE and VACUUM) would also lead to better performance or reduced job time. Has anyone experienced improvements in these tasks with Photon?
Also, more generally, I’d like to understand which types of tasks or workloads benefit most from Photon acceleration.
Any insights, benchmarks, or shared experiences would be really helpful. Thanks!