Hi,
We're currently experiencing the following issue across our entire Databricks workspace whenever we start a cluster, run a workflow, or upscale a running cluster. Our all-purpose (AP) clusters and job clusters report the following errors:
Compute upsize complete, but below target size. The current worker count is 6, out of a target of 8. Reason: Storage Download Failure Slow
Cluster '0925-190009-qlelyoz' was terminated. Reason: STORAGE_DOWNLOAD_FAILURE_SLOW (CLIENT_ERROR). Parameters: databricks_error_message:Downloading worker artifacts onto the instance timed out.
As a result, workflows fail and AP clusters cannot acquire additional workers. I haven't found any similar reports in the community and would like to know how we can troubleshoot this.
Thank you,