Hi, there a few things that could cause these types of problems.
1. Azure service availablity (when these happen check the Azure service availability to make sure there are no outages)
2. Local network connection problems (verify all your other internet connections are working fine)
3. The biggest and most likely cause is that the clusters your trying to run on are incorrectly sized for the jobs, so they are slow or you're unable to connect at all.
I would recommend starting by looking at the cluster utilisation metrics in the UI, you should be able to see whether the clusters you're trying to run are over utilised. Look for the following
- CPU Usage: Consistently high (>80%) usage may indicate under-provisioned nodes.
- Memory Usage: Look for memory saturation, which can lead to task spill or slowdowns.
- Disk I/O: High or maxed-out disk I/O could be a bottleneck if your tasks read/write intensive datasets.
