Hi everyone,
Iโm facing a connectivity issue in my Databricks workspace on GCP and would appreciate any guidance.
Problem
When I run commands from a Databricks notebook, I see intermittent errors like:
Connection reset
Retrying request to https://us-east4.gcp.databricks.com:443
However, when I test connectivity manually from the cluster node using curl, it works fine.
What I verified
- Direct connectivity works
curl -v https://us-east4.gcp.databricks.com
- Resolves to public IP (34.x.x.x)
- TLS handshake successful
- Returns HTTP 303 โ /login.html
- DNS resolution is correct
getent hosts us-east4.gcp.databricks.com
โ 34.128.x.x
- Proxy removed
- Removed HTTP_PROXY / HTTPS_PROXY environment variables
- Verified no proxy is being used
Issue inside Databricks runtime
- Notebook / Spark jobs still show:
- Connection reset
- Retry attempts in logs
Questions
- Is this expected behavior due to connection reuse / keep-alive in Databricks runtime?
- Could this be related to JVM/Spark HTTP client behavior?
- Are there recommended configurations to avoid these connection reset logs?
- When should this be considered a real failure vs harmless retry?