Intermittent secret resolution error service fault in GCP
Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-16-2023 09:26 PM
Experiencing the error below in GCP when starting a cluster (both manually and in jobs). It's causing our ETL and other production jobs to fail multiple times a week. Its intermittent, but requires manual intervention to retry scheduled jobs.
run failed with error message Unexpected failure while waiting for the cluster (0817-041248-m827uwd4) to be ready: Cluster 0817-041248-m827uwd4 is in unexpected state Terminating: SECRET_RESOLUTION_ERROR(SERVICE_FAULT): databricks_error_message:Cannot fetch secrets referred in the Spark Environment Variables due to internal error.
1 REPLY 1
Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-18-2023 11:56 AM
Thanks @Retired_mod . 1 and 2 are confirmed fine. I would imagine 3 to not result in intermittent failures if it were a config issue, but perhaps it's another network related issue that would be susceptible to intermittent failure.
The link you provided is for a training request. Is there another place where I can file a bug report?
![](/skins/images/B38AF44D4BD6CE643D2A527BE673CCF6/responsive_peak/images/icon_anonymous_message.png)
![](/skins/images/B38AF44D4BD6CE643D2A527BE673CCF6/responsive_peak/images/icon_anonymous_message.png)