Databricks Community

mrstevegross · ‎05-06-2025

We use a "warmup" mechanism to get our DBR instance pool into a state where it has at-least-N instances. The logic is:

For N repetitions:
1. Request a new DBR cluster in the pool (which causes the pool to request an AWS instance)
2. Wait for the cluster to report as RUNNING
  1. If it reports as TERMINATING, abandon this iteration
3. Terminating the DBR cluster (to free it up for an upcoming real request)

Normally, this works fine. Lately, however, there's a weird issue: we hit the 1.1.1 situation ("If it reports as TERMINATING, abandon this iteration") for *all* clusters. I have occasionally seen this for 1 cluster (of, say, 40), but never for ALL of them.

What could cause DBR to transition a cluster to "TERMINATING" right after it's created?

mrstevegross · ‎05-07-2025

Aha, found it. I monitored the pool status via the DBR UI, and when a cluster *started* being provisioned, I clicked into it. Then I looked at the event log, and found useful information about failed steps. The underlying error was indeed AWS related (an issue in our role configuration).

View solution in original post

mrstevegross · ‎05-06-2025

I see that there is some documentation on the subject; I'm exploring whether AWS is actually the culprit.

mrstevegross · ‎05-07-2025

Aha, found it. I monitored the pool status via the DBR UI, and when a cluster *started* being provisioned, I clicked into it. Then I looked at the event log, and found useful information about failed steps. The underlying error was indeed AWS related (an issue in our role configuration).

Databricks Community

Trying to understand why a cluster reports as "terminating" right after being created

Join Us as a Local Community Builder!

PSA: Community Edition retires on January 1, 2026. Move to the Free Edition today to keep your work.

🎤 Call for Presentations: Data + AI Summit 2026 is Open!

Last Chance: Help Shape the 2026 Data + AI Summit | Win a Full Conference Pass

🌟 Community Pulse: Your Weekly Roundup! December 05 – 11, 2025

Jaipur Usergroup First Virtual Meetup: AI/BI Genie + Data Science Careers — 19 Dec | 6 PM IST