Administration & Architecture
Explore discussions on Databricks administration, deployment strategies, and architectural best practices. Connect with administrators and architects to optimize your Databricks environment for performance, scalability, and security.

How to monitor and kill rogue Genie benchmark evaluation processes?

M5C
New Contributor

I started a Genie benchmark evaluation, but it never completes and keeps running (Execution Status = Running). I cannot pause or stop it: the Pause button for the overall evaluation attempts to suspend the process but never succeeds, and the pause/stop options for the individual benchmark steps are greyed out. Where can I monitor this process within Databricks, and how can I kill it? Subsequent evaluation runs have completed successfully, but this one is still running and looks like it will never complete.

1 REPLY

Ashwin_DSA
Databricks Employee

Hi @M5C,

Sorry you’re running into this.

Based on the public Genie monitoring docs ("Test and monitor a Genie Space"), benchmark evaluations run in the background and can be monitored from the Benchmarks / Evaluations area of the Genie space, where you can review execution status and drill into individual runs.

If one evaluation remains stuck in Running and the Pause action does not work, I’m not seeing a documented self-service force-cancel option in the public docs. In that case, the best next step would usually be to open a Databricks support case so the team can investigate and clean up the stuck backend run.

It would help to include:

  • the workspace ID
  • the Genie space ID
  • the name or timestamp of the stuck evaluation run
  • screenshots showing the run stuck in Running and the unsuccessful pause attempt
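
If it helps, the details above can be collected into a short plain-text summary to paste into the support case. This is only a minimal Python sketch: the `StuckRunReport` class, its field names, and the placeholder values are hypothetical and are not part of any Databricks API.

```python
from dataclasses import dataclass, field


@dataclass
class StuckRunReport:
    """Hypothetical container for the support-case details (not a Databricks API)."""

    workspace_id: str
    genie_space_id: str
    run_name_or_timestamp: str
    screenshot_paths: list[str] = field(default_factory=list)

    def to_text(self) -> str:
        """Render the details as plain text for the support case description."""
        lines = [
            "Genie benchmark evaluation stuck in Running",
            f"Workspace ID: {self.workspace_id}",
            f"Genie space ID: {self.genie_space_id}",
            f"Stuck run (name or timestamp): {self.run_name_or_timestamp}",
            "Screenshots:",
        ]
        # List each screenshot, or note that none were attached.
        lines += [f"  - {p}" for p in self.screenshot_paths] or ["  (none attached)"]
        return "\n".join(lines)


if __name__ == "__main__":
    # Placeholder values only; replace them with the real IDs from your workspace.
    report = StuckRunReport(
        workspace_id="<workspace-id>",
        genie_space_id="<genie-space-id>",
        run_name_or_timestamp="<run-name-or-timestamp>",
        screenshot_paths=["stuck_running.png", "pause_attempt.png"],
    )
    print(report.to_text())
```

Keeping the IDs and run identifier together in one block of text should make it easier for support to locate the stuck backend run.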

Hopefully, that helps point you in the right direction, and I'm sorry for not getting back to you sooner.

If this answer resolves your question, could you mark it as “Accept as Solution”? That helps other users quickly find the correct fix.

Regards,
Ashwin | Delivery Solution Architect @ Databricks
Helping you build and scale the Data Intelligence Platform.
***Opinions are my own***