<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to send alert when cluster is running for too long in Get Started Discussions</title>
    <link>https://community.databricks.com/t5/get-started-discussions/how-to-send-alert-when-cluster-is-running-for-too-long/m-p/41176#M5697</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9"&gt;@Retired_mod&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;Thank you for your reply. I marked your response as the solution; however, my company must use a private Databricks deployment due to the nature of its business and is missing many of the features available in the latest release of "normal" Databricks. This appears to be one of them as I don't see any of the options listed in the instructions to add duration warnings when editing notifications for a job.&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 23 Aug 2023 14:05:24 GMT</pubDate>
    <dc:creator>kurtrm</dc:creator>
    <dc:date>2023-08-23T14:05:24Z</dc:date>
    <item>
      <title>How to send alert when cluster is running for too long</title>
      <link>https://community.databricks.com/t5/get-started-discussions/how-to-send-alert-when-cluster-is-running-for-too-long/m-p/41075#M5695</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;Our team recently experienced an issue where a teammate started a new workflow job then went on vacation. This job ended up running continuously without failing for 4.5 days. The usage of the cluster did not seem out of place during the workday since we are all putting load on it, it's an all purpose cluster that runs jobs at night so it's normal if it is on with high usage then, and we're not looking at the cluster outside of hours. Only during one of our meetings did someone notice that it was running at max for an unusually long time. We determined the culprit,&amp;nbsp;&lt;STRONG&gt;&lt;EM&gt;and we are aware of the ability to add max timeouts for jobs,&lt;/EM&gt;&lt;/STRONG&gt; but we still feel there should be safeguards in place to prevent an occurrence like this in the future.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Is there a way to send a notification or alert in the event the cluster has been running for X period of time without termination?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Thank you!&lt;/P&gt;</description>
      <pubDate>Tue, 22 Aug 2023 23:31:11 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/how-to-send-alert-when-cluster-is-running-for-too-long/m-p/41075#M5695</guid>
      <dc:creator>kurtrm</dc:creator>
      <dc:date>2023-08-22T23:31:11Z</dc:date>
    </item>
    <item>
      <title>Re: How to send alert when cluster is running for too long</title>
      <link>https://community.databricks.com/t5/get-started-discussions/how-to-send-alert-when-cluster-is-running-for-too-long/m-p/41176#M5697</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9"&gt;@Retired_mod&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;Thank you for your reply. I marked your response as the solution; however, my company must use a private Databricks deployment due to the nature of its business and is missing many of the features available in the latest release of "normal" Databricks. This appears to be one of them as I don't see any of the options listed in the instructions to add duration warnings when editing notifications for a job.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 23 Aug 2023 14:05:24 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/how-to-send-alert-when-cluster-is-running-for-too-long/m-p/41176#M5697</guid>
      <dc:creator>kurtrm</dc:creator>
      <dc:date>2023-08-23T14:05:24Z</dc:date>
    </item>
    <item>
      <title>Re: How to send alert when cluster is running for too long</title>
      <link>https://community.databricks.com/t5/get-started-discussions/how-to-send-alert-when-cluster-is-running-for-too-long/m-p/41354#M5699</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9"&gt;@Retired_mod&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;I ended up creating a job leveraging the Databricks Python SDK to check cluster and active job run times. The script will raise an error and notify the team if the cluster hasn't terminated or restarted in the past 24 hours or if a job has been running in excess of 6 hours. Thank you again for your help!&lt;/P&gt;&lt;P&gt;Kurt&lt;/P&gt;</description>
      <pubDate>Thu, 24 Aug 2023 14:20:34 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/how-to-send-alert-when-cluster-is-running-for-too-long/m-p/41354#M5699</guid>
      <dc:creator>kurtrm</dc:creator>
      <dc:date>2023-08-24T14:20:34Z</dc:date>
    </item>
  </channel>
</rss>

