<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Spot instances  - Best practice in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/spot-instances-best-practice/m-p/24730#M17213</link>
    <description>&lt;P&gt;We are&amp;nbsp;having difficulties running our jobs with&amp;nbsp;spot&amp;nbsp;instances&amp;nbsp;that get re-claimed by AWS during shuffles. Do we have any documentation / best-practices around this? We went through &lt;A href="https://docs.databricks.com/clusters/cluster-config-best-practices.html#on-demand-and-spot-instances" alt="https://docs.databricks.com/clusters/cluster-config-best-practices.html#on-demand-and-spot-instances" target="_blank"&gt;this&amp;nbsp;article&lt;/A&gt;&amp;nbsp;but is there anything else to keep in mind?&lt;/P&gt;</description>
    <pubDate>Mon, 14 Jun 2021 21:26:10 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2021-06-14T21:26:10Z</dc:date>
    <item>
      <title>Spot instances  - Best practice</title>
      <link>https://community.databricks.com/t5/data-engineering/spot-instances-best-practice/m-p/24730#M17213</link>
      <description>&lt;P&gt;We are&amp;nbsp;having difficulties running our jobs with&amp;nbsp;spot&amp;nbsp;instances&amp;nbsp;that get re-claimed by AWS during shuffles. Do we have any documentation / best-practices around this? We went through &lt;A href="https://docs.databricks.com/clusters/cluster-config-best-practices.html#on-demand-and-spot-instances" alt="https://docs.databricks.com/clusters/cluster-config-best-practices.html#on-demand-and-spot-instances" target="_blank"&gt;this&amp;nbsp;article&lt;/A&gt;&amp;nbsp;but is there anything else to keep in mind?&lt;/P&gt;</description>
      <pubDate>Mon, 14 Jun 2021 21:26:10 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/spot-instances-best-practice/m-p/24730#M17213</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2021-06-14T21:26:10Z</dc:date>
    </item>
    <item>
      <title>Re: Spot instances  - Best practice</title>
      <link>https://community.databricks.com/t5/data-engineering/spot-instances-best-practice/m-p/24731#M17214</link>
      <description>&lt;P&gt;What are you setting your bid price to? I think its' reasonable to set it to 100% of on-demand price, or else you may get evicted more frequently. It's also a good idea for a job like this to set only _some_ of the executors to be spot instances, so that you never lose a critical mass of executors, while saving some money otherwise.&lt;/P&gt;</description>
      <pubDate>Thu, 17 Jun 2021 23:15:49 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/spot-instances-best-practice/m-p/24731#M17214</guid>
      <dc:creator>sean_owen</dc:creator>
      <dc:date>2021-06-17T23:15:49Z</dc:date>
    </item>
    <item>
      <title>Re: Spot instances  - Best practice</title>
      <link>https://community.databricks.com/t5/data-engineering/spot-instances-best-practice/m-p/24732#M17215</link>
      <description>&lt;P&gt;Due to the recent changes in &lt;A href="https://aws.amazon.com/blogs/compute/new-amazon-ec2-spot-pricing/" alt="https://aws.amazon.com/blogs/compute/new-amazon-ec2-spot-pricing/" target="_blank"&gt;&lt;U&gt;AWS spot market place &lt;/U&gt;&lt;/A&gt;, legacy techniques like higher spot bid price (&amp;gt;100%) are ineffective to retain the acquired spot node and the instances can be lost in 2 minutes notice causing workloads to fail.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;To mitigate this, we should encourage customers to rely on -&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Using multiple instance families as part of their cluster/pool creation&lt;/LI&gt;&lt;LI&gt;Provision master node from an on demand pool &lt;/LI&gt;&lt;LI&gt;Consider using the appropriate spot allocation strategy like CAPACITY_OPTIMIZED, LOW_PRICE etc&lt;/LI&gt;&lt;/OL&gt;</description>
      <pubDate>Fri, 25 Jun 2021 22:08:38 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/spot-instances-best-practice/m-p/24732#M17215</guid>
      <dc:creator>User16783853906</dc:creator>
      <dc:date>2021-06-25T22:08:38Z</dc:date>
    </item>
  </channel>
</rss>

