<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Why is Databricks on AWS cluster start time less than 5 mins and EMR cluster start time is 15 mins? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/why-is-databricks-on-aws-cluster-start-time-less-than-5-mins-and/m-p/28256#M20079</link>
    <description>&lt;P&gt;yes that's my assumption too. But do we have any documentation by Databricks stating that?&lt;/P&gt;</description>
    <pubDate>Mon, 10 Oct 2022 11:55:36 GMT</pubDate>
    <dc:creator>gud4eve</dc:creator>
    <dc:date>2022-10-10T11:55:36Z</dc:date>
    <item>
      <title>Why is Databricks on AWS cluster start time less than 5 mins and EMR cluster start time is 15 mins?</title>
      <link>https://community.databricks.com/t5/data-engineering/why-is-databricks-on-aws-cluster-start-time-less-than-5-mins-and/m-p/28254#M20077</link>
      <description>&lt;P&gt;We are migrating from AWS EMR to Databricks. One thing that we have noticed during the POCs is that Databricks cluster of same size and instance type takes much lesser time to start compared to EMR.&lt;/P&gt;&lt;P&gt;My understanding is Databricks also would be requesting instances from same AWS pool as EMR would do. Then why AWS's own service (EMR) is slow in getting the clusters up?&lt;/P&gt;</description>
      <pubDate>Mon, 10 Oct 2022 06:42:57 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/why-is-databricks-on-aws-cluster-start-time-less-than-5-mins-and/m-p/28254#M20077</guid>
      <dc:creator>gud4eve</dc:creator>
      <dc:date>2022-10-10T06:42:57Z</dc:date>
    </item>
    <item>
      <title>Re: Why is Databricks on AWS cluster start time less than 5 mins and EMR cluster start time is 15 mins?</title>
      <link>https://community.databricks.com/t5/data-engineering/why-is-databricks-on-aws-cluster-start-time-less-than-5-mins-and/m-p/28255#M20078</link>
      <description>&lt;P&gt;Suppose the worker provisioning is identical between EMR and Databricks (I think they are the same, but am not certain), it is very possible that installing EMR on a cluster takes more time than installing Databricks.  Databricks has worked hard to get their nodes up and running as fast as possible, perhaps Amazon did not do such a thing.&lt;/P&gt;</description>
      <pubDate>Mon, 10 Oct 2022 11:26:30 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/why-is-databricks-on-aws-cluster-start-time-less-than-5-mins-and/m-p/28255#M20078</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-10-10T11:26:30Z</dc:date>
    </item>
    <item>
      <title>Re: Why is Databricks on AWS cluster start time less than 5 mins and EMR cluster start time is 15 mins?</title>
      <link>https://community.databricks.com/t5/data-engineering/why-is-databricks-on-aws-cluster-start-time-less-than-5-mins-and/m-p/28256#M20079</link>
      <description>&lt;P&gt;yes that's my assumption too. But do we have any documentation by Databricks stating that?&lt;/P&gt;</description>
      <pubDate>Mon, 10 Oct 2022 11:55:36 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/why-is-databricks-on-aws-cluster-start-time-less-than-5-mins-and/m-p/28256#M20079</guid>
      <dc:creator>gud4eve</dc:creator>
      <dc:date>2022-10-10T11:55:36Z</dc:date>
    </item>
    <item>
      <title>Re: Why is Databricks on AWS cluster start time less than 5 mins and EMR cluster start time is 15 mins?</title>
      <link>https://community.databricks.com/t5/data-engineering/why-is-databricks-on-aws-cluster-start-time-less-than-5-mins-and/m-p/28257#M20080</link>
      <description>&lt;P&gt;@gud4eve​&amp;nbsp;what kind of cluster you are using, have you configured pools. if not as @Werner Stinckens​&amp;nbsp;said there might be chance Databricks worked hard to get provisioning of instances in faster way &lt;/P&gt;</description>
      <pubDate>Thu, 13 Oct 2022 14:41:18 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/why-is-databricks-on-aws-cluster-start-time-less-than-5-mins-and/m-p/28257#M20080</guid>
      <dc:creator>karthik_p</dc:creator>
      <dc:date>2022-10-13T14:41:18Z</dc:date>
    </item>
    <item>
      <title>Re: Why is Databricks on AWS cluster start time less than 5 mins and EMR cluster start time is 15 mins?</title>
      <link>https://community.databricks.com/t5/data-engineering/why-is-databricks-on-aws-cluster-start-time-less-than-5-mins-and/m-p/28258#M20081</link>
      <description>&lt;P&gt;I hear about trying to improve starting time at conferences for two years, so it is something like a never-ending story. Pools and serverless pools will offer further improvements. Recommended instance types are also usually better, as databricks is working with vendors on that. Additionally, I heard that GCC is now the fastest to start vms/cluster. For me, big improvements with deployment time would be that pools would have preinstalled libraries (instead of setting them on cluster level).&lt;/P&gt;</description>
      <pubDate>Sun, 16 Oct 2022 16:22:33 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/why-is-databricks-on-aws-cluster-start-time-less-than-5-mins-and/m-p/28258#M20081</guid>
      <dc:creator>Hubert-Dudek</dc:creator>
      <dc:date>2022-10-16T16:22:33Z</dc:date>
    </item>
    <item>
      <title>Re: Why is Databricks on AWS cluster start time less than 5 mins and EMR cluster start time is 15 mins?</title>
      <link>https://community.databricks.com/t5/data-engineering/why-is-databricks-on-aws-cluster-start-time-less-than-5-mins-and/m-p/28259#M20082</link>
      <description>&lt;P&gt;No we haven't configured pools&lt;/P&gt;</description>
      <pubDate>Tue, 18 Oct 2022 07:22:50 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/why-is-databricks-on-aws-cluster-start-time-less-than-5-mins-and/m-p/28259#M20082</guid>
      <dc:creator>gud4eve</dc:creator>
      <dc:date>2022-10-18T07:22:50Z</dc:date>
    </item>
  </channel>
</rss>

