<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: A Standard cluster is recommended for a single user - what is meant by that ? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/a-standard-cluster-is-recommended-for-a-single-user-what-is/m-p/25000#M17408</link>
    <description>&lt;P&gt;Thank you so much for your reply. Now it makes more sense.&lt;/P&gt;</description>
    <pubDate>Mon, 21 Mar 2022 20:06:44 GMT</pubDate>
    <dc:creator>Tahseen0354</dc:creator>
    <dc:date>2022-03-21T20:06:44Z</dc:date>
    <item>
      <title>A Standard cluster is recommended for a single user - what is meant by that ?</title>
      <link>https://community.databricks.com/t5/data-engineering/a-standard-cluster-is-recommended-for-a-single-user-what-is/m-p/24996#M17404</link>
      <description>&lt;P&gt;Hi, I have seen it written in the documentation that standard cluster is recommended for a single user. But why ? What is meant by that ? Me and one of my colleagues were testing it on the same notebook. Both of us can use the same standard all purpose cluster in the same notebook at the same time. It is just that we could not execute the same cell at the same time but that is reasonably normal. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;But if two persons can use the same standard all purpose cluster in the same notebook at the same time, then why it is recommended for single user ? Does that mean that we should select high concurrency cluster when multiple people are collaborating in the same notebook at the same time for simple data read and write experiments ?&lt;/P&gt;</description>
      <pubDate>Mon, 21 Mar 2022 18:54:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/a-standard-cluster-is-recommended-for-a-single-user-what-is/m-p/24996#M17404</guid>
      <dc:creator>Tahseen0354</dc:creator>
      <dc:date>2022-03-21T18:54:43Z</dc:date>
    </item>
    <item>
      <title>Re: A Standard cluster is recommended for a single user - what is meant by that ?</title>
      <link>https://community.databricks.com/t5/data-engineering/a-standard-cluster-is-recommended-for-a-single-user-what-is/m-p/24997#M17405</link>
      <description>&lt;UL&gt;&lt;LI&gt;Standard clusters are ideal for processing large amounts of data with Apache Spark. We recommand to use standard cluster for a single user because it meant to be handle less load compare to  high con cluster.&lt;/LI&gt;&lt;LI&gt;High Concurrency clusters are ideal for groups of users who need to share resources or run ad-hoc jobs. Administrators usually create High Concurrency clusters.The key benefits of High Concurrency clusters are that they provide Apache Spark-native fine-grained sharing for maximum resource utilization and minimum query latencies.&lt;/LI&gt;&lt;LI&gt;you can go through this best practice which may help - &lt;A href="https://docs.databricks.com/clusters/cluster-config-best-practices.html" target="test_blank"&gt;https://docs.databricks.com/clusters/cluster-config-best-practices.html&lt;/A&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 21 Mar 2022 19:08:22 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/a-standard-cluster-is-recommended-for-a-single-user-what-is/m-p/24997#M17405</guid>
      <dc:creator>Atanu</dc:creator>
      <dc:date>2022-03-21T19:08:22Z</dc:date>
    </item>
    <item>
      <title>Re: A Standard cluster is recommended for a single user - what is meant by that ?</title>
      <link>https://community.databricks.com/t5/data-engineering/a-standard-cluster-is-recommended-for-a-single-user-what-is/m-p/24998#M17406</link>
      <description>&lt;P&gt;High concurrency cluster just split resource between users more evenly. So when 4 people run notebooks in the same time on cluster with 4 cpu you can imagine that every will get 1 cpu. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;In standard cluster 1 person could utilize all worker cpus as your job have multiple partitions (for example 4) so will require multiple cores (1 cpu process 1 partition at a time so all 4 cpus will be busy processing 4 partitions) so other users' jobs will wait in queue till your job is finished.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;In standard cluster you can also maintain resource allocations on notebook level using pools. To do that set sparkContext property in first line of notebook:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;spark.sparkContext.setLocalProperty("spark.scheduler.pool", "pool1")&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 21 Mar 2022 19:11:13 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/a-standard-cluster-is-recommended-for-a-single-user-what-is/m-p/24998#M17406</guid>
      <dc:creator>Hubert-Dudek</dc:creator>
      <dc:date>2022-03-21T19:11:13Z</dc:date>
    </item>
    <item>
      <title>Re: A Standard cluster is recommended for a single user - what is meant by that ?</title>
      <link>https://community.databricks.com/t5/data-engineering/a-standard-cluster-is-recommended-for-a-single-user-what-is/m-p/24999#M17407</link>
      <description>&lt;P&gt;Thank you so much for your reply. So I think it is more related to how the load is handled, not how many users are using the cluster.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 21 Mar 2022 20:06:20 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/a-standard-cluster-is-recommended-for-a-single-user-what-is/m-p/24999#M17407</guid>
      <dc:creator>Tahseen0354</dc:creator>
      <dc:date>2022-03-21T20:06:20Z</dc:date>
    </item>
    <item>
      <title>Re: A Standard cluster is recommended for a single user - what is meant by that ?</title>
      <link>https://community.databricks.com/t5/data-engineering/a-standard-cluster-is-recommended-for-a-single-user-what-is/m-p/25000#M17408</link>
      <description>&lt;P&gt;Thank you so much for your reply. Now it makes more sense.&lt;/P&gt;</description>
      <pubDate>Mon, 21 Mar 2022 20:06:44 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/a-standard-cluster-is-recommended-for-a-single-user-what-is/m-p/25000#M17408</guid>
      <dc:creator>Tahseen0354</dc:creator>
      <dc:date>2022-03-21T20:06:44Z</dc:date>
    </item>
  </channel>
</rss>

