<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Questions About Workers and Executors Configuration in Databricks in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141483#M51740</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/199978"&gt;@dvd_lg_bricks&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;1&lt;STRONG&gt;. &lt;/STRONG&gt;Yes.In Databricks:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P&gt;A &lt;STRONG&gt;cluster&lt;/STRONG&gt; = driver node + one or more &lt;STRONG&gt;worker nodes&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Each &lt;STRONG&gt;worker&lt;/STRONG&gt; is a &lt;EM&gt;separate &lt;/EM&gt;(VM)&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;2. &lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;Databricks&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;runs one executor per worker node. Therefore, the terms executor and worker are used interchangeably in the context of the Databricks architecture.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 09 Dec 2025 10:00:41 GMT</pubDate>
    <dc:creator>szymon_dybczak</dc:creator>
    <dc:date>2025-12-09T10:00:41Z</dc:date>
    <item>
      <title>Questions About Workers and Executors Configuration in Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141471#M51734</link>
      <description>&lt;P&gt;Hi everyone, sorry, I’m new here. I’m considering migrating to Databricks, but I need to clarify a few things first.&lt;/P&gt;&lt;P&gt;When I define and launch an application, I see that I can specify the number of &lt;EM&gt;workers&lt;/EM&gt;, and then later configure the number of &lt;EM&gt;executors&lt;/EM&gt;.&lt;BR /&gt;My questions are:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Are those workers running on different machines?&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Can I define how many executors run on each worker?&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;If so, is this controlled through Spark configuration variables?&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Tue, 09 Dec 2025 07:10:06 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141471#M51734</guid>
      <dc:creator>dvd_lg_bricks</dc:creator>
      <dc:date>2025-12-09T07:10:06Z</dc:date>
    </item>
    <item>
      <title>Re: Questions About Workers and Executors Configuration in Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141483#M51740</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/199978"&gt;@dvd_lg_bricks&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;1&lt;STRONG&gt;. &lt;/STRONG&gt;Yes.In Databricks:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P&gt;A &lt;STRONG&gt;cluster&lt;/STRONG&gt; = driver node + one or more &lt;STRONG&gt;worker nodes&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Each &lt;STRONG&gt;worker&lt;/STRONG&gt; is a &lt;EM&gt;separate &lt;/EM&gt;(VM)&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;2. &lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;Databricks&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;runs one executor per worker node. Therefore, the terms executor and worker are used interchangeably in the context of the Databricks architecture.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 09 Dec 2025 10:00:41 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141483#M51740</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2025-12-09T10:00:41Z</dc:date>
    </item>
    <item>
      <title>Re: Questions About Workers and Executors Configuration in Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141486#M51741</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/199978"&gt;@dvd_lg_bricks&lt;/a&gt;&amp;nbsp;- Databricks does the spark-cluster management.&amp;nbsp;&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Are those workers running on different machines? - &lt;/STRONG&gt;Yes, it runs one VM per worker&amp;nbsp;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Can I define how many executors run on each worker? - &lt;/STRONG&gt;No, Databricks runs 1 executor per worker&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;If so, is this controlled through Spark configuration variables? - &lt;/STRONG&gt;not on databricks. but on vanilla spark, you would manage it using below 3 variables -&amp;nbsp;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P&gt;&lt;SPAN&gt;spark.executor.instances&lt;/SPAN&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;SPAN&gt;spark.executor.cores&lt;/SPAN&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;SPAN&gt;spark.executor.memory&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/OL&gt;</description>
      <pubDate>Tue, 09 Dec 2025 10:56:55 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141486#M51741</guid>
      <dc:creator>Raman_Unifeye</dc:creator>
      <dc:date>2025-12-09T10:56:55Z</dc:date>
    </item>
    <item>
      <title>Re: Questions About Workers and Executors Configuration in Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141490#M51744</link>
      <description>&lt;P&gt;Thank you both,&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/110502"&gt;@szymon_dybczak&lt;/a&gt;&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/179607"&gt;@Raman_Unifeye&lt;/a&gt;&amp;nbsp;for your feedback. I really appreciate the time and clarity you provided.&lt;/P&gt;</description>
      <pubDate>Tue, 09 Dec 2025 11:16:22 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141490#M51744</guid>
      <dc:creator>dvd_lg_bricks</dc:creator>
      <dc:date>2025-12-09T11:16:22Z</dc:date>
    </item>
    <item>
      <title>Re: Questions About Workers and Executors Configuration in Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141504#M51745</link>
      <description>&lt;P&gt;No problem&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/199978"&gt;@dvd_lg_bricks&lt;/a&gt;&amp;nbsp; &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 09 Dec 2025 12:10:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141504#M51745</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2025-12-09T12:10:54Z</dc:date>
    </item>
    <item>
      <title>Re: Questions About Workers and Executors Configuration in Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141510#M51746</link>
      <description>&lt;P&gt;I mean: while we’re at it &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/110502"&gt;@szymon_dybczak&lt;/a&gt;&amp;nbsp;or &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/179607"&gt;@Raman_Unifeye&lt;/a&gt;&amp;nbsp;, is there a place where all available Databricks configuration parameters are documented? I have some pipelines that rely on special settings, such as changing the serializer, enabling Apache Arrow, and a few other uncommon configs.&lt;/P&gt;</description>
      <pubDate>Tue, 09 Dec 2025 14:11:26 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141510#M51746</guid>
      <dc:creator>dvd_lg_bricks</dc:creator>
      <dc:date>2025-12-09T14:11:26Z</dc:date>
    </item>
    <item>
      <title>Re: Questions About Workers and Executors Configuration in Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141512#M51748</link>
      <description>&lt;P&gt;perhaps you are looking for this -&amp;nbsp;&lt;A href="https://docs.databricks.com/aws/en/spark/conf" target="_blank"&gt;https://docs.databricks.com/aws/en/spark/conf&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 09 Dec 2025 14:29:53 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141512#M51748</guid>
      <dc:creator>Raman_Unifeye</dc:creator>
      <dc:date>2025-12-09T14:29:53Z</dc:date>
    </item>
    <item>
      <title>Re: Questions About Workers and Executors Configuration in Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141744#M51812</link>
      <description>&lt;P&gt;Hello again,&lt;/P&gt;&lt;P&gt;I have a couple of questions regarding Databricks cluster configuration and best practices.&lt;/P&gt;&lt;P&gt;Are there any recommended best practices or guidelines for configuring Databricks clusters (e.g. sizing, cores per executor, memory settings, etc.) depending on the workload?&lt;/P&gt;&lt;P&gt;In on-premise Spark deployments, it is sometimes recommended to leave a certain number of CPU cores or a percentage of CPU/memory reserved for the operating system and the JVM (for example, not allocating 100% of resources to Spark executors).&lt;BR /&gt;Is there an equivalent recommendation or consideration in Databricks-managed environments, or is resource management fully handled by the platform?&lt;/P&gt;&lt;P&gt;Thanks in advance for your help.&lt;/P&gt;</description>
      <pubDate>Fri, 12 Dec 2025 11:21:36 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141744#M51812</guid>
      <dc:creator>dvd_lg_bricks</dc:creator>
      <dc:date>2025-12-12T11:21:36Z</dc:date>
    </item>
    <item>
      <title>Re: Questions About Workers and Executors Configuration in Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141767#M51815</link>
      <description>&lt;P&gt;Resource management for a cluster is fully handled by Databricks. Ideally, one would be more focused on the cluster sizing based on their workload type and set the autoscaling parameters for the cost/load balancing.&lt;/P&gt;&lt;P&gt;In our various recent implementation, we are moving more towards serverless to get away with the management overhead. Obviously, serverless do not fit for every use-case, so still need to work on hybrid method, having serverless-first approach.&lt;/P&gt;</description>
      <pubDate>Fri, 12 Dec 2025 15:23:58 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141767#M51815</guid>
      <dc:creator>Raman_Unifeye</dc:creator>
      <dc:date>2025-12-12T15:23:58Z</dc:date>
    </item>
    <item>
      <title>Re: Questions About Workers and Executors Configuration in Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141768#M51816</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/199978"&gt;@dvd_lg_bricks&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;There's a section about cluster configuration and tuning in&amp;nbsp;&lt;STRONG&gt;Comprehensive Guide to Optimize Databricks, Spark and Delta Lake Workloads:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.databricks.com/discover/pages/optimize-data-workloads-guide#databricks-cluster" target="_blank"&gt;Comprehensive Guide to Optimize Data Workloads | Databricks&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 12 Dec 2025 15:40:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141768#M51816</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2025-12-12T15:40:27Z</dc:date>
    </item>
    <item>
      <title>Re: Questions About Workers and Executors Configuration in Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141769#M51817</link>
      <description>&lt;P&gt;Your Databricks question about workers versus executors. Many teams encounter the same sizing and configuration issues when evaluating a migration. At Kanerika, we help companies plan cluster architecture, optimize Spark workloads, and avoid overspend during the move. If you want, I can share a quick sizing framework we use with clients. Would you be open to a brief 15-minute exchange next week?&lt;/P&gt;</description>
      <pubDate>Fri, 12 Dec 2025 16:01:08 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/questions-about-workers-and-executors-configuration-in/m-p/141769#M51817</guid>
      <dc:creator>Abeshek</dc:creator>
      <dc:date>2025-12-12T16:01:08Z</dc:date>
    </item>
  </channel>
</rss>

