<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to choose the right node type for my Databricks workload? in Get Started Discussions</title>
    <link>https://community.databricks.com/t5/get-started-discussions/how-to-choose-the-right-node-type-for-my-databricks-workload/m-p/131606#M10696</link>
    <description>&lt;P&gt;Hey all, I’m setting up a Databricks workspace to run data pipelines + ML training. I’m unsure which node types to pick (driver vs worker, instance size, memory vs CPU optimized) for different workloads. For example small ETL jobs vs large batch processing, training medium‐sized ML models vs inference workloads&lt;/P&gt;&lt;P&gt;Can someone share how to decide on node sizing / type based on workload pattern? What are trade-offs (cost, performance, scalability)?&lt;/P&gt;</description>
    <pubDate>Thu, 11 Sep 2025 04:02:47 GMT</pubDate>
    <dc:creator>Ednexllc</dc:creator>
    <dc:date>2025-09-11T04:02:47Z</dc:date>
    <item>
      <title>How to choose the right node type for my Databricks workload?</title>
      <link>https://community.databricks.com/t5/get-started-discussions/how-to-choose-the-right-node-type-for-my-databricks-workload/m-p/131606#M10696</link>
      <description>&lt;P&gt;Hey all, I’m setting up a Databricks workspace to run data pipelines + ML training. I’m unsure which node types to pick (driver vs worker, instance size, memory vs CPU optimized) for different workloads. For example small ETL jobs vs large batch processing, training medium‐sized ML models vs inference workloads&lt;/P&gt;&lt;P&gt;Can someone share how to decide on node sizing / type based on workload pattern? What are trade-offs (cost, performance, scalability)?&lt;/P&gt;</description>
      <pubDate>Thu, 11 Sep 2025 04:02:47 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/how-to-choose-the-right-node-type-for-my-databricks-workload/m-p/131606#M10696</guid>
      <dc:creator>Ednexllc</dc:creator>
      <dc:date>2025-09-11T04:02:47Z</dc:date>
    </item>
    <item>
      <title>Re: How to choose the right node type for my Databricks workload?</title>
      <link>https://community.databricks.com/t5/get-started-discussions/how-to-choose-the-right-node-type-for-my-databricks-workload/m-p/131620#M10697</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/184026"&gt;@Ednexllc&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;It really depends on many factor like workload type, size of your data, number of tables etc. You can check some recommendations given by Databricks here:&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.databricks.com/aws/en/compute/cluster-config-best-practices" target="_blank"&gt;Compute configuration recommendations | Databricks on AWS&lt;/A&gt;&lt;/P&gt;&lt;P&gt;And also here - you should find useful info in section called&amp;nbsp;Databricks Cluster Configuration and Tuning&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.databricks.com/discover/pages/optimize-data-workloads-guide#databricks-cluster" target="_blank"&gt;Comprehensive Guide to Optimize Data Workloads | Databricks&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;But for me, it’s always a process that requires some trial and error at the beginning. I try different settings, and in the end I choose the ones that handled the given workload best.&lt;/P&gt;&lt;P&gt;So, to put it simply - there's no silver bullet. You can use some guidelines, but in the end you need to test and align compute to your workload/envirionment yourself&lt;/P&gt;</description>
      <pubDate>Thu, 11 Sep 2025 06:53:42 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/how-to-choose-the-right-node-type-for-my-databricks-workload/m-p/131620#M10697</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2025-09-11T06:53:42Z</dc:date>
    </item>
  </channel>
</rss>

