<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Data engineering professional exam in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/data-engineering-professional-exam/m-p/87833#M37451</link>
    <description>&lt;P&gt;Each configuration below is identical in that each cluster has 400 GB total of RAM, 160 total cores, and only one Executor per VM.&lt;/P&gt;&lt;P&gt;Given an extremely long-running job for which completion must be guaranteed, which cluster configuration will be able to guarantee completion of the job in light of one or more VM failures?&lt;/P&gt;&lt;P&gt;A. Total VMs: 8, 50 GB per Executor, 20 Cores / Executor&lt;BR /&gt;B. Total VMs: 16, 25 GB per Executor, 10 Cores / Executor&lt;BR /&gt;C. Total VMs: 1, 400 GB per Executor, 160 Cores / Executor&lt;BR /&gt;D. Total VMs: 4, 100 GB per Executor, 40 Cores / Executor&lt;BR /&gt;E. Total VMs: 2, 200 GB per Executor, 80 Cores / Executor&lt;/P&gt;</description>
    <pubDate>Tue, 03 Sep 2024 10:07:18 GMT</pubDate>
    <dc:creator>anshi_t_k</dc:creator>
    <dc:date>2024-09-03T10:07:18Z</dc:date>
    <item>
      <title>Data engineering professional exam</title>
      <link>https://community.databricks.com/t5/data-engineering/data-engineering-professional-exam/m-p/87833#M37451</link>
      <description>&lt;P&gt;Each configuration below is identical in that each cluster has 400 GB total of RAM, 160 total cores, and only one Executor per VM.&lt;/P&gt;&lt;P&gt;Given an extremely long-running job for which completion must be guaranteed, which cluster configuration will be able to guarantee completion of the job in light of one or more VM failures?&lt;/P&gt;&lt;P&gt;A. Total VMs: 8, 50 GB per Executor, 20 Cores / Executor&lt;BR /&gt;B. Total VMs: 16, 25 GB per Executor, 10 Cores / Executor&lt;BR /&gt;C. Total VMs: 1, 400 GB per Executor, 160 Cores / Executor&lt;BR /&gt;D. Total VMs: 4, 100 GB per Executor, 40 Cores / Executor&lt;BR /&gt;E. Total VMs: 2, 200 GB per Executor, 80 Cores / Executor&lt;/P&gt;</description>
      <pubDate>Tue, 03 Sep 2024 10:07:18 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-engineering-professional-exam/m-p/87833#M37451</guid>
      <dc:creator>anshi_t_k</dc:creator>
      <dc:date>2024-09-03T10:07:18Z</dc:date>
    </item>
    <item>
      <title>Re: Data engineering professional exam</title>
      <link>https://community.databricks.com/t5/data-engineering/data-engineering-professional-exam/m-p/87837#M37452</link>
      <description>&lt;P&gt;I am confused between options B and D, because different sites give different answers. Can anyone clarify which one is correct?&lt;/P&gt;</description>
      <pubDate>Tue, 03 Sep 2024 10:09:17 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-engineering-professional-exam/m-p/87837#M37452</guid>
      <dc:creator>anshi_t_k</dc:creator>
      <dc:date>2024-09-03T10:09:17Z</dc:date>
    </item>
    <item>
      <title>Re: Data engineering professional exam</title>
      <link>https://community.databricks.com/t5/data-engineering/data-engineering-professional-exam/m-p/87886#M37455</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/111883"&gt;@anshi_t_k&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;The key consideration here is fault tolerance.&lt;BR /&gt;How do you protect against a VM failure? By having more VMs, so that the impact of any single VM failure is as small as possible.&lt;BR /&gt;&lt;BR /&gt;For example, with answer C, a VM crash means losing 1/1, so 100% of capacity: no fault tolerance.&lt;BR /&gt;With answer B, a VM crash means losing 1/16, so only 6.25% of capacity.&lt;BR /&gt;With the other answer you mentioned, D, a VM crash means losing 1/4, so 25% of capacity.&lt;BR /&gt;&lt;BR /&gt;The answer is B:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;It has the highest number of VMs, which spreads the workload more evenly.&lt;/LI&gt;&lt;LI&gt;The impact of any single VM failure is minimal, with only 6.25% of the total capacity lost per VM failure.&lt;/LI&gt;&lt;LI&gt;This configuration provides the best fault tolerance, as the cluster can sustain multiple VM failures and still continue to operate effectively, ensuring that the job can complete.&lt;/LI&gt;&lt;/UL&gt;</description>
      <pubDate>Tue, 03 Sep 2024 10:32:26 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-engineering-professional-exam/m-p/87886#M37455</guid>
      <dc:creator>filipniziol</dc:creator>
      <dc:date>2024-09-03T10:32:26Z</dc:date>
    </item>
    <item>
      <title>Re: Data engineering professional exam</title>
      <link>https://community.databricks.com/t5/data-engineering/data-engineering-professional-exam/m-p/87967#M37465</link>
      <description>&lt;P&gt;Thank you for the information.&lt;/P&gt;</description>
      <pubDate>Tue, 03 Sep 2024 11:31:38 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-engineering-professional-exam/m-p/87967#M37465</guid>
      <dc:creator>anshi_t_k</dc:creator>
      <dc:date>2024-09-03T11:31:38Z</dc:date>
    </item>
  </channel>
</rss>

