<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Which process is eating up my driver memory? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104892#M41926</link>
    <description>&lt;P&gt;Non-interactive commands (like 'free') can also be run from notebooks using the %sh magic command.&lt;/P&gt;</description>
    <pubDate>Thu, 09 Jan 2025 11:04:15 GMT</pubDate>
    <dc:creator>-werners-</dc:creator>
    <dc:date>2025-01-09T11:04:15Z</dc:date>
    <item>
      <title>Which process is eating up my driver memory?</title>
      <link>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104879#M41917</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;We're running DBR 14.3 on a shared multi-node cluster.&lt;/P&gt;&lt;P&gt;When checking the metrics of the driver, I see that the &lt;EM&gt;Memory utilization&lt;/EM&gt; and &lt;EM&gt;Memory swap utilization&lt;/EM&gt; are increasing a lot and almost never decrease, even when no processes are running anymore.&lt;/P&gt;&lt;P&gt;It seems that some processes are allocating memory but never releasing it.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Is there a way to detect which processes are allocating memory on the driver node?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Is there a way to detect which processes are causing the &lt;EM&gt;memory swap utilization&lt;/EM&gt; on the driver node?&lt;/STRONG&gt; I know swap usage is the result of memory pressure, but it does not seem to be released even after the node crashes and restarts due to out-of-memory (OOM).&lt;/P&gt;</description>
      <pubDate>Thu, 09 Jan 2025 10:03:59 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104879#M41917</guid>
      <dc:creator>TjommeV-Vlaio</dc:creator>
      <dc:date>2025-01-09T10:03:59Z</dc:date>
    </item>
    <item>
      <title>Re: Which process is eating up my driver memory?</title>
      <link>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104880#M41918</link>
      <description>&lt;P&gt;The Spark UI provides detailed information about memory usage. Open the Spark UI and navigate to the "Executors" tab, which shows the memory usage of the driver and executors. This can help identify if specific tasks or stages are consuming excessive memory.&lt;/P&gt;</description>
      <pubDate>Thu, 09 Jan 2025 10:05:31 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104880#M41918</guid>
      <dc:creator>Walter_C</dc:creator>
      <dc:date>2025-01-09T10:05:31Z</dc:date>
    </item>
    <item>
      <title>Re: Which process is eating up my driver memory?</title>
      <link>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104881#M41919</link>
      <description>&lt;P&gt;Hi Walter,&lt;/P&gt;&lt;P&gt;This overview shows the consumption by node, not by process.&lt;/P&gt;&lt;P&gt;The thread dump and heap histogram do not seem to provide useful information (for my issue).&lt;/P&gt;</description>
      <pubDate>Thu, 09 Jan 2025 10:19:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104881#M41919</guid>
      <dc:creator>TjommeV-Vlaio</dc:creator>
      <dc:date>2025-01-09T10:19:27Z</dc:date>
    </item>
    <item>
      <title>Re: Which process is eating up my driver memory?</title>
      <link>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104882#M41920</link>
      <description>&lt;P&gt;To add to this, in case the Spark UI does not help:&lt;BR /&gt;Try to SSH into the driver and check in 'top' (or htop if it is installed) which processes use the memory.&lt;BR /&gt;Check the RES, VIRT and SWAP columns (and COMMAND to see which program).&lt;BR /&gt;GC should free up memory, but perhaps for some reason memory does not get released.&lt;/P&gt;</description>
      <pubDate>Thu, 09 Jan 2025 10:19:58 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104882#M41920</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2025-01-09T10:19:58Z</dc:date>
    </item>
    <item>
      <title>Re: Which process is eating up my driver memory?</title>
      <link>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104887#M41922</link>
      <description>&lt;P&gt;Do I do this from Databricks or from Azure? To do it from Azure, I'm missing the credentials to connect.&lt;/P&gt;&lt;P&gt;From Databricks, I don't know how to do this.&lt;/P&gt;</description>
      <pubDate>Thu, 09 Jan 2025 10:47:25 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104887#M41922</guid>
      <dc:creator>TjommeV-Vlaio</dc:creator>
      <dc:date>2025-01-09T10:47:25Z</dc:date>
    </item>
    <item>
      <title>Re: Which process is eating up my driver memory?</title>
      <link>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104891#M41925</link>
      <description>&lt;P&gt;There is something called the 'web terminal' that you can enable in the settings.&lt;BR /&gt;This will open a terminal on the driver (I am pretty sure it is the driver and not a worker).&lt;BR /&gt;From there you can run top/htop etc. like in a normal Linux shell.&lt;BR /&gt;If you are not comfortable with Linux, you might want to ask someone who is.&lt;/P&gt;</description>
      <pubDate>Thu, 09 Jan 2025 11:03:37 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104891#M41925</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2025-01-09T11:03:37Z</dc:date>
    </item>
    <item>
      <title>Re: Which process is eating up my driver memory?</title>
      <link>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104892#M41926</link>
      <description>&lt;P&gt;Non-interactive commands (like 'free') can also be run from notebooks using the %sh magic command.&lt;/P&gt;</description>
      <pubDate>Thu, 09 Jan 2025 11:04:15 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104892#M41926</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2025-01-09T11:04:15Z</dc:date>
    </item>
    <item>
      <title>Re: Which process is eating up my driver memory?</title>
      <link>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104921#M41937</link>
      <description>&lt;P&gt;Found the web terminal doc: &lt;A href="https://learn.microsoft.com/en-us/azure/databricks/compute/web-terminal" target="_blank"&gt;Run shell commands in Azure Databricks web terminal - Azure Databricks | Microsoft Learn&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Unfortunately, we're running shared clusters on DBR 14.3, so there is no web terminal support.&lt;/P&gt;&lt;P&gt;The output of running &lt;EM&gt;%sh htop&lt;/EM&gt; from a notebook does not match the memory usage shown in the Metrics tab.&lt;/P&gt;</description>
      <pubDate>Thu, 09 Jan 2025 12:41:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104921#M41937</guid>
      <dc:creator>TjommeV-Vlaio</dc:creator>
      <dc:date>2025-01-09T12:41:27Z</dc:date>
    </item>
    <item>
      <title>Re: Which process is eating up my driver memory?</title>
      <link>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104930#M41940</link>
      <description>&lt;P&gt;htop in a notebook looks kind of wonky, so I would not use that.&lt;BR /&gt;free gives you a general overview, so with free -h or free -m you can also see some info.&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.howtogeek.com/659529/how-to-check-memory-usage-from-the-linux-terminal/" target="_blank"&gt;https://www.howtogeek.com/659529/how-to-check-memory-usage-from-the-linux-terminal/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Also, trust OS commands over the metrics. Nothing knows better what's going on in an OS than the OS itself.&lt;/P&gt;&lt;P&gt;But since it is a shared interactive cluster: how long has it been since you restarted it? Is it always the same job that gives issues? Are you sure nobody is running anything?&lt;BR /&gt;I'd check the same workload on a single user cluster and see what happens. Shared clusters do have some limitations.&lt;/P&gt;</description>
      <pubDate>Thu, 09 Jan 2025 12:52:53 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104930#M41940</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2025-01-09T12:52:53Z</dc:date>
    </item>
    <item>
      <title>Re: Which process is eating up my driver memory?</title>
      <link>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104946#M41948</link>
      <description>&lt;P&gt;I'll check this out.&lt;/P&gt;&lt;P&gt;My goal is to see which notebooks/processes are consuming large amounts of driver memory (without releasing it), as this might indicate a memory leak or that the code contains some non-parallel logic that needs to be resolved.&lt;/P&gt;</description>
      <pubDate>Thu, 09 Jan 2025 13:19:19 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104946#M41948</guid>
      <dc:creator>TjommeV-Vlaio</dc:creator>
      <dc:date>2025-01-09T13:19:19Z</dc:date>
    </item>
    <item>
      <title>Re: Which process is eating up my driver memory?</title>
      <link>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104948#M41949</link>
      <description>&lt;P&gt;At the OS level you will not see notebooks; you will see the memory consumption of the Spark application (so this covers all notebooks).&lt;BR /&gt;For that there is the Spark UI.&lt;/P&gt;&lt;P&gt;I'd look for collect() and broadcast() statements, Python code outside of Spark, tons of graphics/docs in notebooks (makes the notebook heavy), loops over DataFrame records, etc. It all exists &lt;span class="lia-unicode-emoji" title=":disappointed_face:"&gt;😞&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 09 Jan 2025 13:36:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/which-process-is-eating-up-my-driver-memory/m-p/104948#M41949</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2025-01-09T13:36:54Z</dc:date>
    </item>
  </channel>
</rss>

