<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How can the shared memory size (/dev/shm) be increased on databricks worker nodes with custom do in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/how-can-the-shared-memory-size-dev-shm-be-increased-on/m-p/81605#M36358</link>
    <description>&lt;P&gt;Hey folks, any follow-up on this, or alternative solution? thanks&lt;/P&gt;</description>
    <pubDate>Fri, 02 Aug 2024 09:55:13 GMT</pubDate>
    <dc:creator>Hugh_Ku</dc:creator>
    <dc:date>2024-08-02T09:55:13Z</dc:date>
    <item>
      <title>How can the shared memory size (/dev/shm) be increased on databricks worker nodes with custom docker images?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-can-the-shared-memory-size-dev-shm-be-increased-on/m-p/12207#M7055</link>
      <description>&lt;P&gt;PyTorch uses shared memory to efficiently share tensors between its dataloader workers and its main process. However in a docker container the default size of the shared memory (a tmpfs file system mounted at /dev/shm) is 64MB, which is too small to use to share image tensor batches. This means that when using a custom docker image on a databricks cluster it is not possible to use PyTorch with multiple dataloaders. We can fix this by setting the `--shm-size` or `--ipc=host` args on `docker run`  - how can this be set on a databricks cluster?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Note that this doesn't affect the default databricks runtime it looks like that is using the linux default of making half the physical RAM available to /dev/shm  - 6.9GB on the Standard_DS3_v2 node I tested.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;To reproduce: start a cluster using a custom docker image, run `df -h /dev/shm` in a notebook.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks in advance!&lt;/P&gt;</description>
      <pubDate>Thu, 28 Oct 2021 09:59:06 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-can-the-shared-memory-size-dev-shm-be-increased-on/m-p/12207#M7055</guid>
      <dc:creator>Alex_Persin</dc:creator>
      <dc:date>2021-10-28T09:59:06Z</dc:date>
    </item>
    <item>
      <title>Re: How can the shared memory size (/dev/shm) be increased on databricks worker nodes with custom docker images?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-can-the-shared-memory-size-dev-shm-be-increased-on/m-p/12208#M7056</link>
      <description>&lt;P&gt;Also interested in increasing shared memory for use with &lt;A href="https://www.ray.io/docs/libraries" alt="https://www.ray.io/docs/libraries" target="_blank"&gt;ray&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 19 Jan 2022 15:25:46 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-can-the-shared-memory-size-dev-shm-be-increased-on/m-p/12208#M7056</guid>
      <dc:creator>mstuder</dc:creator>
      <dc:date>2022-01-19T15:25:46Z</dc:date>
    </item>
    <item>
      <title>Re: How can the shared memory size (/dev/shm) be increased on databricks worker nodes with custom docker images?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-can-the-shared-memory-size-dev-shm-be-increased-on/m-p/12209#M7057</link>
      <description>&lt;P&gt;We spoke to DataBricks about this and they are working on it. At the beginning of the month they said it should be available on Jan 17th but I'm not sure of the status now, we ended up moving this workload off of the platform.   &lt;/P&gt;</description>
      <pubDate>Thu, 20 Jan 2022 13:17:08 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-can-the-shared-memory-size-dev-shm-be-increased-on/m-p/12209#M7057</guid>
      <dc:creator>Alex_Persin</dc:creator>
      <dc:date>2022-01-20T13:17:08Z</dc:date>
    </item>
    <item>
      <title>Re: How can the shared memory size (/dev/shm) be increased on databricks worker nodes with custom do</title>
      <link>https://community.databricks.com/t5/data-engineering/how-can-the-shared-memory-size-dev-shm-be-increased-on/m-p/81605#M36358</link>
      <description>&lt;P&gt;Hey folks, any follow-up on this, or alternative solution? thanks&lt;/P&gt;</description>
      <pubDate>Fri, 02 Aug 2024 09:55:13 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-can-the-shared-memory-size-dev-shm-be-increased-on/m-p/81605#M36358</guid>
      <dc:creator>Hugh_Ku</dc:creator>
      <dc:date>2024-08-02T09:55:13Z</dc:date>
    </item>
    <item>
      <title>Re: How can the shared memory size (/dev/shm) be increased on databricks worker nodes with custom do</title>
      <link>https://community.databricks.com/t5/data-engineering/how-can-the-shared-memory-size-dev-shm-be-increased-on/m-p/91277#M38127</link>
      <description>&lt;P&gt;Recently stumbled on this problem. It seems like it basically makes impossible usage of compute with custom docker images for any pytorch-based real life computer vision ML experiments. Which is unfortunate. +1 for requesting followup and possible alternative solutions! Thank you!&lt;/P&gt;</description>
      <pubDate>Sat, 21 Sep 2024 10:12:50 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-can-the-shared-memory-size-dev-shm-be-increased-on/m-p/91277#M38127</guid>
      <dc:creator>OxFF</dc:creator>
      <dc:date>2024-09-21T10:12:50Z</dc:date>
    </item>
    <item>
      <title>Re: How can the shared memory size (/dev/shm) be increased on databricks worker nodes with custom do</title>
      <link>https://community.databricks.com/t5/data-engineering/how-can-the-shared-memory-size-dev-shm-be-increased-on/m-p/105864#M42292</link>
      <description>&lt;P&gt;bump this one. interested in the topic too.&lt;BR /&gt;is there a known solution yet?&lt;/P&gt;</description>
      <pubDate>Thu, 16 Jan 2025 10:10:46 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-can-the-shared-memory-size-dev-shm-be-increased-on/m-p/105864#M42292</guid>
      <dc:creator>sashme</dc:creator>
      <dc:date>2025-01-16T10:10:46Z</dc:date>
    </item>
    <item>
      <title>Re: How can the shared memory size (/dev/shm) be increased on databricks worker nodes with custom do</title>
      <link>https://community.databricks.com/t5/data-engineering/how-can-the-shared-memory-size-dev-shm-be-increased-on/m-p/114980#M45005</link>
      <description>&lt;P&gt;Bump again... does anyone have a solution for this?&lt;/P&gt;</description>
      <pubDate>Wed, 09 Apr 2025 13:54:01 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-can-the-shared-memory-size-dev-shm-be-increased-on/m-p/114980#M45005</guid>
      <dc:creator>stevewb</dc:creator>
      <dc:date>2025-04-09T13:54:01Z</dc:date>
    </item>
  </channel>
</rss>

