<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Save file to /tmp in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/save-file-to-tmp/m-p/119402#M45861</link>
    <description>&lt;P&gt;I am experiencing the same problem. I create a file in /tmp and can verify that it exists. But when an attempt is made to open the file using pyspark, the file is not found. I noticed that the path I used to create the file is /tmp/foobar.parquet and the path being reported as not found is dbfs:/tmp/foobar.parquet.&lt;/P&gt;</description>
    <pubDate>Fri, 16 May 2025 02:10:28 GMT</pubDate>
    <dc:creator>JimBiard</dc:creator>
    <dc:date>2025-05-16T02:10:28Z</dc:date>
    <item>
      <title>Save file to /tmp</title>
      <link>https://community.databricks.com/t5/data-engineering/save-file-to-tmp/m-p/78116#M35469</link>
      <description>&lt;P&gt;Hello, I have Python code that collects data as JSON and sends it to an S3 bucket. Everything works fine, but when there is a lot of data it runs out of memory.&lt;/P&gt;&lt;P&gt;So I want to save the data locally first, for example in /tmp or dbfs:/tmp, and then send it to S3. But when saving, I get an error saying that the file or directory does not exist, as if the file is generated but cannot be found.&lt;/P&gt;&lt;P&gt;If I mount UC Volumes, then it works.&lt;/P&gt;&lt;P&gt;Are there any restrictions? I'm mounting everything via Unity Catalog, not via DBFS.&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Wed, 10 Jul 2024 14:01:11 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/save-file-to-tmp/m-p/78116#M35469</guid>
      <dc:creator>thiagoawstest</dc:creator>
      <dc:date>2024-07-10T14:01:11Z</dc:date>
    </item>
    <item>
      <title>Re: Save file to /tmp</title>
      <link>https://community.databricks.com/t5/data-engineering/save-file-to-tmp/m-p/119402#M45861</link>
      <description>&lt;P&gt;I am experiencing the same problem. I create a file in /tmp and can verify that it exists. But when an attempt is made to open the file using pyspark, the file is not found. I noticed that the path I used to create the file is /tmp/foobar.parquet and the path being reported as not found is dbfs:/tmp/foobar.parquet.&lt;/P&gt;</description>
      <pubDate>Fri, 16 May 2025 02:10:28 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/save-file-to-tmp/m-p/119402#M45861</guid>
      <dc:creator>JimBiard</dc:creator>
      <dc:date>2025-05-16T02:10:28Z</dc:date>
    </item>
    <item>
      <title>Re: Save file to /tmp</title>
      <link>https://community.databricks.com/t5/data-engineering/save-file-to-tmp/m-p/119500#M45894</link>
      <description>&lt;P&gt;I found what my problem was. I used pandas to save my Parquet file to /tmp, and pandas wrote it to the /tmp folder on the compute node's local file system. When I passed the same path to PySpark to load the file, it prepended 'dbfs:' to the path. The file wasn't in dbfs:/tmp, so the call failed. I prepended 'file:' to the path name that I passed to PySpark and the call succeeded.&lt;/P&gt;</description>
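      <content:encoded><![CDATA[<P>The fix described in this post can be sketched roughly as follows. This is a minimal illustration, not code from the thread: the helper names to_local_file_uri and read_local_parquet are hypothetical, and the spark argument is assumed to be an already running SparkSession, such as the one a Databricks notebook provides.</P>

```python
def to_local_file_uri(path: str) -> str:
    """Return a 'file:' URI for an absolute path on the node's local disk."""
    if path.startswith("file:"):
        return path  # already explicit, leave unchanged
    if not path.startswith("/"):
        raise ValueError(f"expected an absolute local path, got {path!r}")
    return "file:" + path


def read_local_parquet(spark, path: str):
    """Load a Parquet file from local disk with an existing SparkSession.

    Assumption: the file is readable on every node that takes part in the
    read, for example on a single-node cluster.
    """
    return spark.read.parquet(to_local_file_uri(path))


print(to_local_file_uri("/tmp/foobar.parquet"))  # file:/tmp/foobar.parquet
```

<P>With pandas, the write side would stay as it was, for example df.to_parquet("/tmp/foobar.parquet"), and only the path handed to PySpark gains the 'file:' prefix.</P>]]></content:encoded>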
      <pubDate>Fri, 16 May 2025 15:51:41 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/save-file-to-tmp/m-p/119500#M45894</guid>
      <dc:creator>JimBiard</dc:creator>
      <dc:date>2025-05-16T15:51:41Z</dc:date>
    </item>
  </channel>
</rss>

