<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Databricks drop and remove s3 storage files safely in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/databricks-drop-and-remove-s3-storage-files-safely/m-p/34568#M25310</link>
    <description>&lt;P&gt;After dropping a delta table using DROP command in databricks, is there a way to drop the s3 files in databricks without using rm command? Looking for a solution where junior developers can safely drop a table wihout messing with the rm command where they may cause accidental data loss using recursive option. &lt;/P&gt;&lt;P&gt;thanks&lt;/P&gt;&lt;P&gt;Alina.&lt;/P&gt;</description>
    <pubDate>Mon, 22 Nov 2021 17:26:11 GMT</pubDate>
    <dc:creator>Abela</dc:creator>
    <dc:date>2021-11-22T17:26:11Z</dc:date>
    <item>
      <title>Databricks drop and remove s3 storage files safely</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-drop-and-remove-s3-storage-files-safely/m-p/34568#M25310</link>
      <description>&lt;P&gt;After dropping a delta table using DROP command in databricks, is there a way to drop the s3 files in databricks without using rm command? Looking for a solution where junior developers can safely drop a table wihout messing with the rm command where they may cause accidental data loss using recursive option. &lt;/P&gt;&lt;P&gt;thanks&lt;/P&gt;&lt;P&gt;Alina.&lt;/P&gt;</description>
      <pubDate>Mon, 22 Nov 2021 17:26:11 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-drop-and-remove-s3-storage-files-safely/m-p/34568#M25310</guid>
      <dc:creator>Abela</dc:creator>
      <dc:date>2021-11-22T17:26:11Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks drop and remove s3 storage files safely</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-drop-and-remove-s3-storage-files-safely/m-p/34569#M25311</link>
      <description>&lt;P&gt;official way is that before DROP:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Run DELETE FROM:&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;               DELETE FROM events&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Run VACUUM with an interval of zero:&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;               VACUUM events RETAIN 0 HOURS&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I agree that there could be some DEEP DROP &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Alternatively not in SQL but in python you could write custom  class/function to do that and then preinstall it on clusters so people would use some CleanTable(TableName) to make data validation and then delete+vacuum+drop+rm&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 22 Nov 2021 18:41:15 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-drop-and-remove-s3-storage-files-safely/m-p/34569#M25311</guid>
      <dc:creator>Hubert-Dudek</dc:creator>
      <dc:date>2021-11-22T18:41:15Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks drop and remove s3 storage files safely</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-drop-and-remove-s3-storage-files-safely/m-p/34570#M25312</link>
      <description>&lt;P&gt;Hi @Alina Bella​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Like @Hubert Dudek​&amp;nbsp;mentioned, we have a best practice guide for dropping managed tables. You can find the docs &lt;A href="https://kb.databricks.com/delta/drop-delta-table.html" alt="https://kb.databricks.com/delta/drop-delta-table.html" target="_blank"&gt;here&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 22 Nov 2021 19:51:12 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-drop-and-remove-s3-storage-files-safely/m-p/34570#M25312</guid>
      <dc:creator>jose_gonzalez</dc:creator>
      <dc:date>2021-11-22T19:51:12Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks drop and remove s3 storage files safely</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-drop-and-remove-s3-storage-files-safely/m-p/34571#M25313</link>
      <description>&lt;P&gt;Hi @Alina Bella​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;If @Hubert Dudek​&amp;nbsp;''s answer solved the issue, would you be happy to mark their answer as best? That will help others find the solution more easily in the future.&lt;/P&gt;</description>
      <pubDate>Mon, 29 Nov 2021 19:14:17 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-drop-and-remove-s3-storage-files-safely/m-p/34571#M25313</guid>
      <dc:creator>jose_gonzalez</dc:creator>
      <dc:date>2021-11-29T19:14:17Z</dc:date>
    </item>
  </channel>
</rss>

