<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Dela Table and history in Warehousing &amp; Analytics</title>
    <link>https://community.databricks.com/t5/warehousing-analytics/dela-table-and-history/m-p/137301#M2308</link>
    <description>&lt;P class=""&gt;Hi &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/125237"&gt;@patilsuhasv&lt;/a&gt;,&lt;/P&gt;&lt;P class=""&gt;Maintaining &lt;SPAN class=""&gt;&lt;STRONG&gt;7 years of transactional data&lt;/STRONG&gt;&lt;/SPAN&gt; in a Delta Lake table is technically possible, but it requires careful configuration — and it’s important to understand the recent &lt;SPAN class=""&gt;&lt;STRONG&gt;changes Databricks announced (effective December 2025)&lt;/STRONG&gt;&lt;/SPAN&gt; regarding &lt;SPAN class=""&gt;&lt;STRONG&gt;time travel and VACUUM&lt;/STRONG&gt;&lt;/SPAN&gt;.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P class=""&gt;Databricks is aligning &lt;SPAN class=""&gt;&lt;STRONG&gt;time travel retention&lt;/STRONG&gt;&lt;/SPAN&gt; with table-level retention settings to ensure consistent and deterministic behavior.&lt;/P&gt;&lt;P class=""&gt;From that date onward:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P class=""&gt;&lt;STRONG&gt;Time travel queries&lt;/STRONG&gt;&lt;SPAN class=""&gt; (SELECT … AS OF, RESTORE, CLONE, CDC) will only be allowed &lt;/SPAN&gt;&lt;STRONG&gt;within the period defined by delta.deletedFileRetentionDuration&lt;/STRONG&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;The &lt;SPAN class=""&gt;&lt;STRONG&gt;retention argument in VACUUM will be ignored&lt;/STRONG&gt;&lt;/SPAN&gt; (except when set to &lt;SPAN class=""&gt;0 hours&lt;/SPAN&gt;, to fully purge history).&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;&lt;STRONG&gt;delta.logRetentionDuration must be greater than or equal to delta.deletedFileRetentionDuration.&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;So that configuration will &lt;SPAN class=""&gt;&lt;STRONG&gt;not be allowed&lt;/STRONG&gt;&lt;/SPAN&gt; after the upcoming Databricks update.&lt;/P&gt;&lt;P class=""&gt;delta.logRetentionDuration&lt;SPAN class=""&gt; must always be &lt;/SPAN&gt;&lt;SPAN class=""&gt;&lt;STRONG&gt;equal to or greater than&lt;/STRONG&gt;&lt;/SPAN&gt; delta.deletedFileRetentionDuration&lt;SPAN class=""&gt;.&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;As per the official Databricks documentation:&lt;/P&gt;&lt;P class=""&gt;&lt;A href="https://docs.databricks.com/aws/en/delta/history" target="_blank" rel="noopener"&gt;Delta table history — Databricks Docs&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;STRONG&gt;&lt;EM&gt;"Databricks does not recommend using Delta Lake table history as a long-term backup solution for data archival"&lt;/EM&gt;&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;&lt;EM&gt;"Databricks recommends using only the past &lt;SPAN class=""&gt;7 days for time travel operations&lt;/SPAN&gt;, unless you have set both data and log retention configurations to a larger value.”&lt;BR /&gt;&lt;BR /&gt;&lt;/EM&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;If your goal is &lt;/SPAN&gt;&lt;STRONG&gt;regulatory data retention (e.g., 7 years)&lt;/STRONG&gt;&lt;SPAN class=""&gt;, you should:&lt;/SPAN&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P class=""&gt;Keep your &lt;SPAN class=""&gt;&lt;STRONG&gt;operational Delta table&lt;/STRONG&gt;&lt;/SPAN&gt; with a short retention (e.g., 7–30 days).&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;Periodically &lt;SPAN class=""&gt;&lt;STRONG&gt;snapshot or archive&lt;/STRONG&gt;&lt;/SPAN&gt; the table (e.g., by writing it to versioned Parquet files, or exporting to object storage such as S3 Glacier).&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;Use those archives for compliance or recovery purposes — not as active Delta history.&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;Hope this helps :),&lt;BR /&gt;&lt;BR /&gt;Isi&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Sun, 02 Nov 2025 16:17:57 GMT</pubDate>
    <dc:creator>Isi</dc:creator>
    <dc:date>2025-11-02T16:17:57Z</dc:date>
    <item>
      <title>Dela Table and history</title>
      <link>https://community.databricks.com/t5/warehousing-analytics/dela-table-and-history/m-p/110710#M1901</link>
      <description>&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;How can I maintain 7 years of transactional data in delta table? Can I have log retention of 7 days, but data retention of 7 years?&lt;/P&gt;&lt;P&gt;Appreciate your response.&lt;/P&gt;&lt;P&gt;Thanks and regards&amp;nbsp;&lt;/P&gt;&lt;P&gt;Suhas&lt;/P&gt;</description>
      <pubDate>Thu, 20 Feb 2025 09:02:52 GMT</pubDate>
      <guid>https://community.databricks.com/t5/warehousing-analytics/dela-table-and-history/m-p/110710#M1901</guid>
      <dc:creator>patilsuhasv</dc:creator>
      <dc:date>2025-02-20T09:02:52Z</dc:date>
    </item>
    <item>
      <title>Re: Dela Table and history</title>
      <link>https://community.databricks.com/t5/warehousing-analytics/dela-table-and-history/m-p/137263#M2307</link>
      <description>&lt;P&gt;Hey&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/125237"&gt;@patilsuhasv&lt;/a&gt;&amp;nbsp;,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Yes, you absolutely can maintain 7 years of transactional data in a Delta table while having only 7 days of log retention. These are two separate concepts that work independently.&lt;/P&gt;
&lt;H2&gt;Understanding the Difference&lt;/H2&gt;
&lt;P&gt;&lt;STRONG&gt;Log Retention&lt;/STRONG&gt;&amp;nbsp;controls how long Delta Lake keeps the transaction log history (used for time travel and versioning capabilities). This doesn't delete your actual data.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Data Retention&lt;/STRONG&gt;&amp;nbsp;refers to how long you keep the actual data records in your table. Your current data remains in the table regardless of log retention settings.&lt;/P&gt;
&lt;H2&gt;Key Configuration&lt;/H2&gt;
&lt;P&gt;There are two main properties to understand:&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;delta.logRetentionDuration&lt;/STRONG&gt;: Controls transaction log history (default 30 days)&lt;BR /&gt;- This affects your ability to time travel to previous versions&lt;BR /&gt;- Setting this to 7 days means you can only query versions from the last 7 days&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;delta.deletedFileRetentionDuration&lt;/STRONG&gt;: Controls when VACUUM removes old data files (default 7 days)&lt;BR /&gt;- This only affects files that are no longer part of the current table (due to updates, deletes, or merges)&lt;BR /&gt;- Does NOT delete your current active data&lt;/P&gt;
&lt;H2&gt;Solution for Your Use Case&lt;/H2&gt;
&lt;P&gt;To keep 7 years of data with 7 days of log retention:&lt;/P&gt;
&lt;P&gt;```sql&lt;BR /&gt;-- Set log retention to 7 days&lt;BR /&gt;ALTER TABLE your_table SET TBLPROPERTIES (&lt;BR /&gt;'delta.logRetentionDuration' = 'interval 7 days'&lt;BR /&gt;);&lt;BR /&gt;```&lt;/P&gt;
&lt;P&gt;Your current transactional data will remain in the table indefinitely. The 7-day log retention simply means:&lt;BR /&gt;- You can only time travel back 7 days&lt;BR /&gt;- Transaction logs older than 7 days will be cleaned up automatically&lt;/P&gt;
&lt;H2&gt;Important Considerations&lt;/H2&gt;
&lt;P&gt;&lt;STRONG&gt;What Gets Kept&lt;/STRONG&gt;:&lt;BR /&gt;- All current data in your table (regardless of age) stays&lt;BR /&gt;- Only historical versions beyond 7 days become inaccessible for time travel&lt;BR /&gt;- Active records from 7 years ago remain queryable as part of the current table state&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;What Gets Cleaned&lt;/STRONG&gt;:&lt;BR /&gt;- Transaction logs older than 7 days&lt;BR /&gt;- Old data file versions (if you run VACUUM) that are no longer part of the current table&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Best Practices&lt;/STRONG&gt;:&lt;BR /&gt;- Don't run VACUUM frequently if you need longer time travel capabilities&lt;BR /&gt;- If you never update/delete records, your data files stay intact even with short log retention&lt;BR /&gt;- Consider your compliance and audit requirements before reducing log retention&lt;BR /&gt;- Storage costs are primarily driven by data volume, not log retention&lt;/P&gt;
&lt;H2&gt;Example Scenario&lt;/H2&gt;
&lt;P&gt;If you have a transaction table with records from 2018-2025:&lt;BR /&gt;- With 7 days log retention: All transactions from 2018-2025 remain queryable in the current table&lt;BR /&gt;- You just can't time travel to see what the table looked like 30 days ago&lt;BR /&gt;- The actual transaction records themselves are NOT deleted&lt;/P&gt;
&lt;P&gt;The log retention setting is about versioning history, not data lifecycle management. Your 7-year-old transactions will remain accessible in your table's current state.&lt;/P&gt;
&lt;P&gt;Hope this helps, Louis.&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 01 Nov 2025 19:56:21 GMT</pubDate>
      <guid>https://community.databricks.com/t5/warehousing-analytics/dela-table-and-history/m-p/137263#M2307</guid>
      <dc:creator>Louis_Frolio</dc:creator>
      <dc:date>2025-11-01T19:56:21Z</dc:date>
    </item>
    <item>
      <title>Re: Dela Table and history</title>
      <link>https://community.databricks.com/t5/warehousing-analytics/dela-table-and-history/m-p/137301#M2308</link>
      <description>&lt;P class=""&gt;Hi &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/125237"&gt;@patilsuhasv&lt;/a&gt;,&lt;/P&gt;&lt;P class=""&gt;Maintaining &lt;SPAN class=""&gt;&lt;STRONG&gt;7 years of transactional data&lt;/STRONG&gt;&lt;/SPAN&gt; in a Delta Lake table is technically possible, but it requires careful configuration — and it’s important to understand the recent &lt;SPAN class=""&gt;&lt;STRONG&gt;changes Databricks announced (effective December 2025)&lt;/STRONG&gt;&lt;/SPAN&gt; regarding &lt;SPAN class=""&gt;&lt;STRONG&gt;time travel and VACUUM&lt;/STRONG&gt;&lt;/SPAN&gt;.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P class=""&gt;Databricks is aligning &lt;SPAN class=""&gt;&lt;STRONG&gt;time travel retention&lt;/STRONG&gt;&lt;/SPAN&gt; with table-level retention settings to ensure consistent and deterministic behavior.&lt;/P&gt;&lt;P class=""&gt;From that date onward:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P class=""&gt;&lt;STRONG&gt;Time travel queries&lt;/STRONG&gt;&lt;SPAN class=""&gt; (SELECT … AS OF, RESTORE, CLONE, CDC) will only be allowed &lt;/SPAN&gt;&lt;STRONG&gt;within the period defined by delta.deletedFileRetentionDuration&lt;/STRONG&gt;&lt;SPAN class=""&gt;.&lt;/SPAN&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;The &lt;SPAN class=""&gt;&lt;STRONG&gt;retention argument in VACUUM will be ignored&lt;/STRONG&gt;&lt;/SPAN&gt; (except when set to &lt;SPAN class=""&gt;0 hours&lt;/SPAN&gt;, to fully purge history).&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;&lt;STRONG&gt;delta.logRetentionDuration must be greater than or equal to delta.deletedFileRetentionDuration.&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;So that configuration will &lt;SPAN class=""&gt;&lt;STRONG&gt;not be allowed&lt;/STRONG&gt;&lt;/SPAN&gt; after the upcoming Databricks update.&lt;/P&gt;&lt;P class=""&gt;delta.logRetentionDuration&lt;SPAN class=""&gt; must always be &lt;/SPAN&gt;&lt;SPAN class=""&gt;&lt;STRONG&gt;equal to or greater than&lt;/STRONG&gt;&lt;/SPAN&gt; delta.deletedFileRetentionDuration&lt;SPAN class=""&gt;.&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;As per the official Databricks documentation:&lt;/P&gt;&lt;P class=""&gt;&lt;A href="https://docs.databricks.com/aws/en/delta/history" target="_blank" rel="noopener"&gt;Delta table history — Databricks Docs&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;STRONG&gt;&lt;EM&gt;"Databricks does not recommend using Delta Lake table history as a long-term backup solution for data archival"&lt;/EM&gt;&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;&lt;EM&gt;"Databricks recommends using only the past &lt;SPAN class=""&gt;7 days for time travel operations&lt;/SPAN&gt;, unless you have set both data and log retention configurations to a larger value.”&lt;BR /&gt;&lt;BR /&gt;&lt;/EM&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;If your goal is &lt;/SPAN&gt;&lt;STRONG&gt;regulatory data retention (e.g., 7 years)&lt;/STRONG&gt;&lt;SPAN class=""&gt;, you should:&lt;/SPAN&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P class=""&gt;Keep your &lt;SPAN class=""&gt;&lt;STRONG&gt;operational Delta table&lt;/STRONG&gt;&lt;/SPAN&gt; with a short retention (e.g., 7–30 days).&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;Periodically &lt;SPAN class=""&gt;&lt;STRONG&gt;snapshot or archive&lt;/STRONG&gt;&lt;/SPAN&gt; the table (e.g., by writing it to versioned Parquet files, or exporting to object storage such as S3 Glacier).&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;Use those archives for compliance or recovery purposes — not as active Delta history.&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;Hope this helps :),&lt;BR /&gt;&lt;BR /&gt;Isi&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 02 Nov 2025 16:17:57 GMT</pubDate>
      <guid>https://community.databricks.com/t5/warehousing-analytics/dela-table-and-history/m-p/137301#M2308</guid>
      <dc:creator>Isi</dc:creator>
      <dc:date>2025-11-02T16:17:57Z</dc:date>
    </item>
  </channel>
</rss>

