<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: DeltaFileNotFoundException: [DELTA_TRUNCATED_TRANSACTION_LOG] Error in Streaming Table with Mini in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/deltafilenotfoundexception-delta-truncated-transaction-log-error/m-p/138531#M50948</link>
    <description>&lt;P&gt;I would assume it is trying to read v899 because in you read up until v898 in the last [streaming]batch and stored the state in the streaming checkpoint. Now, if you run the code again and continue the stream, it tries to pick up from the first version it has not read yet - which would be v899 and tries to construct the state of that version. If the necessary files (=Checkpoint of v890 and transactions of v890-v899) cannot be found anymore, it will throw an error like this&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;you should make sure that you run the stream at lest once within your retention period (better at least daily in your case) otherwise you will inevitably run into that error&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Tue, 11 Nov 2025 07:59:03 GMT</pubDate>
    <dc:creator>gbrueckl</dc:creator>
    <dc:date>2025-11-11T07:59:03Z</dc:date>
    <item>
      <title>DeltaFileNotFoundException: [DELTA_TRUNCATED_TRANSACTION_LOG] Error in Streaming Table with Minimal</title>
      <link>https://community.databricks.com/t5/data-engineering/deltafilenotfoundexception-delta-truncated-transaction-log-error/m-p/108876#M43166</link>
      <description>&lt;P&gt;Dear Databricks Experts,&lt;/P&gt;&lt;P&gt;I am encountering a recurring issue while working with Delta streaming tables in my system. The error message is as follows:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="minhhung0507_0-1738728278906.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/14610iDB105B6BB1B1B89D/image-size/medium?v=v2&amp;amp;px=400" role="button" title="minhhung0507_0-1738728278906.png" alt="minhhung0507_0-1738728278906.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;com.databricks.sql.transaction.tahoe.DeltaFileNotFoundException: [DELTA_TRUNCATED_TRANSACTION_LOG] gs://cimb-prod-lakehouse/bronze-layer/icoredb/dpb_revi_loan/&lt;SPAN class=""&gt;_delta&lt;/SPAN&gt;_log/&lt;SPAN class=""&gt;00000000000000000000.&lt;/SPAN&gt;json: Unable &lt;SPAN class=""&gt;to&lt;/SPAN&gt; reconstruct state &lt;SPAN class=""&gt;at&lt;/SPAN&gt; &lt;SPAN class=""&gt;version&lt;/SPAN&gt; &lt;SPAN class=""&gt;899&lt;/SPAN&gt; &lt;SPAN class=""&gt;as&lt;/SPAN&gt; &lt;SPAN class=""&gt;the&lt;/SPAN&gt; transaction &lt;SPAN class=""&gt;log&lt;/SPAN&gt; has been truncated due &lt;SPAN class=""&gt;to&lt;/SPAN&gt; manual deletion &lt;SPAN class=""&gt;or&lt;/SPAN&gt; &lt;SPAN class=""&gt;the&lt;/SPAN&gt; &lt;SPAN class=""&gt;log&lt;/SPAN&gt; retention policy (delta.logRetentionDuration=&lt;SPAN class=""&gt;3&lt;/SPAN&gt; days) &lt;SPAN class=""&gt;and&lt;/SPAN&gt; checkpoint retention policy (delta.checkpointRetentionDuration=&lt;SPAN class=""&gt;2&lt;/SPAN&gt; days)&lt;/PRE&gt;&lt;H3&gt;Context:&lt;/H3&gt;&lt;UL&gt;&lt;LI&gt;I am designing a system that uses Delta format for streaming tables.&lt;/LI&gt;&lt;LI&gt;The affected tables have very few transactions or updates, which seems to make them prone to this error.&lt;/LI&gt;&lt;LI&gt;Upon inspecting the&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;_delta_log&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;directory, I noticed that only checkpoint versions 900 and 979 exist. However, the error indicates that it is trying to read from version 899&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="minhhung0507_1-1738728343460.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/14611i7992A4BCBB890BA0/image-size/medium?v=v2&amp;amp;px=400" role="button" title="minhhung0507_1-1738728343460.png" alt="minhhung0507_1-1738728343460.png" /&gt;&lt;/span&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;H3&gt;Questions:&lt;/H3&gt;&lt;OL&gt;&lt;LI&gt;Why is Databricks attempting to access version 899 when the checkpoint files available start from version 900? Could this be a bug or misconfiguration in Delta Lake's automatic cleanup process?&lt;/LI&gt;&lt;LI&gt;Is it possible that Delta Lake's log and checkpoint retention policies are prematurely removing active checkpoints for tables with minimal updates? If so, how can I adjust these settings to prevent this issue?&lt;/LI&gt;&lt;LI&gt;What are the recommended best practices for managing retention policies (delta.logRetentionDuration&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;and&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;delta.checkpointRetentionDuration) for Delta tables with infrequent updates?&lt;/LI&gt;&lt;/OL&gt;&lt;H3&gt;Additional Information:&lt;/H3&gt;&lt;UL&gt;&lt;LI&gt;Retention settings:&lt;UL&gt;&lt;LI&gt;delta.logRetentionDuration = "3 days"&lt;/LI&gt;&lt;LI&gt;delta.checkpointRetentionDuration = "2 days"&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;I would greatly appreciate any insights or suggestions on how to resolve this issue and prevent it from occurring in the future.&lt;/P&gt;&lt;P&gt;Thank you!&lt;/P&gt;</description>
      <pubDate>Wed, 05 Feb 2025 04:06:33 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/deltafilenotfoundexception-delta-truncated-transaction-log-error/m-p/108876#M43166</guid>
      <dc:creator>minhhung0507</dc:creator>
      <dc:date>2025-02-05T04:06:33Z</dc:date>
    </item>
    <item>
      <title>Re: DeltaFileNotFoundException: [DELTA_TRUNCATED_TRANSACTION_LOG] Error in Streaming Table with Mini</title>
      <link>https://community.databricks.com/t5/data-engineering/deltafilenotfoundexception-delta-truncated-transaction-log-error/m-p/109072#M43219</link>
      <description>&lt;P&gt;Hi, does anyone have any suggestions for this topic?&lt;/P&gt;</description>
      <pubDate>Thu, 06 Feb 2025 04:50:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/deltafilenotfoundexception-delta-truncated-transaction-log-error/m-p/109072#M43219</guid>
      <dc:creator>minhhung0507</dc:creator>
      <dc:date>2025-02-06T04:50:04Z</dc:date>
    </item>
    <item>
      <title>Re: DeltaFileNotFoundException: [DELTA_TRUNCATED_TRANSACTION_LOG] Error in Streaming Table with Mini</title>
      <link>https://community.databricks.com/t5/data-engineering/deltafilenotfoundexception-delta-truncated-transaction-log-error/m-p/109221#M43247</link>
      <description>&lt;P&gt;Without knowing the read patterns it's hard to say what the checkpointing issue is. But I'd recommend leaving the default retention periods for log and checkpoint locations if your table's not updated that often. I'd rarely recommend lower than 7 days unless you had some very large fast pipeline.&lt;/P&gt;
&lt;P&gt;I've also never seen someone set checkpoint retention differently from log retention. Not saying it's wrong, just never seen it before.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I'd also recommend looking into &lt;A href="https://docs.databricks.com/en/optimizations/predictive-optimization.html" target="_self"&gt;predictive optimisation&lt;/A&gt; - it's a great way to manage stale files without having to think about it much.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 06 Feb 2025 14:45:48 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/deltafilenotfoundexception-delta-truncated-transaction-log-error/m-p/109221#M43247</guid>
      <dc:creator>holly</dc:creator>
      <dc:date>2025-02-06T14:45:48Z</dc:date>
    </item>
    <item>
      <title>Re: DeltaFileNotFoundException: [DELTA_TRUNCATED_TRANSACTION_LOG] Error in Streaming Table with Mini</title>
      <link>https://community.databricks.com/t5/data-engineering/deltafilenotfoundexception-delta-truncated-transaction-log-error/m-p/109342#M43281</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/36301"&gt;@holly&lt;/a&gt;&amp;nbsp;,&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for the suggestions and solutions you gave, I will try to apply them again and check the results.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Feb 2025 03:42:53 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/deltafilenotfoundexception-delta-truncated-transaction-log-error/m-p/109342#M43281</guid>
      <dc:creator>minhhung0507</dc:creator>
      <dc:date>2025-02-07T03:42:53Z</dc:date>
    </item>
    <item>
      <title>Re: DeltaFileNotFoundException: [DELTA_TRUNCATED_TRANSACTION_LOG] Error in Streaming Table with Mini</title>
      <link>https://community.databricks.com/t5/data-engineering/deltafilenotfoundexception-delta-truncated-transaction-log-error/m-p/109347#M43282</link>
      <description>&lt;P&gt;And the reason I had to set log retention and checkpoint retention to less than 7 days is that if I leave the default values, my pipeline will get a 'Listing file' error which we don't know how to fix yet. So the temporary solution is to reduce the default values ​​to less than 7 days.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Feb 2025 03:56:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/deltafilenotfoundexception-delta-truncated-transaction-log-error/m-p/109347#M43282</guid>
      <dc:creator>minhhung0507</dc:creator>
      <dc:date>2025-02-07T03:56:54Z</dc:date>
    </item>
    <item>
      <title>Re: DeltaFileNotFoundException: [DELTA_TRUNCATED_TRANSACTION_LOG] Error in Streaming Table with Mini</title>
      <link>https://community.databricks.com/t5/data-engineering/deltafilenotfoundexception-delta-truncated-transaction-log-error/m-p/138531#M50948</link>
      <description>&lt;P&gt;I would assume it is trying to read v899 because in you read up until v898 in the last [streaming]batch and stored the state in the streaming checkpoint. Now, if you run the code again and continue the stream, it tries to pick up from the first version it has not read yet - which would be v899 and tries to construct the state of that version. If the necessary files (=Checkpoint of v890 and transactions of v890-v899) cannot be found anymore, it will throw an error like this&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;you should make sure that you run the stream at lest once within your retention period (better at least daily in your case) otherwise you will inevitably run into that error&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Nov 2025 07:59:03 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/deltafilenotfoundexception-delta-truncated-transaction-log-error/m-p/138531#M50948</guid>
      <dc:creator>gbrueckl</dc:creator>
      <dc:date>2025-11-11T07:59:03Z</dc:date>
    </item>
  </channel>
</rss>

