<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: File not found error. Does OPTIMIZE delete initial versions of the Delta table? in Machine Learning</title>
    <link>https://community.databricks.com/t5/machine-learning/file-not-found-error-does-optimize-deletes-initial-versions-of/m-p/9353#M440</link>
    <description>&lt;P&gt;Have you run VACUUM on the table? VACUUM deletes data files that have been marked for removal and are older than the retention period.&lt;/P&gt;&lt;P&gt;OPTIMIZE compacts small files into larger ones and marks the small files for removal in the transaction log, but it does not physically delete them.&lt;/P&gt;</description>
    <pubDate>Thu, 23 Mar 2023 00:08:03 GMT</pubDate>
    <dc:creator>swethaNandan</dc:creator>
    <dc:date>2023-03-23T00:08:03Z</dc:date>
    <item>
      <title>File not found error. Does OPTIMIZE delete initial versions of the Delta table?</title>
      <link>https://community.databricks.com/t5/machine-learning/file-not-found-error-does-optimize-deletes-initial-versions-of/m-p/9352#M439</link>
      <description>&lt;P&gt;I am reading the change data feed of a Delta table as a stream:&lt;/P&gt;&lt;P&gt;df = (&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;spark.readStream.format("delta")&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;.option("readChangeFeed", "true")&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;.option("startingVersion", 1)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;.table("CatalogName.SchemaName.TableName")&lt;/P&gt;&lt;P&gt;)&lt;/P&gt;&lt;P&gt;display(df)&lt;/P&gt;&lt;P&gt;It fails with this error:&lt;/P&gt;&lt;P&gt;A file referenced in the transaction log cannot be found. This occurs when data has been manually deleted from the file system rather than using the table `DELETE` statement.&lt;/P&gt;&lt;P&gt;When I check the path in the error, the file is not there. I did not delete it manually, and I did not run OPTIMIZE on the table myself.&lt;/P&gt;&lt;P&gt;Does a Delta table have a default setting that optimizes it automatically? The table history shows a few OPTIMIZE operations, and after them I can no longer read the earlier versions of the table. The complete current data is available, but I cannot see the data for the initial version.&lt;/P&gt;</description>
      <pubDate>Wed, 15 Feb 2023 05:59:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/file-not-found-error-does-optimize-deletes-initial-versions-of/m-p/9352#M439</guid>
      <dc:creator>Data_Analytics1</dc:creator>
      <dc:date>2023-02-15T05:59:54Z</dc:date>
    </item>
    <item>
      <title>Re: File not found error. Does OPTIMIZE delete initial versions of the Delta table?</title>
      <link>https://community.databricks.com/t5/machine-learning/file-not-found-error-does-optimize-deletes-initial-versions-of/m-p/9353#M440</link>
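      <description>&lt;P&gt;Have you run VACUUM on the table? VACUUM deletes data files that have been marked for removal and are older than the retention period.&lt;/P&gt;&lt;P&gt;OPTIMIZE compacts small files into larger ones and marks the small files for removal in the transaction log, but it does not physically delete them. If a VACUUM ran after those OPTIMIZE operations, the files that the earlier versions reference may be gone, which would explain the error.&lt;/P&gt;</description>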
      <pubDate>Thu, 23 Mar 2023 00:08:03 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/file-not-found-error-does-optimize-deletes-initial-versions-of/m-p/9353#M440</guid>
      <dc:creator>swethaNandan</dc:creator>
      <dc:date>2023-03-23T00:08:03Z</dc:date>
    </item>
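    <!--
    The distinction drawn in the reply above (OPTIMIZE only marks files for
    removal, VACUUM physically deletes them after the retention period) can be
    sketched with a small in-memory model. This is a toy illustration only, not
    the Delta Lake API: ToyDeltaTable, FileNotFound, the file names, and the
    hour-based clock are all hypothetical names made up for this sketch.

```python
from dataclasses import dataclass, field


class FileNotFound(Exception):
    """Raised when a version references a data file that is no longer stored."""


@dataclass
class ToyDeltaTable:
    """Toy model of a versioned table log. NOT the Delta Lake API."""

    storage: dict = field(default_factory=dict)     # path to rows physically present
    versions: list = field(default_factory=list)    # index is version, value is live paths
    tombstones: dict = field(default_factory=dict)  # path to hour it was marked removed

    def _commit(self, live_files):
        # Record a new table version and return its number.
        self.versions.append(tuple(live_files))
        return len(self.versions) - 1

    def write(self, path, rows):
        # Append one data file and commit a new version that includes it.
        self.storage[path] = list(rows)
        current = self.versions[-1] if self.versions else ()
        return self._commit(current + (path,))

    def optimize(self, compacted_path, now):
        # Compact all live files into one. The old files are only marked
        # for removal (tombstoned); they stay on storage.
        current = self.versions[-1]
        self.storage[compacted_path] = [r for p in current for r in self.storage[p]]
        for p in current:
            self.tombstones[p] = now
        return self._commit((compacted_path,))

    def vacuum(self, now, retention_hours):
        # Physically delete tombstoned files older than the retention period.
        deleted = []
        for p, removed_at in list(self.tombstones.items()):
            if now - removed_at > retention_hours:
                del self.storage[p]
                del self.tombstones[p]
                deleted.append(p)
        return sorted(deleted)

    def read(self, version):
        # Read every file the requested version references.
        rows = []
        for p in self.versions[version]:
            if p not in self.storage:
                raise FileNotFound(p)
            rows.extend(self.storage[p])
        return rows


table = ToyDeltaTable()
table.write("part-0", [1, 2])                  # version 0
table.write("part-1", [3])                     # version 1
table.optimize("compacted-0", now=0)           # version 2
rows_before_vacuum = table.read(1)             # old version is still readable
table.vacuum(now=200, retention_hours=168)     # 168 hours is 7 days
rows_latest = table.read(2)                    # latest version is unaffected
```

    In this model, reading version 1 after the vacuum call raises FileNotFound,
    while version 2 still returns all rows. That mirrors the symptom in the
    question: the current data is complete, but the initial versions cannot be
    read. The 168-hour value is chosen to match the 7-day default retention
    threshold documented for VACUUM.
    -->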
  </channel>
</rss>

