<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: DLT optimize and vacuum in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36514#M26155</link>
    <description>&lt;P&gt;I am verifying that optimize and vacuum are running by looking at the table history. I am checking which older versions I can query, and I have found that I can still query versions older than 7 days. If vacuum is working, I should not see versions older than 7 days.&lt;/P&gt;</description>
    <pubDate>Fri, 30 Jun 2023 23:09:14 GMT</pubDate>
    <dc:creator>Gil</dc:creator>
    <dc:date>2023-06-30T23:09:14Z</dc:date>
    <item>
      <title>DLT optimize and vacuum</title>
      <link>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36307#M26092</link>
      <description>&lt;P&gt;We were finally able to get DLT pipelines to run optimize and vacuum automatically. We verified this via the table history. However, I am still able to query versions older than 7 days. Has anyone experienced this, and how were you able to fix it?&lt;/P&gt;</description>
      <pubDate>Thu, 29 Jun 2023 19:58:52 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36307#M26092</guid>
      <dc:creator>Gil</dc:creator>
      <dc:date>2023-06-29T19:58:52Z</dc:date>
    </item>
    <item>
      <title>Re: DLT optimize and vacuum</title>
      <link>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36432#M26118</link>
      <description>&lt;P&gt;Can you please tell me how you verified that vacuum and optimize are running automatically? I couldn't figure it out, so I'm running the optimize and vacuum commands manually every night. Any help would be appreciated.&lt;/P&gt;</description>
      <pubDate>Thu, 29 Jun 2023 22:44:53 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36432#M26118</guid>
      <dc:creator>NathanSundarara</dc:creator>
      <dc:date>2023-06-29T22:44:53Z</dc:date>
    </item>
    <item>
      <title>Re: DLT optimize and vacuum</title>
      <link>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36482#M26140</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/38758"&gt;@Gil&lt;/a&gt; what retention period are you setting on your vacuum command? It looks like the default is 7 days, but it is still recommended to set a retention time explicitly.&lt;/P&gt;</description>
      <pubDate>Fri, 30 Jun 2023 16:12:23 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36482#M26140</guid>
      <dc:creator>karthik_p</dc:creator>
      <dc:date>2023-06-30T16:12:23Z</dc:date>
    </item>
    <item>
      <title>Re: DLT optimize and vacuum</title>
      <link>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36483#M26141</link>
      <description>&lt;P&gt;In our case too, the default 7 days didn't appear to work based on what I saw, which is why I'm running the command manually. If&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/38758"&gt;@Gil&lt;/a&gt;&amp;nbsp;or someone else can explain how to validate it, I can stop my job and see whether the automatic vacuum process is actually working.&lt;/P&gt;</description>
      <pubDate>Fri, 30 Jun 2023 16:18:29 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36483#M26141</guid>
      <dc:creator>NathanSundarara</dc:creator>
      <dc:date>2023-06-30T16:18:29Z</dc:date>
    </item>
    <item>
      <title>Re: DLT optimize and vacuum</title>
      <link>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36486#M26143</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/60745"&gt;@NathanSundarara&lt;/a&gt;&amp;nbsp;it looks like vacuum and optimize are part of the maintenance tasks; these tasks are triggered only&amp;nbsp;&lt;SPAN&gt;within 24 hours of a table being updated&lt;/SPAN&gt;.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="karthik_p_0-1688143460140.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/2719iA054B797906B17BB/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400" role="button" title="karthik_p_0-1688143460140.png" alt="karthik_p_0-1688143460140.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 30 Jun 2023 16:46:17 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36486#M26143</guid>
      <dc:creator>karthik_p</dc:creator>
      <dc:date>2023-06-30T16:46:17Z</dc:date>
    </item>
    <item>
      <title>Re: DLT optimize and vacuum</title>
      <link>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36488#M26145</link>
      <description>&lt;P&gt;That's what I thought as well, but when I checked, the number of files didn't go down. After I added the job, it did show fewer, compacted files. That's why I asked&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/38758"&gt;@Gil&lt;/a&gt;&amp;nbsp;for verification. Here is how I checked: one of our tables gets about 24 files every hour. One day I noticed it had about 300 files in total. I assumed the count would go down the next day after compaction, even with new files being added, but it kept increasing. Now that I've created the job, it shows 4 or 5 files when I look in the morning; as the day progresses, more files are added, and the next day it comes back down to 4 or 5 files.&lt;/P&gt;</description>
      <pubDate>Fri, 30 Jun 2023 16:51:39 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36488#M26145</guid>
      <dc:creator>NathanSundarara</dc:creator>
      <dc:date>2023-06-30T16:51:39Z</dc:date>
    </item>
    <item>
      <title>Re: DLT optimize and vacuum</title>
      <link>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36513#M26154</link>
      <description>&lt;P&gt;We left the default, so I believe it’s 7 days. Thanks.&lt;/P&gt;</description>
      <pubDate>Fri, 30 Jun 2023 23:01:35 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36513#M26154</guid>
      <dc:creator>Gil</dc:creator>
      <dc:date>2023-06-30T23:01:35Z</dc:date>
    </item>
    <item>
      <title>Re: DLT optimize and vacuum</title>
      <link>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36514#M26155</link>
      <description>&lt;P&gt;I am verifying that optimize and vacuum are running by looking at the table history. I am checking which older versions I can query, and I have found that I can still query versions older than 7 days. If vacuum is working, I should not see versions older than 7 days.&lt;/P&gt;</description>
      <pubDate>Fri, 30 Jun 2023 23:09:14 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36514#M26155</guid>
      <dc:creator>Gil</dc:creator>
      <dc:date>2023-06-30T23:09:14Z</dc:date>
    </item>
    <item>
      <title>Re: DLT optimize and vacuum</title>
      <link>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36515#M26156</link>
      <description>&lt;P&gt;If I recall correctly, I can query versions older than 30 days.&lt;/P&gt;</description>
      <pubDate>Fri, 30 Jun 2023 23:13:41 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36515#M26156</guid>
      <dc:creator>Gil</dc:creator>
      <dc:date>2023-06-30T23:13:41Z</dc:date>
    </item>
    <item>
      <title>Re: DLT optimize and vacuum</title>
      <link>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36665#M26179</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/38758"&gt;@Gil&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Thank you for your question! To assist you better, please take a moment to review the answer and let me know if it best fits your needs.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Please help us select the best solution by clicking on "Select As Best" if it does.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Your feedback will help us ensure that we are providing the best possible service to you. Thank you!&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 02 Jul 2023 04:02:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/36665#M26179</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-07-02T04:02:43Z</dc:date>
    </item>
    <item>
      <title>Re: DLT optimize and vacuum</title>
      <link>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/37412#M26348</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/38758"&gt;@Gil&lt;/a&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Hope all is well! Just checking in: were you able to resolve your issue? If so, would you be happy to share the solution or mark an answer as best? Otherwise, please let us know if you need more help.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;We'd love to hear from you.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Thanks!&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 12 Jul 2023 04:35:45 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dlt-optimize-and-vacuum/m-p/37412#M26348</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-07-12T04:35:45Z</dc:date>
    </item>
  </channel>
</rss>

