โ06-29-2023 12:58 PM
We were finally able to get DLT pipelines to run the optimize and vacuum automatically. We verified this via the the table history. However I am able to still query versions older than 7 days. Has anyone been experiencing this and how were you able to fix it.
โ06-29-2023 03:44 PM
Can you please tell me how you verified the vacuum and optimize it's performing automatically. Because I couldn't figure out so I'm running optimize and vacuum command manually every night. Any help would be appreciated.
โ06-30-2023 09:12 AM
@Gilhow much retention period u r setting to your vacuum command please, looks by default it is 7, but still it is recommended to add retention time
โ06-30-2023 04:01 PM
We left the default so I believe itโs 7 days. Thanks
โ06-30-2023 04:13 PM
If I recall I can query versions older than 30days.
โ06-30-2023 09:18 AM
Even with our case I didn't see the default 7 days didn't work based on what I saw that's why I'm running the command manually. IF @Gil can explain or someone can explain how to validate I can stop my job and see if it's actually working (the automatic Vacuum process)
โ06-30-2023 09:46 AM
@NathanSundarara it looks vacuum and optimize are part of maintenance tasks, these tasks will get triggered only within 24 hours of a table being updated
โ06-30-2023 09:51 AM
That's what I thought as well but I checked the number of files didn't reduce now after adding the job it did show less files and compressed. That's why I asked @Gil for verification. Here is how I did one of the table we get like 24 files every hour. One day I noticed it was like total 300 files then I was under assumption if we add 24 files next day after compression it should go down it kept increasing. Now after I created the job it's now showing like 4 or 5 files when I look in the morning and as day progress I see the files it gets added and next day again it will come down to 4 or 5 files.
โ06-30-2023 04:09 PM
I am verifying that optimize and vacuum is running by looking at table history. I am checking which older versions I am able to query and have found I can still query versions older than 7 days. If vacuum is working I should not see versions older than 7 days.
โ07-01-2023 09:02 PM
Hi @Gil
Thank you for your question! To assist you better, please take a moment to review the answer and let me know if it best fits your needs.
Please help us select the best solution by clicking on "Select As Best" if it does.
Your feedback will help us ensure that we are providing the best possible service to you. Thank you!
โ07-11-2023 09:35 PM
Hi @Gil
Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help.
We'd love to hear from you.
Thanks!
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโt want to miss the chance to attend and share knowledge.
If there isnโt a group near you, start one and help create a community that brings people together.
Request a New Group