There are two configurations that govern your retention period:
- delta.deletedFileRetentionDuration - This setting determines the retention period for stale data files that are no longer referenced by the current version of the table. Stale files are kept for a default of 7 days, after which they are eligible for deletion via the VACUUM command.
- delta.logRetentionDuration - This configuration specifies how long the table's transaction log history is kept. The default retention period is 30 days, after which older log entries may be deleted.
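As a minimal sketch of how these two properties could be set explicitly, the helper below only builds the ALTER TABLE statement as a string. The function name `retention_sql` and the table name `events` are hypothetical; the property names and the `interval N days` format are the standard Delta ones. It assumes you would pass the result to `spark.sql(...)` in your own session.

```python
def retention_sql(table: str, log_days: int = 30, deleted_file_days: int = 7) -> str:
    """Build an ALTER TABLE statement that sets both Delta retention properties.

    The defaults mirror Delta's documented defaults: 30 days of log history
    and 7 days for unreferenced data files.
    """
    return (
        f"ALTER TABLE {table} SET TBLPROPERTIES ("
        f"'delta.logRetentionDuration' = 'interval {log_days} days', "
        f"'delta.deletedFileRetentionDuration' = 'interval {deleted_file_days} days')"
    )


if __name__ == "__main__":
    # 'events' is a placeholder table name; run the statement with spark.sql(...)
    print(retention_sql("events", log_days=60, deleted_file_days=14))
```

Raising both values keeps older versions reachable for longer, at the cost of extra storage for log entries and stale data files.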
With that context, here are the answers to your questions:
Q) Why can I still go back to the April 22nd version, which is more than 30 days old?
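Reading an old version requires its data files to still exist. Those files are governed by delta.deletedFileRetentionDuration, which defaults to 7 days, and they are only removed when the VACUUM command runs. If VACUUM has not been run yet, the older data files are still in place and the version remains readable. Once you run VACUUM, data files that have been stale for more than 7 days are deleted and versions that depend on them can no longer be read.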
Q) Why do the version numbers start from 100? What happened to the previous versions?
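You can only see versions up to 30 days old in the history because the default value of delta.logRetentionDuration is 30 days. The log entries for the earlier versions have been cleaned up, so the history now starts at version 100.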