Hello Databricks Community,
I am experiencing an issue with Delta Lake where the _delta_log files are not being deleted automatically in GCS bucket, even though I have set the table properties to enable this behavior. Here is the configuration I used:
ALTER TABLE delta.`gs://sample-data`
SET TBLPROPERTIES (
'retentionDurationCheck.enabled'='false',
'delta.logRetentionDuration' = 'interval 1 days',
'delta.deletedFileRetentionDuration' = 'interval 1 days',
'delta.autoOptimize.optimizeWrite' = 'false',
'delta.autoOptimize.autoCompact' = 'true',
'delta.targetFileSize' = '1073741824'
);
Despite these settings, the log files remain in the directory beyond the specified retention period. I understand that log files should be deleted automatically after checkpoint operations, and I have ensured that checkpoints are being created.
Could there be any specific reasons or additional configurations required for these settings to take effect? Is there a known issue with certain environments or configurations that might prevent the automatic deletion of Delta log files?
I appreciate any insights or suggestions from those who have encountered and resolved similar issues.