cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

How to remove checkpoints from DeltaLake table ?

Hunter1604
New Contributor II

How to remove checkpoints from DeltaLake table ?

I see that on my delta table exist a few checkpoints I want to remove the oldest one. 

It seems that existing of it is blocking removing the oldest _delta_logs entries

5 REPLIES 5

Ajay-Pandey
Esteemed Contributor III

Hi @Pawel Wojโ€‹ ,

If you want to keep your checkpoints X days, you can set delta.checkpointRetentionDuration to X days this way:

spark.sql(f"""
        ALTER TABLE delta.`path`
            SET TBLPROPERTIES (
                delta.checkpointRetentionDuration = 'X days'
            )
        """
)

Hi,

This looks well, thanks - will be testing it.

Ajay-Pandey
Esteemed Contributor III

Test and let me know if it's working for you.

Anonymous
Not applicable

Hi @Pawel Wojโ€‹ 

Hope everything is going great.

Just wanted to check in if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please tell us so we can help you. 

Cheers!

Hi

Proposed solution looks good, I will be testing it,

but can you tell me why I can't found this param in offical DB documentation ?

I would like to have more infromation about before apply this configuration on prod evn.

BR

Join 100K+ Data Experts: Register Now & Grow with Us!

Excited to expand your horizons with us? Click here to Register and begin your journey to success!

Already a member? Login and join your local regional user group! If there isn’t one near you, fill out this form and we’ll create one for you to join!