Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

How to set retention period for a delta table lower than the default period? Is it even possible?

manasa
Contributor

I am trying to set the retention period for a Delta table using the following commands.

deltaTable = DeltaTable.forPath(spark,delta_path)

spark.conf.set("spark.databricks.delta.retentionDurationCheck.enabled", "false")

deltaTable.logRetentionDuration = "interval 1 days"

deltaTable.deletedFileRetentionDuration = "interval 1 days"

These commands are not working for me: they aren't removing any files for the given interval. Where am I going wrong?

Accepted Solutions

Hubert-Dudek
Esteemed Contributor III

There are two ways:

1) Set this in the cluster configuration (Clusters -> edit -> Spark -> Spark config):

spark.databricks.delta.retentionDurationCheck.enabled false

2) Or set it just before DeltaTable.forPath (I think you need to change the order in your code):

spark.conf.set("spark.databricks.delta.retentionDurationCheck.enabled", "false")
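For anyone landing here later, a minimal sketch of how the pieces might fit together, not the poster's exact code. It assumes the retention values are meant to be stored as the Delta table properties `delta.logRetentionDuration` and `delta.deletedFileRetentionDuration` (assigning attributes on the Python `DeltaTable` object does not write table properties), and that old files are only physically removed when VACUUM runs. The helper name `shorten_retention` and its parameters are made up for illustration, and it uses SQL statements rather than the `DeltaTable` API.

```python
def shorten_retention(spark, delta_path, interval="interval 1 days", retain_hours=24):
    """Hypothetical helper: lower retention on a path-based Delta table, then vacuum.

    `spark` is an active SparkSession with Delta Lake available.
    """
    # 1. Disable the safety check first, so a retention below the
    #    7-day default is accepted by the commands that follow.
    spark.conf.set("spark.databricks.delta.retentionDurationCheck.enabled", "false")

    # Path-based tables are addressed in SQL as delta.`<path>`.
    table = f"delta.`{delta_path}`"

    # 2. Store the retention settings as table properties.
    spark.sql(
        f"ALTER TABLE {table} SET TBLPROPERTIES ("
        f"'delta.logRetentionDuration' = '{interval}', "
        f"'delta.deletedFileRetentionDuration' = '{interval}')"
    )

    # 3. Remove data files that are no longer referenced and are older
    #    than the given number of hours.
    spark.sql(f"VACUUM {table} RETAIN {retain_hours} HOURS")
```

In a notebook where `spark` and `delta_path` are already defined, this would be called as `shorten_retention(spark, delta_path)`. Keep in mind that vacuuming with a short retention limits time travel to older versions and can break readers that are still using the removed files.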

3 REPLIES

Hi @Kaniz Fatma, yes, I was able to solve the issue! Thanks.

Hi @Hubert Dudek, thanks for your response!