cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Are Delta tables able to support GDPR compliance?

User16826992666
Valued Contributor

I know that when deletes are made from a Delta table the underlying files are not actually removed. For compliance reasons I need to able to truly delete the records. How can I know which files need to be removed, and is there a way to remove them other than manually?

1 ACCEPTED SOLUTION

Accepted Solutions

sajith_appukutt
Honored Contributor II

Here is a document explaining best practices for GDPR and CCPA compliance using Delta Lake.

Specifically on cleaning up stale data - you can use the VACUUM function to remove files that are no longer referenced by a Delta table and are older than a specified retention threshold - preventing the ability to view history and roll back.

For e.g.

VACUUM gdpr.customers RETAIN 100 HOURS

View solution in original post

1 REPLY 1

sajith_appukutt
Honored Contributor II

Here is a document explaining best practices for GDPR and CCPA compliance using Delta Lake.

Specifically on cleaning up stale data - you can use the VACUUM function to remove files that are no longer referenced by a Delta table and are older than a specified retention threshold - preventing the ability to view history and roll back.

For e.g.

VACUUM gdpr.customers RETAIN 100 HOURS

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group