cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Discover and redact pii

Algocrat
New Contributor II

Hi! What is the best way to discover and redact pii. Does Databricks offer any frameworks, or set of methods, or processes that we may follow?

 

 

1 ACCEPTED SOLUTION

Accepted Solutions

szymon_dybczak
Esteemed Contributor III

Hi @Algocrat ,

I think there is no out of the box solution in datbricks yet for this problem. 

You can try to use some open source tools like Presidio  ( look at below article).

Or if you are on azure, you can try to use Azure Purview

https://medium.com/databricks-platform-sme/identifying-and-tagging-pii-data-with-unity-catalog-87052...

 

View solution in original post

2 REPLIES 2

szymon_dybczak
Esteemed Contributor III

Hi @Algocrat ,

I think there is no out of the box solution in datbricks yet for this problem. 

You can try to use some open source tools like Presidio  ( look at below article).

Or if you are on azure, you can try to use Azure Purview

https://medium.com/databricks-platform-sme/identifying-and-tagging-pii-data-with-unity-catalog-87052...

 

viswesh
New Contributor II

Hey @Algocrat  @szymon_dybczak , just wanted to let you know that Databricks is currently working on a product to tackle PII / sensitive data classification. If you're a current customer, we recommend you reach out to your account representative to learn more about how to try it out!

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local communityโ€”sign up today to get started!

Sign Up Now