04-14-2023 01:17 AM
Hi Databricks Team, would like to implement data quality rules in Databricks, apart from DLT do we have any standard approach to perform/ apply data quality rules on bronze layer before further proceeding to silver and gold layer.
04-14-2023 10:37 AM
Check out dbdemos.ai, you may be interested in the example of applying tests to your DLT pipeline to ensure data quality.
04-18-2023 04:58 AM
07-24-2024 12:00 PM
You could also apply data quality checks using open source libraries such as Great Expectations or pydq on the ETL level.
Another approach is to use no code platforms like Rudol to allow non-technical roles such as Data Stewards to implement data quality validations by themselves.
07-24-2024 01:19 PM
Hi there,
you should check this python library for data quality checks:
https://canimus.github.io/cuallee/
It is very fast and feature rich when it comes to the checks.
07-25-2024 08:32 AM
Looks nice! However I don't see Databricks support in the docs 😕
Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!
Sign Up Now