โ04-14-2023 01:17 AM
Hi Databricks Team, would like to implement data quality rules in Databricks, apart from DLT do we have any standard approach to perform/ apply data quality rules on bronze layer before further proceeding to silver and gold layer.
โ04-14-2023 10:37 AM
Check out dbdemos.ai, you may be interested in the example of applying tests to your DLT pipeline to ensure data quality.
โ04-18-2023 04:58 AM
โ07-24-2024 12:00 PM
You could also apply data quality checks using open source libraries such as Great Expectations or pydq on the ETL level.
Another approach is to use no code platforms like Rudol to allow non-technical roles such as Data Stewards to implement data quality validations by themselves.
โ07-24-2024 01:19 PM
Hi there,
you should check this python library for data quality checks:
https://canimus.github.io/cuallee/
It is very fast and feature rich when it comes to the checks.
โ07-25-2024 08:32 AM
Looks nice! However I don't see Databricks support in the docs ๐
Passionate about hosting events and connecting people? Help us grow a vibrant local communityโsign up today to get started!
Sign Up Now