โ04-14-2023 01:17 AM
Hi Databricks Team, would like to implement data quality rules in Databricks, apart from DLT do we have any standard approach to perform/ apply data quality rules on bronze layer before further proceeding to silver and gold layer.
โ04-14-2023 10:37 AM
Check out dbdemos.ai, you may be interested in the example of applying tests to your DLT pipeline to ensure data quality.
โ04-18-2023 04:58 AM
โ09-05-2023 02:29 AM
Hi @Phani1 ,
โข Databricks recommends applying data quality rules on the bronze layer before proceeding to the silver and gold layer.
โข The recommended approach involves storing data quality rules in a Delta table.
โข The rules are categorized by a tag and are used in dataset definitions to determine which restrictions to apply.
โข A table named โrulesโ is created to maintain the data quality rules.
โข The rules are defined using SQL constraint clauses.
โข A function called โget_rules()โ is created to read the rules from the โrulesโ table and return a Python dictionary containing rules matching the provided tag.
โข The dictionary of rules is then applied using the '@dlt.expect_all_*()' decorators to enforce data quality constraints.
โข The โget_farmers_market_data()โ function is decorated with the '@dlt.expect_all_or_drop()โ decorator, which applies the data quality constraints defined in the โget_rules()โ function to the โraw_farmers_marketโ table.
โ07-24-2024 12:00 PM
You could also apply data quality checks using open source libraries such as Great Expectations or pydq on the ETL level.
Another approach is to use no code platforms like Rudol to allow non-technical roles such as Data Stewards to implement data quality validations by themselves.
โ07-24-2024 01:19 PM
Hi there,
you should check this python library for data quality checks:
https://canimus.github.io/cuallee/
It is very fast and feature rich when it comes to the checks.
โ07-25-2024 08:32 AM
Looks nice! However I don't see Databricks support in the docs ๐
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโt want to miss the chance to attend and share knowledge.
If there isnโt a group near you, start one and help create a community that brings people together.
Request a New Group