
Data Quality in Databricks

Phani1
Valued Contributor II

Hi Databricks Team, we would like to implement data quality rules in Databricks. Apart from DLT, is there a standard approach for applying data quality rules to the bronze layer before proceeding to the silver and gold layers?

5 REPLIES

AndrewN
Databricks Employee

Check out dbdemos.ai. You may be interested in the example of applying tests to your DLT pipeline to ensure data quality.

https://www.dbdemos.ai/demo.html?demoName=dlt-unit-test

Phani1
Valued Contributor II

Thanks for sharing the details. We are using DBR 12.2.

We are facing the error below while importing the libraries/package. Could you please help me import it?

[Attached image: import_error]

joarobles
New Contributor III

You could also apply data quality checks at the ETL level using open source libraries such as Great Expectations or pydq.

Another approach is to use no-code platforms like Rudol, which allow non-technical roles such as Data Stewards to implement data quality validations by themselves.
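To make the ETL-level idea a bit more concrete, here is a minimal, library-agnostic sketch in plain Python. It does not use the API of Great Expectations or any other library named in this thread; the `Rule` class, the `split_by_quality` helper, the rule names, and the sample "orders" records are all hypothetical and only illustrate the pattern of checking bronze records against named rules and quarantining failures before promoting the rest to silver.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List, Tuple

Record = Dict[str, Any]


@dataclass(frozen=True)
class Rule:
    """A named data quality rule (hypothetical helper, not a library API)."""
    name: str
    predicate: Callable[[Record], bool]  # returns True when the record passes


# Hypothetical rules for an illustrative "orders" bronze table.
RULES: List[Rule] = [
    Rule("order_id_not_null", lambda r: r.get("order_id") is not None),
    Rule(
        "amount_non_negative",
        lambda r: isinstance(r.get("amount"), (int, float)) and r["amount"] >= 0,
    ),
]


def split_by_quality(
    records: List[Record], rules: List[Rule]
) -> Tuple[List[Record], List[Record]]:
    """Split records into (valid, quarantined).

    Quarantined records carry a `_failed_rules` field listing the names
    of every rule they violated, so they can be inspected later.
    """
    valid: List[Record] = []
    quarantined: List[Record] = []
    for record in records:
        failed = [rule.name for rule in rules if not rule.predicate(record)]
        if failed:
            quarantined.append({**record, "_failed_rules": failed})
        else:
            valid.append(record)
    return valid, quarantined


# Illustrative bronze records (made-up sample data).
bronze = [
    {"order_id": 1, "amount": 10.5},
    {"order_id": None, "amount": 3.0},
    {"order_id": 2, "amount": -4.0},
]

valid, quarantined = split_by_quality(bronze, RULES)
```

Here `valid` would feed the silver layer, while `quarantined` would be written to a separate location for review. On Spark, the same pattern would typically be expressed as column expressions passed to `DataFrame.filter` rather than Python predicates applied row by row.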

aalanis
New Contributor II

Hi there,

You should check out this Python library for data quality checks:

https://canimus.github.io/cuallee/

It is very fast and feature-rich when it comes to checks.

joarobles
New Contributor III

Looks nice! However, I don't see Databricks support in the docs 😕
