by
Phani1
• Valued Contributor II
- 9173 Views
- 5 replies
- 0 kudos
Hi Databricks Team, would like to implement data quality rules in Databricks, apart from DLT do we have any standard approach to perform/ apply data quality rules on bronze layer before further proceeding to silver and gold layer.
- 9173 Views
- 5 replies
- 0 kudos
Latest Reply
Looks nice! However I don't see Databricks support in the docs
4 More Replies
by
Kash
• Contributor III
- 2061 Views
- 1 replies
- 0 kudos
Hi there,We would like to create a data quality database that helps us understand how complete our data is. We would like to run a job each day that basically outputs the same table data as dbutils.data.summarize(df) for a given table and save it to ...
- 2061 Views
- 1 replies
- 0 kudos
Latest Reply
From what I know there's no easy way to save dbutils.data.summarize() into a df.You can still create your custom python/pyspark code to profile your data and save the output.
- 7979 Views
- 4 replies
- 2 kudos
Hi everyone,I want to do some tests regarding data quality and for that I pretend to use PyDeequ on a databricks notebook. Keep in mind that I'm very new to databricks and Spark.First I created a cluster with the Runtime version "10.4 LTS (includes A...
- 7979 Views
- 4 replies
- 2 kudos
Latest Reply
I assumed I wouldn't need to add the Deequ library. Apparently, all I had to do was add it via Maven coordinates and it solved the problem.
3 More Replies