cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Get Started Discussions
Start your journey with Databricks by joining discussions on getting started guides, tutorials, and introductory topics. Connect with beginners and experts alike to kickstart your Databricks experience.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Data governance solution

athang
New Contributor

I am here looking for Data governance solution for organization. I also searched this on many different website and found many solutions. We are bit confused to which one to choose. One of my friend suggest me this platform, and i am hoping i will get the solution from here. So can anyone please suggest the good solution platform?

1 ACCEPTED SOLUTION

Accepted Solutions

Commitchell
Databricks Employee
Databricks Employee

Hi @athang,

Databricks has excellent Data Governance capabilities in Unity Catalog. Our governance is best utilized natively as a part of the rest of the Databricks platform as opposed to a stand alone governance solution. Simply put, the biggest value of Databricks Unity Catalog is that it governs across different workflows and users. It covers ingestion, data engineering, data science, analytics & BI, and even generative AI, all with the same configuration applying across these very different workflows.

Could you describe the rest of your tech stack? The other applications you're using as a part of your data architecture could point to whether or not Databricks could be a strong fit for you.

View solution in original post

3 REPLIES 3

Commitchell
Databricks Employee
Databricks Employee

Hi @athang,

Databricks has excellent Data Governance capabilities in Unity Catalog. Our governance is best utilized natively as a part of the rest of the Databricks platform as opposed to a stand alone governance solution. Simply put, the biggest value of Databricks Unity Catalog is that it governs across different workflows and users. It covers ingestion, data engineering, data science, analytics & BI, and even generative AI, all with the same configuration applying across these very different workflows.

Could you describe the rest of your tech stack? The other applications you're using as a part of your data architecture could point to whether or not Databricks could be a strong fit for you.

Kirankumarbs
Contributor

Hey @athang 

If your data ecosystem is already centered around Spark, Delta Lake, etc., then Databricks is a great fit โ€” especially with Unity Catalog, which is really the heart of the platform.

Governance becomes an integral part of the system โ€” you donโ€™t have to put in extra effort to manage it separately.

For example, in our setup:

  • We ingest high-frequency data into Kafka

  • From there:

    • Ingestion into the RAW layer

    • Transformation into structured tables

    • Workflows / Jobs orchestration

  • Everything is organized under catalog.schema (Tables | Models | Volumes)

  • User access management

  • Job compute clusters and all-purpose interactive clusters for development

  • Dashboards for observability and data quality metrics

  • ML lifecycle management and model serving

You name it โ€” itโ€™s covered, and everything is governed through Unity Catalog.

We manage infrastructure as code using Terraform
and deployments through Databricks Asset Bundles.

This solution โ€” including governance โ€” also works perfectly with externally managed locations.

balajij8
Contributor

You can use Unity Catalog for Databricks Lakehouse. You can use Collibra/Open Metadata along with Unity Catalog for complete governance