cancel
Showing results for 
Search instead for 
Did you mean: 
MVP Articles
This page brings together externally published articles written by our MVPs. Discover expert perspectives, real-world guidance, and community contributions from leaders across the ecosystem.
cancel
Showing results for 
Search instead for 
Did you mean: 

Let's talk about Data Governance

scottdavis
New Contributor

As we move into 2026, the mandate for modern data and AI is clear. The industry has pivoted from evaluating isolated features to building foundational futures. Since the 2025 Data + AI Summit, I have consulted with numerous organizations across a range of topics from architecture to AI. Despite their diverse needs, one priority was constant: a focus on Data Governance (DG). Every single one of these companies, regardless of industry or maturity, identified governance as a primary focus.

Throughout these discussions, a recurring theme emerged. While everyone acknowledges that DG is critical, few fully grasp the scope of modern capabilities. This is understandable since the landscape has evolved at a blistering pace since 2022. To succeed in 2026, we must first redefine what Data Governance actually is.

The Four Pillars of Modern Governance

While I would love to jump into “what” companies should do, we first need to define what modern data governance is capable of. Let’s start at a higher level and look at the broad categories of DG

Screenshot 2026-01-06 091426.png

Most organizations struggle to place themselves within this orbit. Typically, they possess a foundation in security that is NOT based on governance while also wondering where their data catalog fits into this model.

To gain clarity, we must look one layer deeper. Each of these four categories contains two critical subcategories:

Gemini_Generated_Image_7out5m7out5m7outa.png

(I hope you enjoy this over the top "Star Wars" inspired governance image)

Areas like Access Control, Catalog (part of Discovery) and even Cost Controls are often well established. If not, this is a great place to start your governance journey while evaluating how to incorporate the new governance features.

Over the past several years, data governance products have been offering more features around Lineage and Data Quality but often infer from observations and leave room for some uncertainty in production. Most companies will opt to combine multiple teams, products, and technologies to try to achieve parts of these eight data governance components.

The Governance Option: Observational vs. Integrated

Governance is generally approached from two directions: Observational or Foundational.

Screenshot 2026-01-06 100717.png

The distinction is vital. Observational governance sits above the stack, watching and inferring to produce metadata. Integrated governance (Foundational Control) fundamentally owns the data. It doesn't just watch, it logs, monitors, learns from, and controls the data flow in real-time.

While I could deep dive into how these approaches impact FinOps or Data Quality, the most critical advancement for 2026 is Business Semantics. This is where governance becomes "Business Aware."

The Evolution of Business Semantics

Business semantics ensure consistent, trusted definitions across an organization. Historically, this was a manual process, tribal knowledge passed from Subject Matter Experts (SMEs) to analysts.

The Legacy Workflow:

Imagine a new analyst is tasked with calculating the "Average revenue by region for active customers." They are immediately met with ambiguity:

  • How is "revenue" defined (gross vs. net)?
  • What qualifies as an "active" customer?
  • Which specific table is the "Source of Truth"?
  • What is the correct way to join my data?
  • How do I ensure the results are accurate?

The Modern (governed) Workflow:

By utilizing Databricks Unity Catalog, we move toward a business aware intelligence model. It is currently the only solution that integrates business logic directly into the query layer. Governance now drives AI and analytics through:

  • AI-Powered Documentation: Automated, context-aware metadata.
  • UC Semantic Model Agents: Low-code, UI-based modeling that bridges the gap between raw data and business logic.
  • Flexible Metric Views: Centralized definitions for window functions and LOD calculations.
  • Universal Connectivity: Querying metric views via direct query mode from any BI tool.

Governance as the Engine for AI

You cannot scale AI without a governed foundation. Unity Catalog provides the "trust layer" required to move AI from a laboratory experiment to a production grade asset.

By integrating governance directly into the data path, organizations achieve:

  1. Accelerated Discovery: Finding the right features for model training in seconds.
  2. Verifiable Trust: Ensuring the data feeding your AI products are accurate.
  3. Secure Collaboration: Sharing governed AI functions and agents across the enterprise without compromising compliance.

Governance has become the accelerator for data

For 2026, the question is no longer if you need governance, but how quickly you can implement a business aware governance foundation. Unity Catalog provides the tracks for your AI engine to run at full speed.

As organizations move past the AI hype and into full-scale production, they have collided with a single, universal bottleneck: Data Governance.

Databricks has solved this bottleneck by decoupling governance from proprietary cloud environments. Unity Catalog has emerged as the industry leader because it is:

  • Cloud Agnostic: Deployable across any cloud provider for a unified experience.
  • Frictionless: Designed for rapid integration into existing workflows.
  • Open Source: A license-free, community-driven foundation that evolves at the speed of the modern data stack.
  • Business Aware: Learns from and controls the data flow in real-time

I love all things data and for 2026 I have my eye on Data Governance and all it helps to unlock. Truly exciting times for the data world.

 

0 REPLIES 0