03-18-2026 05:26 AM
Hello Databricks Community,
Databricks is making great progress towards becoming a truly governed and unified catalog. When it comes to governing and cataloguing physical data assets, it already offers a strong foundation. With the introduction of metric views last year, defining and managing measures across the organisation has become much easier.
However, we are still exploring how to properly govern reports (made in Power BI) and business terms using Databricks. I am curious to learn how others are approaching this.
Does anyone have tips, best practices, or examples of how Databricks can be used as a governed report catalog? And how do you define and manage a business glossary within Unity Catalog?
Looking forward to your insights and experiences.
Maurice Dekker
03-20-2026 11:19 AM
Hello @MauriceDekker ,
Good question, and you've framed it well — UC has genuinely gotten strong for physical data assets and metrics, so reports and business terminology feel like the obvious next frontier. Here's how I see customers approaching this today, along with where the real limits are.
On governing Power BI reports in UC:
Honest answer: You can't register or permission a report directly inside UC today. The pattern that tends to work is keeping the semantic layer centralized in Databricks — curated gold tables and metric views with clear names, descriptions, and owners — and requiring Power BI reports to connect live to those UC-governed objects rather than letting each team build ad hoc datasets on the side.
Access control ends up working in two layers: UC permissions on the underlying tables/views, and Power BI workspace/report permissions on top. If someone doesn't have access to a dataset in UC, they can't see that data in any report consuming it. That's your real governance enforcement point.
For the report catalog piece, since reports aren't native UC assets, many teams build a lightweight registry as a Delta table — something like governance.report_registry with columns for report name, workspace, owner, business domain, primary datasets, PII classification, and status. A small scheduled job pulling from the Power BI REST API keeps it current. Analysts can then browse certified reports and see exactly which governed datasets they reference. Not glamorous, but it works.
On a business glossary in UC:
No built-in glossary UI exists today — no dedicated object type, no lifecycle workflows out of the box. What works is building it from native primitives. A governance schema (e.g., governance.business_glossary) with a terms table covering name, definition, business owner, domain, status, and synonyms gets you most of the way there. Add a relationships table if you need broader/narrower/related term linkage.
From there, UC column and table comments handle concise business descriptions, and tags link assets back to glossary terms — tagging a column with business_term = "Net Revenue" or data_classification = "Confidential", for example. You can then query UC system tables or information_schema to find where a term is actually implemented and which columns carry a given classification. Restrict writes to a small stewardship group, open reads broadly, and optionally surface it through a Databricks SQL dashboard so business users don't have to query tables directly.
If you're already running Purview, Collibra, or Alation, a reasonable model is keeping that as your primary glossary UI and syncing terms and classifications into UC via their connectors. UC enforces access; the external tool handles the governance workflow and UX.
Hope this helps.
Cheers, Louis
Want anything tweaked before you post?
03-20-2026 11:19 AM
Hello @MauriceDekker ,
Good question, and you've framed it well — UC has genuinely gotten strong for physical data assets and metrics, so reports and business terminology feel like the obvious next frontier. Here's how I see customers approaching this today, along with where the real limits are.
On governing Power BI reports in UC:
Honest answer: You can't register or permission a report directly inside UC today. The pattern that tends to work is keeping the semantic layer centralized in Databricks — curated gold tables and metric views with clear names, descriptions, and owners — and requiring Power BI reports to connect live to those UC-governed objects rather than letting each team build ad hoc datasets on the side.
Access control ends up working in two layers: UC permissions on the underlying tables/views, and Power BI workspace/report permissions on top. If someone doesn't have access to a dataset in UC, they can't see that data in any report consuming it. That's your real governance enforcement point.
For the report catalog piece, since reports aren't native UC assets, many teams build a lightweight registry as a Delta table — something like governance.report_registry with columns for report name, workspace, owner, business domain, primary datasets, PII classification, and status. A small scheduled job pulling from the Power BI REST API keeps it current. Analysts can then browse certified reports and see exactly which governed datasets they reference. Not glamorous, but it works.
On a business glossary in UC:
No built-in glossary UI exists today — no dedicated object type, no lifecycle workflows out of the box. What works is building it from native primitives. A governance schema (e.g., governance.business_glossary) with a terms table covering name, definition, business owner, domain, status, and synonyms gets you most of the way there. Add a relationships table if you need broader/narrower/related term linkage.
From there, UC column and table comments handle concise business descriptions, and tags link assets back to glossary terms — tagging a column with business_term = "Net Revenue" or data_classification = "Confidential", for example. You can then query UC system tables or information_schema to find where a term is actually implemented and which columns carry a given classification. Restrict writes to a small stewardship group, open reads broadly, and optionally surface it through a Databricks SQL dashboard so business users don't have to query tables directly.
If you're already running Purview, Collibra, or Alation, a reasonable model is keeping that as your primary glossary UI and syncing terms and classifications into UC via their connectors. UC enforces access; the external tool handles the governance workflow and UX.
Hope this helps.
Cheers, Louis
Want anything tweaked before you post?
03-23-2026 12:24 AM
Thanks for the reply, @Louis_Frolio ! This is a clear and detailed answer. It is indeed quite similar to the approach we had in mind. Do you perhaps know whether Databricks is planning to introduce a dedicated Business Glossary in the future, or is that not yet part of the roadmap? In any case, many thanks for your reply!
2 weeks ago
Hi Maurice, thanks for sharing your experience. We're facing similar challenges! While Databricks shines with data assets and metric views, integrating report governance is tricky. We're experimenting with tagging reports in Power BI and then linking those tags to relevant business terms defined externally (for now!). It's a work in progress, but I'd be keen to hear what you discover too! Cps Test
2 weeks ago
Hi @AnthoRichard, how exciting to see another reply😁. That sounds like one way to do it and I think our temporary solution will come pretty close to this. We decided to extract the metadata of reports from Power BI. We have a SharePoint List that is used as Business Glossary (it has some surprisingly nifty features). We are discussing to build a catalog list in SharePoint for Report metadata as well. A column can then be used to multi-select the business terms from the Glossary List to link them to the reports in the Report Catalog. It is a bit of a workaround. But its a temporary solution as PoC that governance is needed. SharePoint Lists are not my favorite UI for looking up elements of a list. Thats why the list content is exported to Databricks/Power BI to build a dashboard on the metadata to make it discoverable. It is definitely some workaround for now...