Databricks continues to enhance data governance and observability with the introduction of Column Popularity in Unity Catalog's Catalog Explorer. This new capability provides data teams with visibility into which columns are queried most frequently across their organization, helping them make more informed decisions about data management and optimization.

Traditionally, understanding data usage has been limited to table-level metrics. With Column Popularity, organizations can now gain deeper insights into how individual columns are being used by analysts, engineers, and data scientists. This granular visibility helps identify the fields that drive the most business value and deserve focused optimization efforts.
One of the most significant benefits of this feature is its ability to support schema evolution. Before modifying or deprecating columns, data teams can quickly assess their usage patterns and understand the potential impact on downstream users and workloads. This reduces the risk of breaking critical reports, dashboards, or machine learning pipelines.
The feature also contributes to improved query performance and cost optimization. By identifying heavily accessed columns, organizations can prioritize indexing, clustering, partitioning, and storage optimization strategies where they will have the greatest impact. At the same time, rarely used columns can be evaluated for consolidation or clean-up, helping maintain a lean and efficient data architecture.
From a governance perspective, Column Popularity enhances data discovery and stewardship. Data owners can better understand which data assets are most valuable to the business, enabling more effective governance policies, data quality initiatives, and investment decisions.
As data platforms continue to grow in scale and complexity, visibility into actual usage patterns becomes increasingly important. Databricks' Column Popularity feature represents another step toward a more intelligent and data-driven approach to managing enterprise data, empowering teams to optimize performance, reduce risk, and improve overall data governance.
Key Takeaway: Column Popularity transforms column-level usage metrics into actionable insights, helping organizations optimize their Lakehouse environments while making smarter governance and schema management decisions.