on 01-10-2024 05:00 PM
The Databricks feature store provides a catalog in which data scientists can search for existing features in the offline feature store. The feature store UI is searchable, so you can discover features and view the code used to create them. You can also navigate to the notebook or job containing the computation logic for a specific feature. Some of this information is consolidated into Unity Catalog.
Databricks recommends enabling Unity Catalog integration to help resolve duplicate features and feature conflicts in the offline feature store. With the integration enabled, you can use data lineage, which is available in the Data Explorer under Data -> Data Explorer -> Lineage. You can also query INFORMATION_SCHEMA to list the column names of the tables in a catalog and identify naming conflicts.
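As a rough illustration of the INFORMATION_SCHEMA approach, here is a minimal Python sketch. It assumes your feature tables live in a single Unity Catalog schema; the catalog and schema names ("ml", "features") and both helper functions are made up for this example and are not part of any Databricks API. The query only reads the standard table_schema, table_name, and column_name columns of the information_schema.columns view.

```python
from collections import defaultdict


def build_columns_query(catalog: str, schema: str) -> str:
    """Return SQL that lists every column of every table in one schema.

    The catalog and schema names are supplied by the caller; the ones used
    in the usage comment below are placeholders.
    """
    return (
        "SELECT table_name, column_name "
        f"FROM {catalog}.information_schema.columns "
        f"WHERE table_schema = '{schema}' "
        "ORDER BY column_name, table_name"
    )


def find_name_conflicts(rows):
    """Group (table_name, column_name) pairs by lowercased column name.

    Returns only the column names that appear in two or more tables,
    mapped to the sorted list of tables that define them.
    """
    tables_by_column = defaultdict(set)
    for table_name, column_name in rows:
        tables_by_column[column_name.lower()].add(table_name)
    return {
        column: sorted(tables)
        for column, tables in tables_by_column.items()
        if len(tables) > 1
    }


# In a Databricks notebook (assumes the built-in `spark` session):
# result = spark.sql(build_columns_query("ml", "features")).collect()
# rows = [(r.table_name, r.column_name) for r in result]
# print(find_name_conflicts(rows))
```

Primary key columns such as an ID will normally show up in several feature tables, so treat the output as a list of candidates to review rather than a list of confirmed duplicates.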
You can refer to the documentation for detailed examples and guides on using the feature store APIs, best practices, and use cases: