08-22-2024 05:22 AM
Hello Team,
I need some clarification on the below diagram . According to the documentation, the Unity Catalog is set up for each region. If we are using multiple clouds, the diagram shows only one Unity Catalog across regions. Shouldn't there be two Unity Catalogs instead?
Regards,
Janga
08-23-2024 01:40 AM
There are indeed 2 metastores, as displayed in the picture, buth use UC.
The different metastores are not merged into one single metastore.
If one needs access to the other you can use Delta Sharing
https://docs.databricks.com/en/data-governance/unity-catalog/best-practices.html
08-23-2024 02:09 AM
does this mean each metastore is a UC setup for each region?
08-23-2024 02:13 AM
yes, the metastore is the root, linked to physical storage. for each metastore one has to define catalogs, schemas, tables, volumes, permissions etc.
Perhaps in the future databricks will add a META-metastore, over regions, but afaik that is not yet in the pipeline.
08-23-2024 02:16 AM
thanks for your reply. So shall i assume that each metastore is a UC in the diagram for each region?
08-23-2024 02:21 AM
Yes, the reason why they are grouped into a single rectangle is probably to show that they both are Unity Catalog enabled. It can indeed be confusing, represented like that.
If you want to let them connect to each other, delta sharing or metastore federation (or even define one as an external location) is the way to go.
09-10-2024 02:34 AM
Thank you, I didn't know it before.
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.
If there isn’t a group near you, start one and help create a community that brings people together.
Request a New Group