03-24-2023 07:15 AM
Thank you in advance for your help!
03-24-2023 09:45 AM
@Ashok Zubrewar Please find the answers inline
https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/create-metastore
03-24-2023 09:45 AM
@Ashok Zubrewar Please find the answers inline
https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/create-metastore
03-26-2023 07:06 AM
Thank you so much for response. Based on your answers, i have also set up a POC metastore and able to understand catalog separation.
1. I am clear
2. i am clear
3. I am still not clear, even though it is not mandatory to have 2 separate ADLS Gen2 storage accounts but let's assume in our case we have made a decision to have 2 separate ADLS Gen2.
One - UC Catalog
Second - To store data (Catalog/schema/tables)
In multi subscription environment where UC Path (ADLS Gen2Storage account) should be hosted ?in Prod subscription or Non-Prod subscription? or does it matter ? as long as all strict access control in place can we host in either subscription ? since we are starting from scratch would like to get some feedback on best practice.
03-25-2023 10:59 PM
Hi @Ashok Zubrewar
Your input matters! Help our community thrive by coming back and marking the most helpful and accurate answers. Together, we can make a difference!
Thanks and Regards
03-29-2023 04:37 AM
@Ashok Zubrewar coming to your 3 rd question, if you are using any external tables then non uc ADLS GEN 2 is mandatory, you can not use UC ADLS GEN2. as it hosts metadata and managed table data. there is no restriction in terms of your external buckets (ADLS GEN2 Storage regions should be on same region as UC ADLS GEN2 , but to avoid performance Issues best way is to have in same region). once you configure your non UC ADLS GEN2 and add storage credential and storage location, you should be good to access your ADLS GEN2 in UC, but currently we need to remember we have limitations in UC for external tables ( won't support OPTIMIZATION), databricks recommends to use managed tables . but based on use case we need, as mostly for analytics purpose we will be using external tables, we may not avoid that
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.
If there isn’t a group near you, start one and help create a community that brings people together.
Request a New Group