cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Unity catalog accounts

Phani1
Valued Contributor

Hi Team,

We have the requirement to have metadata(Unity catalog) in one AWS account and data storage(Delta tables under data) in another account, is it possible to do that , Do we face any technical/Security issue??

1 REPLY 1

Kaniz
Community Manager
Community Manager

Hi @Phani1, Letโ€™s address your requirement regarding Unity Catalog metadata and Delta tables storage in separate AWS accounts.

 

Unity Catalog Accounts:

  • Unity Catalog (UC) is a fine-grained governance solution for data and AI on the Databricks Lakehouse.
  • UC manages metadata, lineage information, and other governance aspects.
  • Each UC metastore requires a Storage Account container path where UC-managed table data is stored.
  • UC delta tablesโ€™ table metadata reside inside this storage account, specifically in a directory named __unitystorage.
  • Other metadata, such as UC lineage info, is stored in the regional Control Plane1.

Separate AWS Accounts:

  • Technically, it is possible to have metadata (Unity catalog) in one AWS account and data storage (Delta tables) in another.
  • However, there are considerations related to security, networking, and access control:
    • Security: Ensure proper IAM (Identity and Access Management) policies and permissions between the two accounts.
    • Networking: Set up appropriate VPC (Virtual Private Cloud) peering or transit gateway connections.
    • Access Control: Define fine-grained access controls to restrict cross-account access.
  • Best Practice: Follow Unity Catalog best practices to simplify security and governance of your data2.

Technical/Security Issues:

  • While technically feasible, itโ€™s essential to evaluate the following:
    • Data Consistency: Ensure consistency between metadata and data storage.
    • Latency: Cross-account communication may introduce latency.
    • Auditability: Maintain audit logs for cross-account interactions.
    • Compliance: Consider any regulatory or compliance requirements.
  • Collaborate with your AWS and Databricks teams to address these concerns.
Join 100K+ Data Experts: Register Now & Grow with Us!

Excited to expand your horizons with us? Click here to Register and begin your journey to success!

Already a member? Login and join your local regional user group! If there isn’t one near you, fill out this form and we’ll create one for you to join!