<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Databricks recommended Approach to load data vault 2.0 in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/databricks-recommended-approach-to-load-data-vault-2-0/m-p/127415#M47955</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;Please share the recommended approach to load Data Vault 2.0 .&lt;/P&gt;&lt;P&gt;Overview&lt;/P&gt;&lt;P&gt;1. Current Landscape -&amp;nbsp; Lakehouse (Bronze/Silver/Gold)&lt;/P&gt;&lt;P&gt;2. Data Vault 2.0 to be created in Silver layer.&lt;/P&gt;&lt;P&gt;3. Bronze data will be made available in delta table using ETL&amp;nbsp;&lt;/P&gt;&lt;P&gt;Questions&lt;/P&gt;&lt;P&gt;1. What should be the strategy to load the data from Bronze to Silver layer&amp;nbsp;&lt;/P&gt;&lt;P&gt;2. Approach to adopt to parallelize the load&amp;nbsp; while loading the data vault 2.0 tables.&lt;/P&gt;&lt;P&gt;3.How to pick the incremental the data from delta tables while loading Silver layer.&lt;/P&gt;&lt;P&gt;4a)How can we reuse the Notebooks to load the Silver layer (Data Vault 2.0) for other source system.&lt;/P&gt;&lt;P&gt;b)Where should the logic to be encapsulated while populating hub/link/satellite table for every entity . ex views&lt;/P&gt;&lt;P&gt;c)How to configure the DQ Rules for every entity / tables&lt;/P&gt;&lt;P&gt;5. What type of meta data driven approach can be adopted.&lt;/P&gt;&lt;P&gt;6. What should be convention to adopt for Unity Catalog&amp;nbsp;&lt;/P&gt;&lt;P&gt;ex - Unity Catalog Name - Bronze , Schema Name- Source System Name, Tables - Tables for every source.&lt;/P&gt;&lt;P&gt;Unity Catalog Name - Silver , Schema - what need to be provided . Tables - Data Vault 2.0 tables.&lt;/P&gt;&lt;P&gt;7. Exception Handling / Reprocessing from the point of failure / Auditing&lt;/P&gt;&lt;P&gt;8. Cluster Configuration (All purpose Cluster ) / Warehouse Cluster&lt;/P&gt;</description>
    <pubDate>Tue, 05 Aug 2025 06:57:21 GMT</pubDate>
    <dc:creator>Subha0920</dc:creator>
    <dc:date>2025-08-05T06:57:21Z</dc:date>
    <item>
      <title>Databricks recommended Approach to load data vault 2.0</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-recommended-approach-to-load-data-vault-2-0/m-p/127415#M47955</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;Please share the recommended approach to load Data Vault 2.0 .&lt;/P&gt;&lt;P&gt;Overview&lt;/P&gt;&lt;P&gt;1. Current Landscape -&amp;nbsp; Lakehouse (Bronze/Silver/Gold)&lt;/P&gt;&lt;P&gt;2. Data Vault 2.0 to be created in Silver layer.&lt;/P&gt;&lt;P&gt;3. Bronze data will be made available in delta table using ETL&amp;nbsp;&lt;/P&gt;&lt;P&gt;Questions&lt;/P&gt;&lt;P&gt;1. What should be the strategy to load the data from Bronze to Silver layer&amp;nbsp;&lt;/P&gt;&lt;P&gt;2. Approach to adopt to parallelize the load&amp;nbsp; while loading the data vault 2.0 tables.&lt;/P&gt;&lt;P&gt;3.How to pick the incremental the data from delta tables while loading Silver layer.&lt;/P&gt;&lt;P&gt;4a)How can we reuse the Notebooks to load the Silver layer (Data Vault 2.0) for other source system.&lt;/P&gt;&lt;P&gt;b)Where should the logic to be encapsulated while populating hub/link/satellite table for every entity . ex views&lt;/P&gt;&lt;P&gt;c)How to configure the DQ Rules for every entity / tables&lt;/P&gt;&lt;P&gt;5. What type of meta data driven approach can be adopted.&lt;/P&gt;&lt;P&gt;6. What should be convention to adopt for Unity Catalog&amp;nbsp;&lt;/P&gt;&lt;P&gt;ex - Unity Catalog Name - Bronze , Schema Name- Source System Name, Tables - Tables for every source.&lt;/P&gt;&lt;P&gt;Unity Catalog Name - Silver , Schema - what need to be provided . Tables - Data Vault 2.0 tables.&lt;/P&gt;&lt;P&gt;7. Exception Handling / Reprocessing from the point of failure / Auditing&lt;/P&gt;&lt;P&gt;8. Cluster Configuration (All purpose Cluster ) / Warehouse Cluster&lt;/P&gt;</description>
      <pubDate>Tue, 05 Aug 2025 06:57:21 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-recommended-approach-to-load-data-vault-2-0/m-p/127415#M47955</guid>
      <dc:creator>Subha0920</dc:creator>
      <dc:date>2025-08-05T06:57:21Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks recommended Approach to load data vault 2.0</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-recommended-approach-to-load-data-vault-2-0/m-p/127504#M47988</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/178017"&gt;@Subha0920&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;I have implemented previously data vault 2.0 in Databricks, even though it can be too long to mention all the details&lt;BR /&gt;of the implementation, but what helped us to get a lot of insights are these resource by Microsoft:&lt;BR /&gt;&lt;A href="https://techcommunity.microsoft.com/blog/analyticsonazure/data-vault-2-0-using-databricks-lakehouse-architecture-on-azure/3797493" target="_blank"&gt;Data Vault 2.0 using Databricks Lakehouse Architecture on Azure&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.databricks.com/blog/2022/06/24/prescriptive-guidance-for-implementing-a-data-vault-model-on-the-databricks-lakehouse-platform.html" target="_blank"&gt;What’s a Data Vault and How to Implement It on the Databricks Lakehouse Platform - The Databricks Blog&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;They may be a bit old articles, but they are quite a helpful ones.&lt;BR /&gt;&lt;BR /&gt;Hope that helps a bit.&lt;BR /&gt;&lt;BR /&gt;Best, Ilir&lt;/P&gt;</description>
      <pubDate>Tue, 05 Aug 2025 21:06:53 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-recommended-approach-to-load-data-vault-2-0/m-p/127504#M47988</guid>
      <dc:creator>ilir_nuredini</dc:creator>
      <dc:date>2025-08-05T21:06:53Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks recommended Approach to load data vault 2.0</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-recommended-approach-to-load-data-vault-2-0/m-p/127530#M47997</link>
      <description>&lt;P&gt;Thanks&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/102399"&gt;@ilir_nuredini&lt;/a&gt;&amp;nbsp;. It is helpful.&lt;/P&gt;&lt;P&gt;&amp;nbsp; If you can share the details for the above questions, that will assist to plan further.&lt;/P&gt;</description>
      <pubDate>Wed, 06 Aug 2025 04:24:30 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-recommended-approach-to-load-data-vault-2-0/m-p/127530#M47997</guid>
      <dc:creator>Subha0920</dc:creator>
      <dc:date>2025-08-06T04:24:30Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks recommended Approach to load data vault 2.0</title>
      <link>https://community.databricks.com/t5/data-engineering/databricks-recommended-approach-to-load-data-vault-2-0/m-p/127654#M48049</link>
      <description>&lt;P&gt;Kindly provide your valuable input and suggestion for the above questions&lt;/P&gt;</description>
      <pubDate>Thu, 07 Aug 2025 10:21:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/databricks-recommended-approach-to-load-data-vault-2-0/m-p/127654#M48049</guid>
      <dc:creator>Subha0920</dc:creator>
      <dc:date>2025-08-07T10:21:07Z</dc:date>
    </item>
  </channel>
</rss>

