<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Most suitable Data Promotion orchestration for multi-tenant data lake in Databricks in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/most-suitable-data-promotion-orchestration-for-multi-tenant-data/m-p/134509#M50148</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;SPAN&gt;sarahbhord !!!&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Thanks very much for the useful reply, it really helps understanding the best approach to follow. In my case I have roughly the following architecture:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="mbanxp_0-1760087606665.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/20650iD61EA172D074ABF7/image-size/medium?v=v2&amp;amp;px=400" role="button" title="mbanxp_0-1760087606665.png" alt="mbanxp_0-1760087606665.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Based on the the approach of&amp;nbsp;&lt;STRONG&gt;independent pipelines per table for all clients, &lt;/STRONG&gt;what would be your recommendation ?&lt;/P&gt;</description>
    <pubDate>Fri, 10 Oct 2025 09:25:49 GMT</pubDate>
    <dc:creator>mbanxp</dc:creator>
    <dc:date>2025-10-10T09:25:49Z</dc:date>
    <item>
      <title>Most suitable Data Promotion orchestration for multi-tenant data lake in Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/most-suitable-data-promotion-orchestration-for-multi-tenant-data/m-p/134086#M50014</link>
      <description>&lt;P&gt;Hi there !!! I would like to find the most suitable orchestration process to promote data between medallion layers I need to solve the following key architectural decision for scaling my multi-tenant data lake in Databricks.&lt;/P&gt;&lt;P&gt;My setup:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;STRONG&gt;Independent medallion architecture per client&lt;/STRONG&gt; (Landing → Bronze → Silver → Gold per client)&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Identical schema across all clients&lt;/STRONG&gt; (same data model)&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Multiple tables per layer&lt;/STRONG&gt; (each with specific transformations)&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;What would be the best approach in Databricks to orchestrate the data promotion between layers ?&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Independent pipelines per client for all tables&lt;/LI&gt;&lt;LI&gt;Independent pipelines per client and table&lt;/LI&gt;&lt;LI&gt;Independent pipelines per table for all clients&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Thanks in advance.&lt;/P&gt;</description>
      <pubDate>Tue, 07 Oct 2025 14:36:52 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/most-suitable-data-promotion-orchestration-for-multi-tenant-data/m-p/134086#M50014</guid>
      <dc:creator>mbanxp</dc:creator>
      <dc:date>2025-10-07T14:36:52Z</dc:date>
    </item>
    <item>
      <title>Re: Most suitable Data Promotion orchestration for multi-tenant data lake in Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/most-suitable-data-promotion-orchestration-for-multi-tenant-data/m-p/134219#M50054</link>
      <description>&lt;P class="qt3gz91 paragraph"&gt;Hey mbanxp!&lt;/P&gt;
&lt;P class="qt3gz91 paragraph"&gt;The most scalable and maintainable orchestration pattern for multi-tenant medallion architectures in Databricks is to build &lt;STRONG&gt;independent pipelines per table for all clients&lt;/STRONG&gt;, with each pipeline parameterized by client/tenant.&lt;/P&gt;
&lt;P class="qt3gz91 paragraph"&gt;&lt;STRONG&gt;Why this approach?&lt;/STRONG&gt;&lt;/P&gt;
&lt;UL class="qt3gz97 qt3gz92"&gt;
&lt;LI class="qt3gz9a"&gt;Centralizes business logic for each table (reduces code duplication).&lt;/LI&gt;
&lt;LI class="qt3gz9a"&gt;Makes onboarding new clients easy—just add configuration, don't duplicate pipeline code.&lt;/LI&gt;
&lt;LI class="qt3gz9a"&gt;Scales well as data and client count grow.&lt;/LI&gt;
&lt;LI class="qt3gz9a"&gt;Fits perfectly with Databricks Workflows and Delta Live Tables (DLT), which support parameterized, multi-tenant pipelines and robust orchestration.&lt;/LI&gt;
&lt;LI class="qt3gz9a"&gt;Unity Catalog provides strong data isolation and governance at the client level, even when sharing pipelines.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P class="qt3gz91 paragraph"&gt;&lt;STRONG&gt;Platform Features Enabling This Pattern:&lt;/STRONG&gt;&lt;/P&gt;
&lt;UL class="qt3gz97 qt3gz92"&gt;
&lt;LI class="qt3gz9a"&gt;&lt;STRONG&gt;Databricks Workflows:&lt;/STRONG&gt; Orchestrate parameterized, multi-tenant pipelines.&lt;/LI&gt;
&lt;LI class="qt3gz9a"&gt;&lt;STRONG&gt;Delta Live Tables (DLT):&lt;/STRONG&gt; Declaratively define ETL flows partitioned by client.&lt;/LI&gt;
&lt;LI class="qt3gz9a"&gt;&lt;STRONG&gt;Unity Catalog:&lt;/STRONG&gt; Fine-grained access control and catalog/schema separation per client.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P class="qt3gz91 paragraph"&gt;&lt;STRONG&gt;Extra tips:&lt;/STRONG&gt;&lt;BR /&gt;Leverage partitioning and schema separation by client within each layer, and use centralized pipelines to tune job frequencies and resource usage.&lt;/P&gt;
&lt;P class="qt3gz91 paragraph"&gt;&lt;STRONG&gt;Summary:&lt;/STRONG&gt;&lt;BR /&gt;Organizing by per-table, multi-tenant pipelines is Databricks’ best practice for efficient, standardized, and easily-governed medallion data flows at scale.&lt;/P&gt;
&lt;P class="qt3gz91 paragraph"&gt;I hope this helps.&lt;/P&gt;
&lt;P class="qt3gz91 paragraph"&gt;Best,&lt;/P&gt;
&lt;P class="qt3gz91 paragraph"&gt;Sarah&lt;/P&gt;</description>
      <pubDate>Wed, 08 Oct 2025 13:09:08 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/most-suitable-data-promotion-orchestration-for-multi-tenant-data/m-p/134219#M50054</guid>
      <dc:creator>sarahbhord</dc:creator>
      <dc:date>2025-10-08T13:09:08Z</dc:date>
    </item>
    <item>
      <title>Re: Most suitable Data Promotion orchestration for multi-tenant data lake in Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/most-suitable-data-promotion-orchestration-for-multi-tenant-data/m-p/134509#M50148</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;SPAN&gt;sarahbhord !!!&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Thanks very much for the useful reply, it really helps understanding the best approach to follow. In my case I have roughly the following architecture:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="mbanxp_0-1760087606665.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/20650iD61EA172D074ABF7/image-size/medium?v=v2&amp;amp;px=400" role="button" title="mbanxp_0-1760087606665.png" alt="mbanxp_0-1760087606665.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Based on the the approach of&amp;nbsp;&lt;STRONG&gt;independent pipelines per table for all clients, &lt;/STRONG&gt;what would be your recommendation ?&lt;/P&gt;</description>
      <pubDate>Fri, 10 Oct 2025 09:25:49 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/most-suitable-data-promotion-orchestration-for-multi-tenant-data/m-p/134509#M50148</guid>
      <dc:creator>mbanxp</dc:creator>
      <dc:date>2025-10-10T09:25:49Z</dc:date>
    </item>
  </channel>
</rss>

