<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Multi-tenant recommendation system (Machine learning) in Machine Learning</title>
    <link>https://community.databricks.com/t5/machine-learning/multi-tenant-recommendation-system-machine-learning/m-p/136532#M4384</link>
    <pubDate>Wed, 29 Oct 2025 10:29:40 GMT</pubDate>
    <dc:creator>Louis_Frolio</dc:creator>
    <dc:date>2025-10-29T10:29:40Z</dc:date>
    <item>
      <title>Multi-tenant recommendation system (Machine learning)</title>
      <link>https://community.databricks.com/t5/machine-learning/multi-tenant-recommendation-system-machine-learning/m-p/88817#M3642</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I am looking to build a multi-tenant machine learning recommender system in Azure Databricks. The idea is to have a single shared model, where each tenant can use the same model to train on their own unique dataset. Essentially, while the model architecture remains the same for all tenants, the data used for training and inference would be specific to each one. Any resources that I can refer or best practices for implementing such a system? Thank you!&lt;/P&gt;</description>
      <pubDate>Fri, 06 Sep 2024 04:27:46 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/multi-tenant-recommendation-system-machine-learning/m-p/88817#M3642</guid>
      <dc:creator>Kasen</dc:creator>
      <dc:date>2024-09-06T04:27:46Z</dc:date>
    </item>
    <item>
      <title>Re: Multi-tenant recommendation system (Machine learning)</title>
      <link>https://community.databricks.com/t5/machine-learning/multi-tenant-recommendation-system-machine-learning/m-p/136532#M4384</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/60347"&gt;@Kasen&lt;/a&gt;&amp;nbsp;, sorry for the delayed response.&amp;nbsp; Here are some things to consider regarding your question.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;DIV class="paragraph"&gt;Azure Databricks is well-suited for a shared-architecture, tenant‑isolated recommender system. Below is a pragmatic blueprint, the isolation model options, and concrete best practices with Databricks-native services you can adopt.&lt;/DIV&gt;
&lt;DIV class="paragraph"&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;H3 class="paragraph"&gt;Recommended multi-tenant architecture on Azure Databricks&lt;/H3&gt;
&lt;UL&gt;
&lt;LI class="paragraph"&gt;Use &lt;STRONG&gt;Unity Catalog (UC)&lt;/STRONG&gt; as the governance backbone with a single metastore per region and isolate tenants at the catalog or schema level (preferred over multiple metastores).&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Bind catalogs and storage credentials to specific workspaces if you need environment isolation (e.g., dev vs prod and tenant-specific endpoints) while retaining centralized governance across the region.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Run shared compute safely with &lt;STRONG&gt;Lakeguard&lt;/STRONG&gt; to enforce data governance at runtime on multi-user clusters and SQL warehouses; this lets you share cost-efficient compute without relaxing isolation controls.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;For cost attribution and noisy-neighbor avoidance, prefer &lt;STRONG&gt;compute-per-tenant&lt;/STRONG&gt; (dedicated job clusters or per-tenant serverless concurrency) even if data governance is centralized in UC.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;H3 class="paragraph"&gt;Isolation controls and governance&lt;/H3&gt;
&lt;UL&gt;
&lt;LI class="paragraph"&gt;Use &lt;STRONG&gt;catalog-per-tenant&lt;/STRONG&gt; (preferred) or &lt;STRONG&gt;schema-per-tenant&lt;/STRONG&gt; in a shared workspace; both patterns give strong isolation with simpler operations than workspace-per-tenant (250 workspace hard limit).&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Apply &lt;STRONG&gt;workspace–catalog binding&lt;/STRONG&gt; and &lt;STRONG&gt;credential binding to workspaces&lt;/STRONG&gt; to constrain where production data is accessible and to segment endpoints and identities per environment or tenant.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Leverage &lt;STRONG&gt;row/column‑level security and ABAC&lt;/STRONG&gt; for finer-grained controls where needed; UC supports policy-based filtering and masking across governed tables.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;/UL&gt;
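&lt;DIV class="paragraph"&gt;A minimal sketch of the catalog-per-tenant pattern above. The naming scheme (a tenant_ID catalog with features and models schemas) and the one-group-per-tenant layout are assumptions for illustration, not a Databricks convention. The helper only builds Unity Catalog SQL strings; executing them (for example with spark.sql in a notebook) is left to the caller.&lt;/DIV&gt;

```python
def tenant_catalog_sql(tenant_id, group):
    """Return Unity Catalog SQL statements that set up one catalog for one tenant."""
    # Allow only letters, digits and underscores before interpolating into SQL.
    if not tenant_id.replace("_", "").isalnum():
        raise ValueError(f"unsafe tenant id: {tenant_id!r}")
    # The group name is quoted with backticks, so it must not contain one itself.
    if "`" in group:
        raise ValueError(f"unsafe group name: {group!r}")
    catalog = f"tenant_{tenant_id}"  # hypothetical naming scheme
    return [
        f"CREATE CATALOG IF NOT EXISTS {catalog}",
        f"CREATE SCHEMA IF NOT EXISTS {catalog}.features",
        f"CREATE SCHEMA IF NOT EXISTS {catalog}.models",
        # Privileges granted on a catalog are inherited by its schemas and tables.
        f"GRANT USE CATALOG ON CATALOG {catalog} TO `{group}`",
        f"GRANT USE SCHEMA, SELECT ON CATALOG {catalog} TO `{group}`",
    ]


for statement in tenant_catalog_sql("acme", "acme-data-scientists"):
    print(statement)
```

&lt;DIV class="paragraph"&gt;Because each tenant's group is granted access only on its own catalog, a tenant cannot read another tenant's tables unless someone grants that explicitly.&lt;/DIV&gt;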
&lt;H3 class="paragraph"&gt;Feature engineering and serving&lt;/H3&gt;
&lt;UL&gt;
&lt;LI class="paragraph"&gt;Use &lt;STRONG&gt;Databricks Feature Store in Unity Catalog&lt;/STRONG&gt; to register feature tables and models with governance, lineage, and cross-workspace discovery; training automatically tracks feature lineage, and inference can auto‑lookup features to prevent training/serving skew.&lt;/LI&gt;
&lt;LI&gt;For low-latency online inference, enable &lt;STRONG&gt;Online Feature Stores (Lakebase‑powered)&lt;/STRONG&gt; and publish per‑tenant feature tables (latest values or full time series as needed).&lt;/LI&gt;
&lt;/UL&gt;
&lt;H3 class="paragraph"&gt;Model lifecycle per tenant&lt;/H3&gt;
&lt;UL&gt;
&lt;LI class="paragraph"&gt;Keep a single &lt;STRONG&gt;model architecture&lt;/STRONG&gt; (e.g., Two‑Tower retrieval plus DLRM re‑ranking) and register each tenant’s model/version in UC under that tenant’s catalog/schema using MLflow.&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;For scalable training, use &lt;STRONG&gt;TorchDistributor&lt;/STRONG&gt; with &lt;STRONG&gt;Mosaic StreamingDataset&lt;/STRONG&gt; (and TorchRec for sharded embeddings) to handle millions of users/items efficiently on multi‑GPU clusters/serverless GPU.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;If you’re earlier in the journey, Databricks solution accelerators provide &lt;STRONG&gt;wide‑and‑deep, ALS, market‑basket, image similarity&lt;/STRONG&gt; notebooks to bootstrap tenant builds on a common codebase.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;/UL&gt;
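&lt;DIV class="paragraph"&gt;To illustrate "same architecture, different weights", here is a small sketch that derives per-tenant model names. Unity Catalog model names have three levels (catalog.schema.model); the tenant_ID catalog prefix, the models schema, and the two stage labels are assumptions made for this example.&lt;/DIV&gt;

```python
# Stage names are illustrative labels for the two models mentioned above.
STAGES = ("two_tower_retrieval", "dlrm_ranker")


def tenant_model_names(tenant_id, schema="models"):
    """Map each stage to a three-level Unity Catalog name: catalog.schema.model."""
    catalog = f"tenant_{tenant_id}"  # hypothetical naming scheme
    return {stage: f"{catalog}.{schema}.{stage}" for stage in STAGES}


names = tenant_model_names("acme")
print(names["dlrm_ranker"])  # tenant_acme.models.dlrm_ranker

# On Databricks, registration would then look roughly like this (not run here;
# model_uri is a placeholder for the URI of a logged MLflow model):
#   import mlflow
#   mlflow.set_registry_uri("databricks-uc")
#   mlflow.register_model(model_uri, names["dlrm_ranker"])
```

&lt;DIV class="paragraph"&gt;The training code stays identical across tenants; only the tenant id, and therefore the input tables and the registered model name, changes per run.&lt;/DIV&gt;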
&lt;H3 class="paragraph"&gt;Inference, A/B testing, and monitoring&lt;/H3&gt;
&lt;UL&gt;
&lt;LI class="paragraph"&gt;Serve tenant models with &lt;STRONG&gt;Mosaic AI Model Serving&lt;/STRONG&gt;. You can either deploy one endpoint per tenant or use a &lt;STRONG&gt;multi‑model endpoint&lt;/STRONG&gt; (served_entities) with &lt;STRONG&gt;traffic splitting&lt;/STRONG&gt; to route per‑tenant traffic or run challenger vs current for A/B tests.&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;For high‑QPS/low‑latency tenants, enable &lt;STRONG&gt;route optimization&lt;/STRONG&gt; (dedicated URL + OAuth) to reduce overhead latency and raise QPS versus standard endpoints.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Turn on &lt;STRONG&gt;AI Gateway usage tracking&lt;/STRONG&gt; and &lt;STRONG&gt;inference tables&lt;/STRONG&gt; for each endpoint to log requests/responses to a UC Delta table for evaluation, drift monitoring, and corpus creation for fine‑tuning or re‑rankers.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Apply &lt;STRONG&gt;rate limits&lt;/STRONG&gt; (endpoint, user, group) to protect shared capacity across tenants; monitor limits and regions with the Serving limits/regions guide.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;/UL&gt;
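&lt;DIV class="paragraph"&gt;A hedged sketch of a challenger vs current setup on one tenant endpoint. It only builds the JSON body; the field names (served_entities, entity_name, entity_version, traffic_config, routes) follow the serving endpoints REST API as I understand it, so verify them against the current docs. The endpoint name, model name, and workload size are placeholder values.&lt;/DIV&gt;

```python
import json


def endpoint_config(endpoint_name, model_name, versions_to_traffic):
    """Build a request body for a serving endpoint that splits traffic by version."""
    # The percentages across all routes have to add up to exactly 100.
    if sum(versions_to_traffic.values()) != 100:
        raise ValueError("traffic percentages must sum to 100")
    served, routes = [], []
    for version, percent in versions_to_traffic.items():
        served_name = f"v{version}"
        served.append({
            "name": served_name,
            "entity_name": model_name,       # three-level UC model name
            "entity_version": str(version),
            "workload_size": "Small",        # placeholder sizing
            "scale_to_zero_enabled": True,
        })
        routes.append({"served_model_name": served_name, "traffic_percentage": percent})
    return {
        "name": endpoint_name,
        "config": {"served_entities": served, "traffic_config": {"routes": routes}},
    }


# Current version 3 keeps 90 percent of requests, challenger version 4 gets 10.
body = endpoint_config("acme-recs", "tenant_acme.models.dlrm_ranker", {3: 90, 4: 10})
print(json.dumps(body, indent=2))
```

&lt;DIV class="paragraph"&gt;Promoting the challenger is then a config update that shifts the percentages, without creating a new endpoint or changing the URL clients call.&lt;/DIV&gt;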
&lt;H3 class="paragraph"&gt;Cross-region or cross-organization sharing&lt;/H3&gt;
&lt;UL&gt;
&lt;LI class="paragraph"&gt;Keep one UC metastore per region; share data across regions/orgs with &lt;STRONG&gt;Databricks‑to‑Databricks Delta Sharing&lt;/STRONG&gt; (foreign catalogs), noting lineage/ACLs don’t cross the share boundary and must be re‑applied in the recipient.&lt;/LI&gt;
&lt;LI&gt;If you need governed open sharing to external tools (e.g., Power BI), use &lt;STRONG&gt;OIDC federation&lt;/STRONG&gt; for Delta Sharing to avoid long‑lived bearer tokens and retain MFA/IdP policy enforcement.&lt;/LI&gt;
&lt;/UL&gt;
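&lt;DIV class="paragraph"&gt;The Databricks-to-Databricks flow above can be sketched as two lists of SQL statements, one for the provider metastore and one for the recipient. All object names and the sharing identifier are placeholders, and the statements reflect Delta Sharing SQL as I understand it, so treat this as a starting point rather than a verified script.&lt;/DIV&gt;

```python
def provider_share_sql(share, table, recipient, sharing_id):
    """Statements run in the provider metastore to expose one table."""
    return [
        f"CREATE SHARE IF NOT EXISTS {share}",
        f"ALTER SHARE {share} ADD TABLE {table}",
        # sharing_id is the recipient metastore's sharing identifier.
        f"CREATE RECIPIENT IF NOT EXISTS {recipient} USING ID '{sharing_id}'",
        f"GRANT SELECT ON SHARE {share} TO RECIPIENT {recipient}",
    ]


def recipient_catalog_sql(catalog, provider, share, group):
    """Statements run in the recipient metastore to mount the share and re-grant access."""
    return [
        f"CREATE CATALOG IF NOT EXISTS {catalog} USING SHARE {provider}.{share}",
        # Provider-side ACLs do not travel with the share, so grant again here.
        f"GRANT USE CATALOG, USE SCHEMA, SELECT ON CATALOG {catalog} TO `{group}`",
    ]


placeholder_id = "azure:westeurope:00000000-0000-0000-0000-000000000000"
for statement in provider_share_sql(
    "acme_features_share", "tenant_acme.features.user_features",
    "acme_eu_recipient", placeholder_id,
):
    print(statement)
for statement in recipient_catalog_sql(
    "acme_shared", "acme_provider", "acme_features_share", "acme-data-scientists"
):
    print(statement)
```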
&lt;H3 class="paragraph"&gt;Cost, quotas, and limits&lt;/H3&gt;
&lt;UL&gt;
&lt;LI class="paragraph"&gt;Treat &lt;STRONG&gt;compute as the attribution layer&lt;/STRONG&gt; (per‑tenant clusters/concurrency), and use serverless budget policies and tags for granular billing.&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Review UC quotas and request increases if needed (e.g., large numbers of catalogs, tables, or models per tenant) with the UC quota SOP.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Check &lt;STRONG&gt;Model Serving limits&lt;/STRONG&gt; (QPS, payload, concurrency, compliance) and route optimization requirements when designing endpoints at scale.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;/UL&gt;
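&lt;DIV class="paragraph"&gt;One way to make compute the attribution layer is to tag every per-tenant job cluster. The sketch below builds a cluster spec with custom_tags and a query that groups billed usage by that tag. The runtime version, node type, and tag keys are example values, and the billing column names (custom_tags, usage_quantity, sku_name in system.billing.usage) are stated as I understand them; check them in your workspace before relying on the query.&lt;/DIV&gt;

```python
def tenant_job_cluster(tenant_id, num_workers=2):
    """Return a job cluster spec whose tags identify the tenant for billing."""
    return {
        "spark_version": "15.4.x-scala2.12",  # example runtime version
        "node_type_id": "Standard_DS3_v2",    # example Azure VM type
        "num_workers": num_workers,
        # Tags propagate to usage records and can be grouped on later.
        "custom_tags": {"tenant": tenant_id, "workload": "recommender-training"},
    }


# Hedged example query; run it with spark.sql or in a SQL editor.
USAGE_BY_TENANT_SQL = """
SELECT custom_tags['tenant'] AS tenant, sku_name, SUM(usage_quantity) AS total_usage
FROM system.billing.usage
GROUP BY custom_tags['tenant'], sku_name
"""

spec = tenant_job_cluster("acme", num_workers=4)
print(spec["custom_tags"])
print(USAGE_BY_TENANT_SQL.strip())
```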
&lt;H3 class="paragraph"&gt;External access patterns and guardrails&lt;/H3&gt;
&lt;UL&gt;
&lt;LI class="paragraph"&gt;Avoid external systems writing to the same tables outside Databricks, as &lt;STRONG&gt;UC doesn’t govern direct object‑store writes&lt;/STRONG&gt;; use managed tables or explicit external‑volume patterns and credential vending to preserve consistency and security.&lt;/LI&gt;
&lt;/UL&gt;
&lt;H3 class="paragraph"&gt;Concrete blueprint (step-by-step)&lt;/H3&gt;
&lt;UL&gt;
&lt;LI class="paragraph"&gt;Identity and governance: Provision principals via SCIM at the account, enable UC, create a catalog per tenant, and bind catalogs/credentials to the correct workspaces and environments (dev/stg/prod).&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Data ingestion and isolation: Land each tenant’s data into their catalog/schema, applying RLS/CLS or ABAC where needed; use Lakeguard on shared compute clusters to enforce governance at runtime.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Feature engineering: Build tenant feature tables in UC, track lineage, and publish hot features to Online Feature Stores for low-latency inference.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Model training: Use common repos/notebooks with TorchDistributor/Mosaic Streaming for Two‑Tower retrieval and DLRM reranking; register each tenant’s model in UC (same architecture, different weights), tracked by MLflow.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Model serving: Create per-tenant endpoints or multi‑model endpoints with traffic split and route optimization; enable AI Gateway usage tracking, rate limits, and inference tables for monitoring and A/B testing.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Cross-region access (optional): Use D2D Delta Sharing and re‑grant ACLs in the recipient catalog; don’t attempt cross‑region metastore assignment.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;H3 class="paragraph"&gt;Resources to read and use&lt;/H3&gt;
&lt;UL&gt;
&lt;LI class="paragraph"&gt;What is Unity Catalog and Azure UC best practices (metastore per region, isolation at catalog/schema, workspace binding).&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Isolation in Multi‑Tenant Applications (catalog/schema vs workspace per tenant; compute-per-tenant guidance).&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Unity Catalog Lakeguard overview for multi-user governance on shared compute.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Feature Store in UC and Online Feature Stores (setup, auto feature lookup, online serving patterns).&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Model Serving docs: create endpoints, multi‑model traffic splitting, route optimization, usage tracking, inference tables, limits/regions.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Delta Sharing architecture and OIDC federation (cross‑region/org data sharing patterns).&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Recommender systems on Databricks: Two‑Tower, DLRM, wide‑and‑deep, ALS, accelerators and blogs.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;DIV class="paragraph"&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;DIV class="paragraph"&gt;Hope this helps, Louis.&lt;/DIV&gt;</description>
      <pubDate>Wed, 29 Oct 2025 10:29:40 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/multi-tenant-recommendation-system-machine-learning/m-p/136532#M4384</guid>
      <dc:creator>Louis_Frolio</dc:creator>
      <dc:date>2025-10-29T10:29:40Z</dc:date>
    </item>
  </channel>
</rss>

