<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Databricks delta sharing design in Administration &amp; Architecture</title>
    <link>https://community.databricks.com/t5/administration-architecture/databricks-delta-sharing-design/m-p/115361#M3244</link>
    <description>&lt;P class=""&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/124839"&gt;@noorbasha534&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P class=""&gt;Let me share a bit about our use case and how we’re handling Delta Sharing.&lt;BR /&gt;&lt;BR /&gt;Delta Sharing is indeed a simple and lightweight solution, and one of its main advantages is that it’s free to use. However, it still has several limitations, especially when it comes to performance. The biggest challenge we’ve faced is that it loads the entire table into memory before returning the data, which becomes unsustainable with large datasets or wide tables.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P class=""&gt;In our case, we had to implement a &lt;SPAN class=""&gt;custom workaround&lt;/SPAN&gt; to make it usable, filtering the data at a fine-grained level &lt;SPAN class=""&gt;before sharing&lt;/SPAN&gt;, because in some situations you simply can’t afford to wait 7 minutes for a table to load, especially when the consumer has limited resources and the request just times out or fails.&lt;/P&gt;&lt;P class=""&gt;&lt;STRONG&gt;My recomendations:&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P class=""&gt;Use Delta Sharing only for specific, low-volume scenarios&lt;SPAN class=""&gt;, where response time isn’t critical.&lt;/SPAN&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;UL&gt;&lt;LI&gt;&lt;P class=""&gt;In our case, we went with &lt;SPAN class=""&gt;one share per team&lt;/SPAN&gt;, as each team owns and controls both what they consume and what they expose.&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;For &lt;SPAN class=""&gt;external clients&lt;/SPAN&gt;, it’s always best to create a &lt;SPAN class=""&gt;dedicated share&lt;/SPAN&gt;.&lt;/P&gt;&lt;P class=""&gt;Regarding how many &lt;SPAN class=""&gt;shares&lt;/SPAN&gt; to create: that really depends on your use case and data sensitivity.&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;STRONG&gt;&lt;BR /&gt;If the data volume is high:&lt;/STRONG&gt;&lt;/P&gt;&lt;P class=""&gt;I strongly recommend &lt;SPAN class=""&gt;&lt;STRONG&gt;not using Delta Sharing&lt;/STRONG&gt;&lt;/SPAN&gt; for heavy use cases. Instead:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;Create a &lt;/SPAN&gt;dedicated Service Principal&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;Grant access to specific tables/views&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;Make the data available through a &lt;SPAN class=""&gt;SQL Warehouse (Classic or Serverless)&lt;/SPAN&gt; depending on your budget and performance needs.&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;Hope this helps clarify things! &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;Isi&lt;/P&gt;&lt;P class=""&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
    <pubDate>Sun, 13 Apr 2025 14:29:33 GMT</pubDate>
    <dc:creator>Isi</dc:creator>
    <dc:date>2025-04-13T14:29:33Z</dc:date>
    <item>
      <title>Databricks delta sharing design</title>
      <link>https://community.databricks.com/t5/administration-architecture/databricks-delta-sharing-design/m-p/114612#M3212</link>
      <description>&lt;P&gt;Dears&lt;/P&gt;&lt;P&gt;I wanted to have a mindshare around delta sharing - how do you decide how many shares to be created and share with other departments if you are maintaining an enterprise wide data warehoouse/lakehouse using Azure Databricks. I see from the documentation that there can be 1000 shares at metastore level ((though this limit is not fixed)).&lt;/P&gt;&lt;P&gt;per request from a department, or per department etc. etc.&lt;/P&gt;&lt;P&gt;I see the access is at a share level, and we cannot further restrict access with a share; so, may be it ends up in one share per request from departments, then we end up in many shares..&lt;/P&gt;&lt;P&gt;appreciate thoughts exchange here...&lt;/P&gt;</description>
      <pubDate>Sat, 05 Apr 2025 20:40:46 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/databricks-delta-sharing-design/m-p/114612#M3212</guid>
      <dc:creator>noorbasha534</dc:creator>
      <dc:date>2025-04-05T20:40:46Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks delta sharing design</title>
      <link>https://community.databricks.com/t5/administration-architecture/databricks-delta-sharing-design/m-p/115361#M3244</link>
      <description>&lt;P class=""&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/124839"&gt;@noorbasha534&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P class=""&gt;Let me share a bit about our use case and how we’re handling Delta Sharing.&lt;BR /&gt;&lt;BR /&gt;Delta Sharing is indeed a simple and lightweight solution, and one of its main advantages is that it’s free to use. However, it still has several limitations, especially when it comes to performance. The biggest challenge we’ve faced is that it loads the entire table into memory before returning the data, which becomes unsustainable with large datasets or wide tables.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P class=""&gt;In our case, we had to implement a &lt;SPAN class=""&gt;custom workaround&lt;/SPAN&gt; to make it usable, filtering the data at a fine-grained level &lt;SPAN class=""&gt;before sharing&lt;/SPAN&gt;, because in some situations you simply can’t afford to wait 7 minutes for a table to load, especially when the consumer has limited resources and the request just times out or fails.&lt;/P&gt;&lt;P class=""&gt;&lt;STRONG&gt;My recomendations:&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P class=""&gt;Use Delta Sharing only for specific, low-volume scenarios&lt;SPAN class=""&gt;, where response time isn’t critical.&lt;/SPAN&gt;&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;UL&gt;&lt;LI&gt;&lt;P class=""&gt;In our case, we went with &lt;SPAN class=""&gt;one share per team&lt;/SPAN&gt;, as each team owns and controls both what they consume and what they expose.&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;For &lt;SPAN class=""&gt;external clients&lt;/SPAN&gt;, it’s always best to create a &lt;SPAN class=""&gt;dedicated share&lt;/SPAN&gt;.&lt;/P&gt;&lt;P class=""&gt;Regarding how many &lt;SPAN class=""&gt;shares&lt;/SPAN&gt; to create: that really depends on your use case and data sensitivity.&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;STRONG&gt;&lt;BR /&gt;If the data volume is high:&lt;/STRONG&gt;&lt;/P&gt;&lt;P class=""&gt;I strongly recommend &lt;SPAN class=""&gt;&lt;STRONG&gt;not using Delta Sharing&lt;/STRONG&gt;&lt;/SPAN&gt; for heavy use cases. Instead:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;Create a &lt;/SPAN&gt;dedicated Service Principal&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;Grant access to specific tables/views&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P class=""&gt;Make the data available through a &lt;SPAN class=""&gt;SQL Warehouse (Classic or Serverless)&lt;/SPAN&gt; depending on your budget and performance needs.&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P class=""&gt;Hope this helps clarify things! &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;Isi&lt;/P&gt;&lt;P class=""&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 13 Apr 2025 14:29:33 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/databricks-delta-sharing-design/m-p/115361#M3244</guid>
      <dc:creator>Isi</dc:creator>
      <dc:date>2025-04-13T14:29:33Z</dc:date>
    </item>
  </channel>
</rss>

