<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Best practice:Using Databricks managed storage vs customer‑owned ADLS for enterprise production in Administration &amp; Architecture</title>
    <link>https://community.databricks.com/t5/administration-architecture/best-practice-using-databricks-managed-storage-vs-customer-owned/m-p/152044#M5068</link>
    <description>&lt;P&gt;Hey, your research is correct. The DBFS is for logs and inner databricks workings, not for your production data. We would recommend having your own ADLS Gen2 storage container for all your production data. The DBFS is available to all users and has no governance over it. You would need to set up the ADLS storage container and then register it as Managed Storage.&lt;/P&gt;
&lt;P&gt;You would then want to have managed tables storing the data in the ADLS Gen2 storage. Important to note these are not the same as managed storage. There is a choice between managed and external tables but we tend to recommend managed tables as it has improved security and a lot of out of the box optimisation. If you have other tools that need to access this data they can do this via the API.&lt;/P&gt;
&lt;P&gt;I hope this helps. Let me know if anything doesn't make sense.&lt;/P&gt;
&lt;P&gt;Here are a couple of docs with more info&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/best-practices" target="_blank"&gt;https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/best-practices&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/deployment-guide/storage" target="_blank"&gt;https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/deployment-guide/storage&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks,&lt;BR /&gt;&lt;BR /&gt;Emma&lt;/P&gt;</description>
    <pubDate>Wed, 25 Mar 2026 16:14:24 GMT</pubDate>
    <dc:creator>emma_s</dc:creator>
    <dc:date>2026-03-25T16:14:24Z</dc:date>
    <item>
      <title>Best practice:Using Databricks managed storage vs customer‑owned ADLS for enterprise production data</title>
      <link>https://community.databricks.com/t5/administration-architecture/best-practice-using-databricks-managed-storage-vs-customer-owned/m-p/152000#M5065</link>
      <description>&lt;DIV&gt;&lt;P&gt;We are currently setting up Azure Databricks for enterprise analytics and wanted to validate our storage architecture against Databricks best practices.&lt;/P&gt;&lt;P&gt;Today, we are ingesting data directly from external enterprise sources (Oracle DB, SQL Server, etc.) using Databricks connectors, and the data is landing in the &lt;STRONG&gt;Databricks managed storage account (DBFS / ADLS in the managed resource group)&lt;/STRONG&gt; created along with the workspace.&lt;/P&gt;&lt;P&gt;We do not currently have a &lt;STRONG&gt;customer‑owned external ADLS Gen2&lt;/STRONG&gt; configured as a centralized enterprise data lake.&lt;/P&gt;&lt;P&gt;I understand from Databricks documentation that managed storage is primarily intended for &lt;STRONG&gt;workspace/internal use&lt;/STRONG&gt; (logs, libraries, temp data, internal tables), and that for &lt;STRONG&gt;production and enterprise data&lt;/STRONG&gt;, Databricks recommends using a &lt;STRONG&gt;customer‑owned ADLS Gen2&lt;/STRONG&gt; accessed via &lt;STRONG&gt;Unity Catalog external locations&lt;/STRONG&gt;.&lt;/P&gt;&lt;P&gt;My questions are:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Is it recommended to store enterprise production data from external systems in Databricks managed storage (even when it is ADLS Gen2 with HNS enabled)?&lt;/LI&gt;&lt;LI&gt;For enterprise‑scale deployments with multiple workspaces and downstream consumers (ADF, Fabric), is a &lt;STRONG&gt;single customer‑owned ADLS Gen2&lt;/STRONG&gt; the recommended system of record?&lt;/LI&gt;&lt;LI&gt;Would Databricks consider managed storage an acceptable long‑term data lake, or should it be treated strictly as workspace‑scoped/internal storage?&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;Any clarification or confirmation from Databricks engineers or the community would be greatly appreciated.&lt;/P&gt;&lt;/DIV&gt;</description>
      <pubDate>Wed, 25 Mar 2026 11:43:20 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/best-practice-using-databricks-managed-storage-vs-customer-owned/m-p/152000#M5065</guid>
      <dc:creator>LokeshChikuru</dc:creator>
      <dc:date>2026-03-25T11:43:20Z</dc:date>
    </item>
    <item>
      <title>Re: Best practice:Using Databricks managed storage vs customer‑owned ADLS for enterprise production</title>
      <link>https://community.databricks.com/t5/administration-architecture/best-practice-using-databricks-managed-storage-vs-customer-owned/m-p/152044#M5068</link>
      <description>&lt;P&gt;Hey, your research is correct. The DBFS is for logs and inner databricks workings, not for your production data. We would recommend having your own ADLS Gen2 storage container for all your production data. The DBFS is available to all users and has no governance over it. You would need to set up the ADLS storage container and then register it as Managed Storage.&lt;/P&gt;
&lt;P&gt;You would then want to have managed tables storing the data in the ADLS Gen2 storage. Important to note these are not the same as managed storage. There is a choice between managed and external tables but we tend to recommend managed tables as it has improved security and a lot of out of the box optimisation. If you have other tools that need to access this data they can do this via the API.&lt;/P&gt;
&lt;P&gt;I hope this helps. Let me know if anything doesn't make sense.&lt;/P&gt;
&lt;P&gt;Here are a couple of docs with more info&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/best-practices" target="_blank"&gt;https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/best-practices&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/deployment-guide/storage" target="_blank"&gt;https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/deployment-guide/storage&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks,&lt;BR /&gt;&lt;BR /&gt;Emma&lt;/P&gt;</description>
      <pubDate>Wed, 25 Mar 2026 16:14:24 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/best-practice-using-databricks-managed-storage-vs-customer-owned/m-p/152044#M5068</guid>
      <dc:creator>emma_s</dc:creator>
      <dc:date>2026-03-25T16:14:24Z</dc:date>
    </item>
    <item>
      <title>Re: Best practice:Using Databricks managed storage vs customer‑owned ADLS for enterprise production</title>
      <link>https://community.databricks.com/t5/administration-architecture/best-practice-using-databricks-managed-storage-vs-customer-owned/m-p/152047#M5069</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/176516"&gt;@emma_s&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for the update and appreciate the immediate response.&lt;/P&gt;</description>
      <pubDate>Wed, 25 Mar 2026 16:36:38 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/best-practice-using-databricks-managed-storage-vs-customer-owned/m-p/152047#M5069</guid>
      <dc:creator>LokeshChikuru</dc:creator>
      <dc:date>2026-03-25T16:36:38Z</dc:date>
    </item>
  </channel>
</rss>

