<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Where exactly I should create Volume in a catalog? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81825#M36418</link>
    <description>&lt;P&gt;Hi hpant,&lt;/P&gt;
&lt;P&gt;You need to set up a new volume using a new external location (and potentially storage credential). Docs here:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/sql-ref-external-locations" target="_blank"&gt;https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/sql-ref-external-locations&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 05 Aug 2024 08:26:15 GMT</pubDate>
    <dc:creator>holly</dc:creator>
    <dc:date>2024-08-05T08:26:15Z</dc:date>
    <item>
      <title>Where exactly I should create Volume in a catalog?</title>
      <link>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81468#M36316</link>
      <description>&lt;P&gt;Currently my Databricks looks like this:&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="hpant_0-1722505474676.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/10077i976074A9FF4172EE/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400" role="button" title="hpant_0-1722505474676.png" alt="hpant_0-1722505474676.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;I want to create volume to access external location. Where exactly should I create it? Should a create new schema in "poe" catalog and create a volume inside it or create it in a existing schema? What is the best practice?&lt;/P&gt;</description>
      <pubDate>Thu, 01 Aug 2024 09:47:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81468#M36316</guid>
      <dc:creator>hpant</dc:creator>
      <dc:date>2024-08-01T09:47:09Z</dc:date>
    </item>
    <item>
      <title>Re: Where exactly I should create Volume in a catalog?</title>
      <link>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81484#M36326</link>
      <description>&lt;P&gt;Hello! Volumes go inside of schemas (screenshot below). It's up to you how to keep your data organised, but a few considerations:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;If you're going to have lots of volumes, does it make sense to group them together?&lt;/LI&gt;
&lt;LI&gt;As it's raw data, it's probably categorised as 'bronze' data, you could consider keeping it with that  &lt;/LI&gt;
&lt;LI&gt;Will you have to manage access to this data? Does it make sense to group it with other data you may want to prevent / promote access to?&lt;/LI&gt;
&lt;LI&gt;Do you want it to inherit other properties from things in the schema like tags, features or access patterns?&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;At the end of the day, schemas and catalogs are there to keep data organised. With external volumes (and tables) it has no baring on where the data is stored so it doesn't have much technical impact.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;My messy demo example:&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screenshot 2024-08-01 at 13.27.39.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/10082i60C0ED5A96EEC7F5/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Screenshot 2024-08-01 at 13.27.39.png" alt="Screenshot 2024-08-01 at 13.27.39.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 01 Aug 2024 12:35:12 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81484#M36326</guid>
      <dc:creator>holly</dc:creator>
      <dc:date>2024-08-01T12:35:12Z</dc:date>
    </item>
    <item>
      <title>Re: Where exactly I should create Volume in a catalog?</title>
      <link>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81490#M36329</link>
      <description>&lt;P&gt;Alright, thanks for your explanation. I have one more question, after creating a volume, how would you get it connect to a container? Imagine, you have created volume at external location a and you want to connect it external location b?&lt;/P&gt;</description>
      <pubDate>Thu, 01 Aug 2024 12:45:06 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81490#M36329</guid>
      <dc:creator>hpant</dc:creator>
      <dc:date>2024-08-01T12:45:06Z</dc:date>
    </item>
    <item>
      <title>Re: Where exactly I should create Volume in a catalog?</title>
      <link>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81502#M36334</link>
      <description>&lt;P&gt;Hi hpant, each volume is mapped to one location only. If you need to get data from two different locations, you'd make two separate volumes and join them as part of your pipeline.&lt;/P&gt;
&lt;P&gt;If you wanted to read in from one location and write to another, again, you'd do that with two separate volumes.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;When I said 'group them together' above - you can have multiple volumes in one schema, even if their locations are very different.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 01 Aug 2024 13:57:55 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81502#M36334</guid>
      <dc:creator>holly</dc:creator>
      <dc:date>2024-08-01T13:57:55Z</dc:date>
    </item>
    <item>
      <title>Re: Where exactly I should create Volume in a catalog?</title>
      <link>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81506#M36336</link>
      <description>&lt;P&gt;Hey, thanks for your response. currently, I have my data in one of the container in azure which gets added to the container through azure factory pipeline.. I have created a unity catalog workspace in different resource group. It has a container but there is no data in it. I have created a volume in it. Now I want to connect the volume to&amp;nbsp; the data present in a container of different storage account of different resource group. How can I make that connection? Do I need some sort of access key mechanism?&lt;/P&gt;</description>
      <pubDate>Thu, 01 Aug 2024 14:08:56 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81506#M36336</guid>
      <dc:creator>hpant</dc:creator>
      <dc:date>2024-08-01T14:08:56Z</dc:date>
    </item>
    <item>
      <title>Re: Where exactly I should create Volume in a catalog?</title>
      <link>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81825#M36418</link>
      <description>&lt;P&gt;Hi hpant,&lt;/P&gt;
&lt;P&gt;You need to set up a new volume using a new external location (and potentially storage credential). Docs here:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/sql-ref-external-locations" target="_blank"&gt;https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/sql-ref-external-locations&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 05 Aug 2024 08:26:15 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81825#M36418</guid>
      <dc:creator>holly</dc:creator>
      <dc:date>2024-08-05T08:26:15Z</dc:date>
    </item>
    <item>
      <title>Re: Where exactly I should create Volume in a catalog?</title>
      <link>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81843#M36426</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/36301"&gt;@holly&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Thanks so much for your response. I have one last question in this regard. Whenever I want add an extra location, ( external location), do I need to give&amp;nbsp;&lt;STRONG&gt;Contributor&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;role or higher on the access connector resource in Azure to add the storage credential first?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/databricks/connect/unity-catalog/storage-credentials" target="_blank"&gt;Create a storage credential for connecting to Azure Data Lake Storage Gen2 - Azure Databricks | Microsoft Learn&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Thanks,&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Hiamnshu Pant&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 05 Aug 2024 09:01:55 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81843#M36426</guid>
      <dc:creator>hpant1</dc:creator>
      <dc:date>2024-08-05T09:01:55Z</dc:date>
    </item>
    <item>
      <title>Re: Where exactly I should create Volume in a catalog?</title>
      <link>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81917#M36449</link>
      <description>&lt;P&gt;The docs say 'Contributor or Owner of an Azure resource group' and I don't have any reason to contradict that&amp;nbsp;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 05 Aug 2024 15:18:24 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81917#M36449</guid>
      <dc:creator>holly</dc:creator>
      <dc:date>2024-08-05T15:18:24Z</dc:date>
    </item>
    <item>
      <title>Re: Where exactly I should create Volume in a catalog?</title>
      <link>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81991#M36468</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9"&gt;@Retired_mod&lt;/a&gt;&amp;nbsp;.&lt;/P&gt;&lt;P&gt;I am trying to do that but couldn't find the option.&lt;/P&gt;</description>
      <pubDate>Tue, 06 Aug 2024 08:38:47 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81991#M36468</guid>
      <dc:creator>hpant1</dc:creator>
      <dc:date>2024-08-06T08:38:47Z</dc:date>
    </item>
    <item>
      <title>Re: Where exactly I should create Volume in a catalog?</title>
      <link>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81996#M36471</link>
      <description>&lt;P&gt;No, I don't have.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="hpant1_0-1722933848032.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/10156i9E7190B1220A3B87/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400" role="button" title="hpant1_0-1722933848032.png" alt="hpant1_0-1722933848032.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 06 Aug 2024 08:44:18 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/where-exactly-i-should-create-volume-in-a-catalog/m-p/81996#M36471</guid>
      <dc:creator>hpant1</dc:creator>
      <dc:date>2024-08-06T08:44:18Z</dc:date>
    </item>
  </channel>
</rss>

