<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to use external locations in Get Started Discussions</title>
    <link>https://community.databricks.com/t5/get-started-discussions/how-to-use-external-locations/m-p/64010#M6818</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I am struggling with truly understanding how to work with external locations. As far as I am able to read, you have:&lt;/P&gt;&lt;P&gt;1) Managed catalogs&lt;BR /&gt;2) Managed schemas&lt;BR /&gt;3) Managed tables/volumes etc.&lt;BR /&gt;4) External locations that contains external tables and/or volumes&lt;BR /&gt;5) External volumes that can reside inside managed catalogs/schemas&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Most of the time, we want to write data inside of databricks - so managed catalogs, schemas and tables/volums seems natural. However, there are times when we want to write data (that we need to access inside of databricks) outside of databricks. In those cases, I understand that the way to do so, is using external locations.&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, working with external locations afterwards, I don't find straight forward.&lt;/P&gt;&lt;P&gt;For volumes, I like how I can create an external volume inside of a catalog. Then I have my raw catalog, with domain schmas and belonging managed tables end external volumes are organized within. However, when working with tabular data I find it harder to understand what you are supposed to do with it.&lt;/P&gt;&lt;P&gt;Databricks says: "Don't grant general READ FILES [...] permission on external locations to end users". Then how exactly should my users (I am a platform engineer, my users are data engineers, scientists and analysts) access these files? I don't want to do the work of creating managed tables for every table in an external location - when new data appears, those tables must be refreshed with new data. We have a lot of streaming use cases as well. ideally, I want tables to be organized in my catalogs and schemas the same way you can do with external volumes.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 18 Mar 2024 14:55:48 GMT</pubDate>
    <dc:creator>pernilak</dc:creator>
    <dc:date>2024-03-18T14:55:48Z</dc:date>
    <item>
      <title>How to use external locations</title>
      <link>https://community.databricks.com/t5/get-started-discussions/how-to-use-external-locations/m-p/64010#M6818</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I am struggling with truly understanding how to work with external locations. As far as I am able to read, you have:&lt;/P&gt;&lt;P&gt;1) Managed catalogs&lt;BR /&gt;2) Managed schemas&lt;BR /&gt;3) Managed tables/volumes etc.&lt;BR /&gt;4) External locations that contains external tables and/or volumes&lt;BR /&gt;5) External volumes that can reside inside managed catalogs/schemas&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Most of the time, we want to write data inside of databricks - so managed catalogs, schemas and tables/volums seems natural. However, there are times when we want to write data (that we need to access inside of databricks) outside of databricks. In those cases, I understand that the way to do so, is using external locations.&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, working with external locations afterwards, I don't find straight forward.&lt;/P&gt;&lt;P&gt;For volumes, I like how I can create an external volume inside of a catalog. Then I have my raw catalog, with domain schmas and belonging managed tables end external volumes are organized within. However, when working with tabular data I find it harder to understand what you are supposed to do with it.&lt;/P&gt;&lt;P&gt;Databricks says: "Don't grant general READ FILES [...] permission on external locations to end users". Then how exactly should my users (I am a platform engineer, my users are data engineers, scientists and analysts) access these files? I don't want to do the work of creating managed tables for every table in an external location - when new data appears, those tables must be refreshed with new data. We have a lot of streaming use cases as well. ideally, I want tables to be organized in my catalogs and schemas the same way you can do with external volumes.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 18 Mar 2024 14:55:48 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/how-to-use-external-locations/m-p/64010#M6818</guid>
      <dc:creator>pernilak</dc:creator>
      <dc:date>2024-03-18T14:55:48Z</dc:date>
    </item>
  </channel>
</rss>

