<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Accessing the S3 Files in Get Started Discussions</title>
    <link>https://community.databricks.com/t5/get-started-discussions/accessing-the-s3-files/m-p/36698#M354</link>
    <description>&lt;P&gt;I am using the Unity Catalog Cluster. I have a requirement to read the files placed by the source team in a specific location (landing) in S3. I am already using a metastore pointing to a different bucket. Do I need to use an external location pointing to the landing bucket in S3? Additionally, how can I read the data from those files?&lt;/P&gt;</description>
    <pubDate>Mon, 03 Jul 2023 05:50:15 GMT</pubDate>
    <dc:creator>Databricks3</dc:creator>
    <dc:date>2023-07-03T05:50:15Z</dc:date>
    <item>
      <title>Accessing the S3 Files</title>
      <link>https://community.databricks.com/t5/get-started-discussions/accessing-the-s3-files/m-p/36698#M354</link>
      <description>&lt;P&gt;I am using the Unity Catalog Cluster. I have a requirement to read the files placed by the source team in a specific location (landing) in S3. I am already using a metastore pointing to a different bucket. Do I need to use an external location pointing to the landing bucket in S3? Additionally, how can I read the data from those files?&lt;/P&gt;</description>
      <pubDate>Mon, 03 Jul 2023 05:50:15 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/accessing-the-s3-files/m-p/36698#M354</guid>
      <dc:creator>Databricks3</dc:creator>
      <dc:date>2023-07-03T05:50:15Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing the S3 Files</title>
      <link>https://community.databricks.com/t5/get-started-discussions/accessing-the-s3-files/m-p/36955#M385</link>
      <description>&lt;P&gt;You have a couple of options to consider:&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;
&lt;P&gt;External location: Create an external location in your Unity Catalog metastore that points to the landing bucket in S3. This lets Unity Catalog govern access to the files where they are, without copying or moving them into the metastore's managed storage location. You can create the external location in the Databricks UI, with SQL, or through the Databricks SDK/REST API.&lt;/P&gt;
&lt;P&gt;To create the external location, specify the S3 bucket and prefix (folder) where the files are located, along with a storage credential that can access that bucket. Users who are granted READ FILES on the location can then read the data in place, without any data movement.&lt;/P&gt;
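&lt;P&gt;As a rough sketch of this setup, the snippet below builds the two SQL statements involved: one to create the external location and one to grant read access. The location name, bucket URL, storage credential name, and group name are all hypothetical placeholders, and the storage credential is assumed to exist already. On a cluster you would pass each statement to spark.sql.&lt;/P&gt;
&lt;PRE&gt;
```python
def external_location_statements(name, url, credential, principal):
    """Build the SQL to register an S3 prefix and grant read access to it."""
    create = (
        f"CREATE EXTERNAL LOCATION IF NOT EXISTS `{name}` "
        f"URL '{url}' "
        f"WITH (STORAGE CREDENTIAL `{credential}`)"
    )
    grant = f"GRANT READ FILES ON EXTERNAL LOCATION `{name}` TO `{principal}`"
    return [create, grant]


# Hypothetical names; replace with your own bucket, credential, and group.
statements = external_location_statements(
    name="landing_location",
    url="s3://my-landing-bucket/landing/",
    credential="landing_credential",
    principal="data_engineers",
)

for statement in statements:
    # On a Databricks cluster you would run: spark.sql(statement)
    print(statement)
```
&lt;/PRE&gt;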
&lt;/LI&gt;
&lt;LI&gt;
&lt;P&gt;Direct read: You can also read the files directly by path, without first registering them as a table. Query the files in the S3 landing bucket by their S3 path using SQL or Spark commands, and the cluster reads and processes them in a distributed fashion. Note that on a Unity Catalog cluster, path-based access is generally still authorized through an external location that covers the path (the READ FILES privilege), so in practice the two options are often used together.&lt;/P&gt;
&lt;P&gt;To read the data, load the S3 path with the Spark reader or query it in SQL, then filter, aggregate, or join the resulting datasets as needed.&lt;/P&gt;
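&lt;P&gt;A minimal sketch of a path-based read, assuming CSV files with a header row under a hypothetical landing path. The helper names are illustrative only, and the Spark call is left commented out because it needs the spark session that a Databricks notebook provides.&lt;/P&gt;
&lt;PRE&gt;
```python
def read_landing_files(spark, path, file_format="csv", options=None):
    """Return a DataFrame over the files under an S3 path."""
    reader = spark.read.format(file_format)
    for key, value in (options or {}).items():
        reader = reader.option(key, value)
    return reader.load(path)


def path_query(path, file_format="csv"):
    """Build a SQL query that reads files by path instead of by table name."""
    return f"SELECT * FROM {file_format}.`{path}`"


# Hypothetical landing path; replace with the location the source team writes to.
LANDING_PATH = "s3://my-landing-bucket/landing/orders/"

print(path_query(LANDING_PATH))

# In a Databricks notebook, where `spark` is already defined:
# df = read_landing_files(spark, LANDING_PATH, options={"header": "true"})
# df.show(5)
```
&lt;/PRE&gt;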
&lt;/LI&gt;
&lt;/OL&gt;</description>
      <pubDate>Wed, 05 Jul 2023 05:30:25 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/accessing-the-s3-files/m-p/36955#M385</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-07-05T05:30:25Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing the S3 Files</title>
      <link>https://community.databricks.com/t5/get-started-discussions/accessing-the-s3-files/m-p/37028#M399</link>
      <description>&lt;P&gt;Could you share an example of reading the files in both cases? That would be really helpful.&lt;/P&gt;</description>
      <pubDate>Wed, 05 Jul 2023 17:43:15 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/accessing-the-s3-files/m-p/37028#M399</guid>
      <dc:creator>Databricks3</dc:creator>
      <dc:date>2023-07-05T17:43:15Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing the S3 Files</title>
      <link>https://community.databricks.com/t5/get-started-discussions/accessing-the-s3-files/m-p/37477#M473</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/67180"&gt;@Databricks3&lt;/a&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Hope you are well. I just wanted to check whether you were able to find an answer to your question. If so, would you like to mark an answer as best? It would be really helpful for other members too.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Cheers!&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 12 Jul 2023 09:54:18 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/accessing-the-s3-files/m-p/37477#M473</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-07-12T09:54:18Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing the S3 Files</title>
      <link>https://community.databricks.com/t5/get-started-discussions/accessing-the-s3-files/m-p/83492#M3633</link>
      <description>&lt;P&gt;Did anyone find a solution to this? I am also having trouble reading a file from S3 using boto3 on a Unity Catalog-enabled cluster. I created the S3 external location and granted sufficient access. Any help with this?&lt;/P&gt;&lt;P&gt;The same path and data are accessible using PySpark without any issues.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Aug 2024 21:04:19 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/accessing-the-s3-files/m-p/83492#M3633</guid>
      <dc:creator>krikotti</dc:creator>
      <dc:date>2024-08-19T21:04:19Z</dc:date>
    </item>
  </channel>
</rss>

