<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: mount bucket s3 in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/mount-bucket-s3/m-p/72049#M34478</link>
    <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/105521"&gt;@thiagoawstest&lt;/a&gt;&amp;nbsp;&lt;SPAN&gt;To mount an S3 bucket in Databricks on AWS so that all clusters and users have access to it without needing to remount each time, and without creating an access key in AWS, follow these steps:&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Mounting an S3 Bucket Using an AWS Instance Profile&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;1. Configure your cluster with an instance profile:&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;- Ensure your AWS instance profile has the permissions needed to access the S3 bucket.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;- Attach the instance profile to your Databricks cluster.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;2. Mount the S3 bucket:&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;- Run the&amp;nbsp;&lt;/SPAN&gt;&lt;CODE class="c-mrkdwn__code" data-stringify-type="code"&gt;dbutils.fs.mount&lt;/CODE&gt;&lt;SPAN&gt;&amp;nbsp;command once to mount the S3 bucket. The mount point is then accessible to all users and clusters in the workspace, and it persists across cluster restarts.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Example in Python:&lt;/SPAN&gt;&lt;/P&gt;
&lt;LI-CODE lang="python"&gt;
aws_bucket_name = "&amp;lt;aws-bucket-name&amp;gt;"
mount_name = "&amp;lt;mount-name&amp;gt;"

dbutils.fs.mount(
    source=f"s3a://{aws_bucket_name}",
    mount_point=f"/mnt/{mount_name}"
)

# Verify the mount
display(dbutils.fs.ls(f"/mnt/{mount_name}"))&lt;/LI-CODE&gt;
&lt;P&gt;Reference:&amp;nbsp;&lt;A href="https://docs.databricks.com/en/dbfs/mounts.html#mount-a-bucket-using-an-aws-instance-profile" target="_blank"&gt;https://docs.databricks.com/en/dbfs/mounts.html#mount-a-bucket-using-an-aws-instance-profile&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 07 Jun 2024 14:01:29 GMT</pubDate>
    <dc:creator>Yeshwanth</dc:creator>
    <dc:date>2024-06-07T14:01:29Z</dc:date>
    <item>
      <title>mount bucket s3</title>
      <link>https://community.databricks.com/t5/data-engineering/mount-bucket-s3/m-p/72044#M34477</link>
      <description>&lt;P&gt;Hi, I have Databricks configured on AWS, I need to mount some S3 buckets on Databricks in /mnt, but I have some questions:&lt;/P&gt;&lt;P&gt;- How can a bucket be mounted for all clusters and users to have access to, so as not to need to mount it every time the cluster starts?&lt;/P&gt;&lt;P&gt;- and there is a way to mount it without creating an access key in AWS, what is the best practice for a production environment?&lt;/P&gt;&lt;P&gt;thanks.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jun 2024 13:47:11 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/mount-bucket-s3/m-p/72044#M34477</guid>
      <dc:creator>thiagoawstest</dc:creator>
      <dc:date>2024-06-07T13:47:11Z</dc:date>
    </item>
    <item>
      <title>Re: mount bucket s3</title>
      <link>https://community.databricks.com/t5/data-engineering/mount-bucket-s3/m-p/72049#M34478</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/105521"&gt;@thiagoawstest&lt;/a&gt;&amp;nbsp;&lt;SPAN&gt;To mount an S3 bucket in Databricks on AWS so that all clusters and users have access to it without needing to remount each time, and without creating an access key in AWS, follow these steps:&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Mounting an S3 Bucket Using an AWS Instance Profile&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;1. Configure your cluster with an instance profile:&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;- Ensure your AWS instance profile has the permissions needed to access the S3 bucket.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;- Attach the instance profile to your Databricks cluster.&lt;/SPAN&gt;&lt;/P&gt;
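&lt;P&gt;&lt;SPAN&gt;Which permissions are "necessary" depends on your workload. As a rough sketch, assuming the cluster needs to list, read, write, and delete objects in a single bucket, the IAM policy attached to the instance profile's role might look like the one built below. The function name and the bucket name are hypothetical placeholders; drop the write and delete actions if the mount should be read-only.&lt;/SPAN&gt;&lt;/P&gt;

```python
import json


def build_s3_access_policy(bucket_name):
    """Return an IAM policy document (as a dict) granting access to one bucket."""
    bucket_arn = f"arn:aws:s3:::{bucket_name}"
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                # Listing is a bucket-level action, so it targets the bucket ARN.
                "Effect": "Allow",
                "Action": ["s3:ListBucket"],
                "Resource": [bucket_arn],
            },
            {
                # Object-level actions target the keys inside the bucket.
                "Effect": "Allow",
                "Action": ["s3:GetObject", "s3:PutObject", "s3:DeleteObject"],
                "Resource": [f"{bucket_arn}/*"],
            },
        ],
    }


# "my-example-bucket" is a placeholder, not a real bucket.
policy = build_s3_access_policy("my-example-bucket")
print(json.dumps(policy, indent=2))
```

&lt;P&gt;&lt;SPAN&gt;The printed JSON can be pasted into the IAM console as the role's policy document.&lt;/SPAN&gt;&lt;/P&gt;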
&lt;P&gt;&lt;SPAN&gt;2. Mount the S3 bucket:&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;- Run the&amp;nbsp;&lt;/SPAN&gt;&lt;CODE class="c-mrkdwn__code" data-stringify-type="code"&gt;dbutils.fs.mount&lt;/CODE&gt;&lt;SPAN&gt;&amp;nbsp;command once to mount the S3 bucket. The mount point is then accessible to all users and clusters in the workspace, and it persists across cluster restarts.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Example in Python:&lt;/SPAN&gt;&lt;/P&gt;
&lt;LI-CODE lang="python"&gt;
aws_bucket_name = "&amp;lt;aws-bucket-name&amp;gt;"
mount_name = "&amp;lt;mount-name&amp;gt;"

dbutils.fs.mount(
    source=f"s3a://{aws_bucket_name}",
    mount_point=f"/mnt/{mount_name}"
)

# Verify the mount
display(dbutils.fs.ls(f"/mnt/{mount_name}"))&lt;/LI-CODE&gt;
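&lt;P&gt;&lt;SPAN&gt;Because the mount persists, running the mount call a second time on an existing mount point fails. If the snippet lives in a job or init notebook that may run more than once, a small guard helps. This is a minimal sketch: &lt;/SPAN&gt;&lt;CODE&gt;ensure_mounted&lt;/CODE&gt;&lt;SPAN&gt; is a hypothetical helper name, and &lt;/SPAN&gt;&lt;CODE&gt;dbutils&lt;/CODE&gt;&lt;SPAN&gt; is passed in explicitly so the function can be defined outside a notebook.&lt;/SPAN&gt;&lt;/P&gt;

```python
def ensure_mounted(dbutils, bucket_name, mount_name):
    """Mount s3a://bucket_name at /mnt/mount_name unless it is already mounted.

    Returns True if a new mount was created, False if it already existed.
    """
    mount_point = f"/mnt/{mount_name}"
    # dbutils.fs.mounts() lists the mounts registered in the workspace.
    existing = {m.mountPoint for m in dbutils.fs.mounts()}
    if mount_point in existing:
        return False  # already mounted, nothing to do
    dbutils.fs.mount(source=f"s3a://{bucket_name}", mount_point=mount_point)
    return True


# In a Databricks notebook, dbutils is predefined, so the call would be:
# ensure_mounted(dbutils, "my-example-bucket", "my-mount")
```

&lt;P&gt;&lt;SPAN&gt;Clusters that were already running when the mount was created may not see it right away; running &lt;/SPAN&gt;&lt;CODE&gt;dbutils.fs.refreshMounts()&lt;/CODE&gt;&lt;SPAN&gt; on them picks up the new mount point.&lt;/SPAN&gt;&lt;/P&gt;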
&lt;P&gt;Reference:&amp;nbsp;&lt;A href="https://docs.databricks.com/en/dbfs/mounts.html#mount-a-bucket-using-an-aws-instance-profile" target="_blank"&gt;https://docs.databricks.com/en/dbfs/mounts.html#mount-a-bucket-using-an-aws-instance-profile&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jun 2024 14:01:29 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/mount-bucket-s3/m-p/72049#M34478</guid>
      <dc:creator>Yeshwanth</dc:creator>
      <dc:date>2024-06-07T14:01:29Z</dc:date>
    </item>
  </channel>
</rss>

