<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How do I configure my interactive compute in databricks to access files from an EFS filesystem? in Data Governance</title>
    <link>https://community.databricks.com/t5/data-governance/how-do-i-configure-my-interactive-compute-in-databricks-to/m-p/136437#M2647</link>
    <description>&lt;P&gt;I have an S3 account in which I have full administrator privileges. In that account I have a Databricks workspace and an EFS filesystem set up.&amp;nbsp; I created an interactive compute in the Databricks workspace with the default config. How do I configure my interactive compute in Databricks to access files from an EFS filesystem?&lt;/P&gt;</description>
    <pubDate>Tue, 28 Oct 2025 16:30:41 GMT</pubDate>
    <dc:creator>mtaraviya-QA</dc:creator>
    <dc:date>2025-10-28T16:30:41Z</dc:date>
    <item>
      <title>How do I configure my interactive compute in databricks to access files from an EFS filesystem?</title>
      <link>https://community.databricks.com/t5/data-governance/how-do-i-configure-my-interactive-compute-in-databricks-to/m-p/136437#M2647</link>
      <description>&lt;P&gt;I have an S3 account in which I have full administrator privileges. In that account I have a Databricks workspace and an EFS filesystem set up.&amp;nbsp; I created an interactive compute in the Databricks workspace with the default config. How do I configure my interactive compute in Databricks to access files from an EFS filesystem?&lt;/P&gt;</description>
      <pubDate>Tue, 28 Oct 2025 16:30:41 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/how-do-i-configure-my-interactive-compute-in-databricks-to/m-p/136437#M2647</guid>
      <dc:creator>mtaraviya-QA</dc:creator>
      <dc:date>2025-10-28T16:30:41Z</dc:date>
    </item>
    <item>
      <title>Re: How do I configure my interactive compute in databricks to access files from an EFS filesystem?</title>
      <link>https://community.databricks.com/t5/data-governance/how-do-i-configure-my-interactive-compute-in-databricks-to/m-p/136652#M2648</link>
      <description>&lt;P&gt;*When I said S3 account, I meant AWS account. This question is strictly about EFS. I am using EFS because my attempts to use S3 did not work, and EFS suits my usage requirements more closely.&lt;/P&gt;</description>
      <pubDate>Wed, 29 Oct 2025 22:48:10 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/how-do-i-configure-my-interactive-compute-in-databricks-to/m-p/136652#M2648</guid>
      <dc:creator>mtaraviya-QA</dc:creator>
      <dc:date>2025-10-29T22:48:10Z</dc:date>
    </item>
    <item>
      <title>Re: How do I configure my interactive compute in databricks to access files from an EFS filesystem?</title>
      <link>https://community.databricks.com/t5/data-governance/how-do-i-configure-my-interactive-compute-in-databricks-to/m-p/137241#M2660</link>
      <description>&lt;P&gt;Greetings&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/195073"&gt;@mtaraviya-QA&lt;/a&gt;, here’s how to configure your interactive Databricks compute to access files in AWS EFS.&lt;/P&gt;
&lt;H3 class="paragraph"&gt;Prerequisites on AWS networking&lt;/H3&gt;
&lt;UL&gt;
&lt;LI class="paragraph"&gt;Ensure the &lt;STRONG&gt;Databricks cluster VPC/subnets can reach EFS mount targets&lt;/STRONG&gt;. Place EFS mount targets in subnets reachable from your cluster, and open NFS (TCP 2049) in the relevant security groups.&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;If EFS is in a &lt;STRONG&gt;different VPC or AWS account&lt;/STRONG&gt;, set up VPC peering or a transit gateway and routing between VPCs.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Enable &lt;STRONG&gt;VPC DNS resolution and hostnames&lt;/STRONG&gt; and test reachability from the cluster network (for example, &lt;CODE&gt;nslookup fs-xxxx.efs.&amp;lt;region&amp;gt;.amazonaws.com&lt;/CODE&gt; and &lt;CODE&gt;nc -vz &amp;lt;efs-hostname&amp;gt; 2049&lt;/CODE&gt;). If cross‑VPC DNS is not available, plan to mount using the mount target IP address.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;/UL&gt;
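&lt;DIV class="paragraph"&gt;If you would rather run these reachability checks from Python (for example, in a notebook cell on a cluster in the same subnets), here is a minimal sketch using only the standard library. The function names &lt;CODE&gt;resolve_host&lt;/CODE&gt; and &lt;CODE&gt;can_reach&lt;/CODE&gt; are my own and not part of any Databricks or AWS API. A successful TCP connection only shows that port 2049 is open; it does not prove that a mount will succeed.&lt;/DIV&gt;

```python
import socket


def resolve_host(hostname):
    """Return the sorted IP addresses for hostname, or [] if DNS resolution fails."""
    try:
        infos = socket.getaddrinfo(hostname, None, proto=socket.IPPROTO_TCP)
    except socket.gaierror:
        return []
    # Each entry is (family, type, proto, canonname, sockaddr); sockaddr[0] is the IP.
    return sorted({info[4][0] for info in infos})


def can_reach(host, port=2049, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout seconds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unroutable addresses.
        return False
```

&lt;DIV class="paragraph"&gt;For example, call &lt;CODE&gt;resolve_host("fs-abcdef0123456789.efs.us-east-1.amazonaws.com")&lt;/CODE&gt; with your own filesystem hostname, then &lt;CODE&gt;can_reach(ip)&lt;/CODE&gt; for each address returned. An empty list points at DNS; a resolved address that is unreachable points at security groups or routing.&lt;/DIV&gt;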
&lt;H3 class="paragraph"&gt;Configure the Databricks cluster (API-based mount)&lt;/H3&gt;
&lt;DIV class="paragraph"&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;DIV class="paragraph"&gt;Mount EFS via the Clusters API using the experimental cluster_mount_infos field. Do not use init scripts for EFS on typical shared/E2 workspaces.&lt;/DIV&gt;
&lt;UL&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Create or edit your cluster to include cluster_mount_infos; example: &lt;CODE&gt;json
{
  "cluster_name": "efs-cluster",
  "spark_version": "15.4.x-scala2.12",
  "node_type_id": "i3.xlarge",
  "num_workers": 2,
  "cluster_mount_infos": [
    {
      "network_filesystem_info": {
        "server_address": "fs-abcdef0123456789.efs.us-east-1.amazonaws.com",
        "mount_options": "nfsvers=4.1,rsize=1048576,wsize=1048576,hard,timeo=600,retrans=2,noresvport"
      },
      "local_mount_dir_path": "/mnt/volumes/efs-mount",
      "remote_mount_dir_path": "/"
    }
  ]
}
&lt;/CODE&gt;&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;If DNS doesn’t resolve (common in cross‑VPC setups), use the &lt;STRONG&gt;mount target IP&lt;/STRONG&gt; for &lt;CODE&gt;server_address&lt;/CODE&gt;. Optionally pin the cluster’s AZ with &lt;CODE&gt;aws_attributes.zone_id&lt;/CODE&gt; to match the mount target’s AZ.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Access the mount at the path you specified in &lt;STRONG&gt;local_mount_dir_path&lt;/STRONG&gt; (for example, &lt;CODE&gt;/mnt/volumes/efs-mount&lt;/CODE&gt;). In some environments mounts are presented under &lt;CODE&gt;/db-mnt/...&lt;/CODE&gt;; if you don’t see your path at the root, check under &lt;CODE&gt;/db-mnt&lt;/CODE&gt;.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;/UL&gt;
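&lt;DIV class="paragraph"&gt;To show how the JSON above might be submitted, here is a hedged Python sketch that builds the same spec and posts it to the Clusters API. The helper names are my own, the values mirror the example above, and the &lt;CODE&gt;/api/2.0/clusters/create&lt;/CODE&gt; path, workspace URL, and personal access token are assumptions you should check against your workspace's API documentation before using this.&lt;/DIV&gt;

```python
import json
import urllib.request

# NFS mount options copied from the example above.
EFS_MOUNT_OPTIONS = "nfsvers=4.1,rsize=1048576,wsize=1048576,hard,timeo=600,retrans=2,noresvport"


def build_cluster_spec(server_address, local_dir="/mnt/volumes/efs-mount", zone_id=None):
    """Build a cluster spec dict with one EFS entry in cluster_mount_infos."""
    spec = {
        "cluster_name": "efs-cluster",
        "spark_version": "15.4.x-scala2.12",
        "node_type_id": "i3.xlarge",
        "num_workers": 2,
        "cluster_mount_infos": [
            {
                "network_filesystem_info": {
                    "server_address": server_address,
                    "mount_options": EFS_MOUNT_OPTIONS,
                },
                "local_mount_dir_path": local_dir,
                "remote_mount_dir_path": "/",
            }
        ],
    }
    if zone_id is not None:
        # Pin the cluster to the mount target's AZ (useful when mounting by IP).
        spec["aws_attributes"] = {"zone_id": zone_id}
    return spec


def create_cluster(workspace_url, token, spec):
    """POST the spec to the Clusters API and return the parsed JSON response.

    Assumption: workspace_url looks like "https://your-workspace.cloud.databricks.com"
    and token is a valid personal access token.
    """
    request = urllib.request.Request(
        url=workspace_url.rstrip("/") + "/api/2.0/clusters/create",
        data=json.dumps(spec).encode("utf-8"),
        headers={"Authorization": "Bearer " + token, "Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

&lt;DIV class="paragraph"&gt;Passing a mount target IP as &lt;CODE&gt;server_address&lt;/CODE&gt; together with a &lt;CODE&gt;zone_id&lt;/CODE&gt; covers the cross‑VPC case described above. Keeping the spec in code also makes it easier to confirm the mount block is still present after later cluster edits.&lt;/DIV&gt;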
&lt;H3 class="paragraph"&gt;Terraform example&lt;/H3&gt;
&lt;DIV class="paragraph"&gt;&lt;CODE&gt;&lt;/CODE&gt;&lt;/DIV&gt;
&lt;DIV class="paragraph"&gt;&lt;CODE&gt;hcl
resource "databricks_cluster" "with_efs" {
  # ...
  cluster_mount_info {
    network_filesystem_info {
      server_address = "fs-abcdef0123456789.efs.us-east-1.amazonaws.com"
      mount_options  = "nfsvers=4.1,rsize=1048576,wsize=1048576,hard,timeo=600,retrans=2,noresvport"
    }
    remote_mount_dir_path = "/"
    local_mount_dir_path  = "/mnt/volumes/efs-mount"
  }
}
&lt;/CODE&gt;&lt;/DIV&gt;
&lt;DIV class="paragraph"&gt;### Verify from a notebook * Check the mount and list files: &lt;CODE&gt;bash
  %sh
  mount | grep efs
  ls -la /mnt/volumes/efs-mount
  &lt;/CODE&gt;&lt;/DIV&gt;
&lt;UL&gt;
&lt;LI&gt;Access files via POSIX paths: &lt;CODE&gt;python
with open("/mnt/volumes/efs-mount/somefile.txt") as f:
    print(f.readline())
&lt;/CODE&gt;&lt;/LI&gt;
&lt;/UL&gt;
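&lt;DIV class="paragraph"&gt;If you prefer to do the mount check in Python rather than &lt;CODE&gt;%sh&lt;/CODE&gt;, here is a small sketch that parses &lt;CODE&gt;/proc/mounts&lt;/CODE&gt; and reports NFS entries. The function names are my own, and it assumes a Linux driver node where &lt;CODE&gt;/proc/mounts&lt;/CODE&gt; uses the usual six whitespace-separated fields. Because it reports the actual mount point, it should also reveal whether the mount landed somewhere other than the path you configured.&lt;/DIV&gt;

```python
def find_nfs_mounts(mounts_text):
    """Return (source, mount_point, fs_type) for each NFS entry in /proc/mounts-style text."""
    entries = []
    for line in mounts_text.splitlines():
        fields = line.split()
        # A well-formed line has: source, mount point, type, options, dump, pass.
        if len(fields) != 6:
            continue
        source, mount_point, fs_type = fields[:3]
        if fs_type in ("nfs", "nfs4"):
            entries.append((source, mount_point, fs_type))
    return entries


def read_nfs_mounts(path="/proc/mounts"):
    """Read the mount table from path and return its NFS entries."""
    with open(path) as handle:
        return find_nfs_mounts(handle.read())
```

&lt;DIV class="paragraph"&gt;Calling &lt;CODE&gt;read_nfs_mounts()&lt;/CODE&gt; in a notebook cell only inspects the driver; an empty list there means the EFS mount was not applied on that node.&lt;/DIV&gt;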
&lt;H3 class="paragraph"&gt;Important notes and limitations&lt;/H3&gt;
&lt;UL&gt;
&lt;LI class="paragraph"&gt;&lt;STRONG&gt;Init scripts&lt;/STRONG&gt; are not the supported method for mounting EFS on shared/E2 workspaces; use the &lt;STRONG&gt;Clusters API&lt;/STRONG&gt; (&lt;CODE&gt;cluster_mount_infos&lt;/CODE&gt;) or Terraform.&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;The &lt;STRONG&gt;amazon-efs-utils IAM/TLS mount helper&lt;/STRONG&gt; is not supported in this integration. Use NFSv4.1 with standard mount options (as shown above).&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;After cluster edits or restarts, ensure the &lt;STRONG&gt;mount configuration remains in the cluster definition&lt;/STRONG&gt;; avoid editing mounts in the UI, as custom properties can be lost. If the cluster autoscale adds nodes, Databricks will apply the configured mount during node setup.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;This applies to &lt;STRONG&gt;classic clusters&lt;/STRONG&gt;. For serverless compute, use S3 or Unity Catalog Volumes rather than EFS.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;H3 class="paragraph"&gt;Troubleshooting checklist&lt;/H3&gt;
&lt;UL&gt;
&lt;LI class="paragraph"&gt;From a same‑subnet test EC2 instance (with the same security group rules), try: &lt;CODE&gt;bash
  sudo apt-get update &amp;amp;&amp;amp; sudo apt-get install -y nfs-common
  sudo mkdir /efs
  sudo mount -t nfs4 -o nfsvers=4.1,rsize=1048576,wsize=1048576,hard,timeo=600,retrans=2,noresvport &amp;lt;efs-host-or-ip&amp;gt;:/ /efs
  mount | grep efs
  &lt;/CODE&gt; This validates network, security group, and DNS routing independently of Databricks.&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;If DNS fails, &lt;STRONG&gt;use the mount target IP&lt;/STRONG&gt; and pin the cluster AZ. Confirm security groups allow TCP 2049 and that routing to peering/TGW is correct.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;If you’re using RStudio on Databricks, EFS via &lt;CODE&gt;cluster_mount_infos&lt;/CODE&gt; is a good way to persist user data; ensure the cluster can write to the mount (for example, &lt;CODE&gt;chmod a+w /mnt/volumes/efs-mount&lt;/CODE&gt;).&lt;/DIV&gt;
&lt;/LI&gt;
&lt;/UL&gt;
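&lt;DIV class="paragraph"&gt;To confirm write access without changing permissions first, a quick Python probe can help. This is a sketch using only the standard library; &lt;CODE&gt;is_writable&lt;/CODE&gt; is a name I made up, and it only tests the node where it runs (the driver, when called from a notebook cell).&lt;/DIV&gt;

```python
import tempfile


def is_writable(directory):
    """Return True if a temporary file can be created and written in directory."""
    try:
        # The temporary file is deleted automatically when the block exits.
        with tempfile.NamedTemporaryFile(dir=directory, prefix=".write-test-") as handle:
            handle.write(b"ok")
            handle.flush()
        return True
    except OSError:
        # Covers a missing directory, permission errors, and read-only mounts.
        return False
```

&lt;DIV class="paragraph"&gt;For example, &lt;CODE&gt;is_writable("/mnt/volumes/efs-mount")&lt;/CODE&gt; returning False suggests checking the ownership and permissions of the EFS directory before reaching for &lt;CODE&gt;chmod&lt;/CODE&gt;.&lt;/DIV&gt;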
&lt;DIV class="paragraph"&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;DIV class="paragraph"&gt;Hope this helps, Louis.&lt;/DIV&gt;</description>
      <pubDate>Sat, 01 Nov 2025 12:44:52 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/how-do-i-configure-my-interactive-compute-in-databricks-to/m-p/137241#M2660</guid>
      <dc:creator>Louis_Frolio</dc:creator>
      <dc:date>2025-11-01T12:44:52Z</dc:date>
    </item>
    <item>
      <title>Re: How do I configure my interactive compute in databricks to access files from an EFS filesystem?</title>
      <link>https://community.databricks.com/t5/data-governance/how-do-i-configure-my-interactive-compute-in-databricks-to/m-p/141214#M2706</link>
      <description>&lt;P&gt;&lt;SPAN&gt;If you’re trying to mount EFS directly to an interactive cluster, you’ll usually need to handle it through init scripts, since the NFS client has to be installed and the filesystem mounted at cluster startup. One thing to double-check is whether your workspace has the right VPC routing and security groups for NFS traffic (TCP 2049). Without that, the mount attempt will typically hang or time out and the cluster won’t see the filesystem.&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;If you already tried an init script, sharing the snippet or the cluster logs might help narrow down where it’s failing.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 05 Dec 2025 07:02:00 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/how-do-i-configure-my-interactive-compute-in-databricks-to/m-p/141214#M2706</guid>
      <dc:creator>EllieFarrell</dc:creator>
      <dc:date>2025-12-05T07:02:00Z</dc:date>
    </item>
  </channel>
</rss>

