
Reading data from S3 in Azure Databricks

ossinova
Contributor II

Is it possible to create an external volume in Azure Databricks that points to an external S3 bucket so that I can read files for processing? Or are external volumes limited to ADLS Gen2?

3 REPLIES

PiotrU
Contributor II

I don't think so, but there are other ways to "mount" it as external storage. Some time ago I used sc._jsc.hadoopConfiguration().set (not sure if it still works), or maybe something like s3fs?
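A minimal sketch of the sc._jsc.hadoopConfiguration().set approach mentioned above, assuming static AWS keys and the standard Hadoop S3A property names fs.s3a.access.key and fs.s3a.secret.key. The helper names, the secret scope and key names, and the bucket path are hypothetical, and sc._jsc may not be reachable on every cluster access mode, so treat this as something to try rather than a confirmed recipe.

```python
def s3a_credentials_conf(access_key: str, secret_key: str) -> dict:
    """Return the Hadoop S3A properties for static AWS keys (hypothetical helper)."""
    return {
        "fs.s3a.access.key": access_key,
        "fs.s3a.secret.key": secret_key,
    }


def apply_hadoop_conf(sc, conf: dict) -> None:
    """Copy each property into the Hadoop configuration of the given SparkContext."""
    hadoop_conf = sc._jsc.hadoopConfiguration()
    for key, value in conf.items():
        hadoop_conf.set(key, value)


# In a Databricks notebook, where sc, spark and dbutils are predefined
# (scope, key names and bucket path below are placeholders):
#
#   conf = s3a_credentials_conf(
#       dbutils.secrets.get(scope="aws", key="aws-access-key"),
#       dbutils.secrets.get(scope="aws", key="aws-secret-key"),
#   )
#   apply_hadoop_conf(sc, conf)
#   df = spark.read.csv("s3a://my-bucket/path/to/files/", header=True)
```

Keeping the keys in a secret scope rather than in the notebook source avoids leaking them through notebook history or exports.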

szymon_dybczak
Contributor III

Hi @ossinova ,

I think it's currently not possible to create an external volume on an S3 bucket in UC. But you can still access S3 data using the following techniques:

  1. Access S3 buckets with URIs and AWS keys
  2. Access S3 with open-source Hadoop options
  3. You can mount the S3 bucket (but this method is deprecated)

Connect to Amazon S3 - Azure Databricks | Microsoft Learn
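As a rough illustration of options 1 and 2, the sketch below builds an s3a:// URI and the per-bucket form of the Hadoop S3A properties (fs.s3a.bucket.<bucket-name>.<property>), which lets different buckets use different credentials. The helper names, bucket name, and object key are hypothetical; how the properties get applied (cluster Spark config with a spark.hadoop. prefix, or the Hadoop configuration in a notebook) depends on your workspace setup, so check the linked documentation first.

```python
def s3a_uri(bucket: str, key: str = "") -> str:
    """Build an s3a:// URI for a bucket and an optional object key."""
    return f"s3a://{bucket}/{key.lstrip('/')}"


def per_bucket_conf(bucket: str, access_key: str, secret_key: str) -> dict:
    """Return S3A credential properties scoped to a single bucket (hypothetical helper)."""
    prefix = f"fs.s3a.bucket.{bucket}."
    return {
        prefix + "access.key": access_key,
        prefix + "secret.key": secret_key,
    }


# Possible use in a notebook, once the properties have been applied
# (bucket and key are placeholders):
#
#   df = spark.read.json(s3a_uri("my-bucket", "raw/events.json"))
```

Per-bucket properties keep the credentials for one S3 bucket from affecting access to any other bucket read from the same cluster.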

Ashley1
Contributor

Yep, I'm keen to see this functionality as well.

I think it is reasonable to expect that external locations can live on diverse storage types (at least those of the big players). I can nicely control access to Azure storage in UC, but not to S3.
