cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Warehousing & Analytics
Engage in discussions on data warehousing, analytics, and BI solutions within the Databricks Community. Share insights, tips, and best practices for leveraging data for informed decision-making.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

What is the most efficient way to start an S3 bucket?

christys
Databricks Employee
Databricks Employee
 
1 REPLY 1

Taha
Databricks Employee
Databricks Employee

So if you've got an S3 bucket with your data in it, the first thing you'll need to do is connect it to a Databricks workspace to grant access. Then you can start querying the contents of the bucket from notebooks (or running jobs) by using clusters (compute resources) within the Databricks workspace to execute commands.

Here's a guide on the docs site that walks through the process to connect a bucket: https://docs.databricks.com/data/data-sources/aws/amazon-s3.html

Although it shares several options, I'd recommend using instance profiles and mounting via DBFS for simplicity.

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโ€™t want to miss the chance to attend and share knowledge.

If there isnโ€™t a group near you, start one and help create a community that brings people together.

Request a New Group