Administration & Architecture

from Google Cloud Storage

refah_1
New Contributor

Hi everyone,

I'm new to Databricks and am trying to connect my Google Cloud Storage bucket to my Databricks workspace. I have a 43GB CSV file stored in a GCP bucket that I want to work with. Here's what I've done so far:

  1. Bucket Setup:

    • I created a GCP bucket (in the west6 region) where my CSV file is stored.
  2. Databricks Configuration:

    • I have a Databricks workspace (in the west2 region).
    • I created a storage credential in Unity Catalog using a GCP Service Account, and I noted down the service account email.
  3. IAM Roles:

    • In the Google Cloud Console, I granted the service account the Storage Legacy Bucket Reader and Storage Object Admin roles on my bucket.
  4. External Location:

    • I attempted to create an external location in Databricks, pointing to gs://<my-bucket-name>/, using the storage credential I created.
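For reference, the equivalent SQL for the external location in step 4 would look roughly like this when run from a notebook; the location and credential names are placeholders for what I actually set up:

      # Placeholders: my_gcs_location and my_gcp_credential; run on UC-enabled compute.
      spark.sql("""
        CREATE EXTERNAL LOCATION IF NOT EXISTS my_gcs_location
          URL 'gs://<my-bucket-name>/'
          WITH (STORAGE CREDENTIAL my_gcp_credential)
      """)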

Despite following these steps, I'm unable to see or access my CSV file from Databricks. I'm not sure if the region difference (bucket in west6 vs. workspace in west2) or something else is causing the issue.

Has anyone experienced a similar problem or can provide guidance on troubleshooting this connection? Any help would be greatly appreciated!

Thanks in advance!

1 REPLY

Louis_Frolio
Databricks Employee

Hey @refah_1, thanks for laying out the steps; you're very close. Here's a structured checklist to get GCS working with Unity Catalog, plus a couple of common gotchas to check.

 

What's likely going on

  • The region mismatch isn't the root cause; the docs emphasize co-locating the bucket and workspace mainly to avoid egress charges, not as a hard requirement for connectivity.
  • If your GCS bucket has Hierarchical Namespace (HNS) enabled, Unity Catalog external locations won't work. Make sure HNS is disabled on that bucket.
  • You must assign the required GCS IAM roles to the Databricks-generated service account associated with your storage credential. The roles are Storage Legacy Bucket Reader and Storage Object Admin, and they must be granted on the bucket to exactly that service account principal.
  • To see or use the external location in Databricks, you also need Unity Catalog privileges (sketched right after this list):
    • BROWSE to list paths in Catalog Explorer.
    • READ FILES to read files via gs://... paths.
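For completeness, granting those two privileges from a notebook looks something like this; my_gcs_location and the grantee are placeholders for your external location name and your user or group:

      # Placeholders: my_gcs_location and the principal; run on UC-enabled compute.
      spark.sql("GRANT BROWSE ON EXTERNAL LOCATION my_gcs_location TO `your_user_or_group`")
      spark.sql("GRANT READ FILES ON EXTERNAL LOCATION my_gcs_location TO `your_user_or_group`")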

Quick validation steps in Databricks

  1. In Catalog Explorer, open your External location and click Test connection (verifies READ/WRITE/PATH EXIST/DELETE). If this fails, the issue is with the storage credential, IAM on the bucket, or the path.
  2. Confirm your external location URL points to the correct parent path containing the CSV, for example gs://my-bucket/ or gs://my-bucket/path/. Paths must use only ASCII characters (A-Z, a-z, 0-9, /, _, -).
  3. Grant yourself the external location privileges:
    • In Catalog Explorer > External locations > your location > Permissions, grant your user/group:
      • BROWSE (to list) and READ FILES (to read).
  4. Verify from a notebook:

     display(dbutils.fs.ls('gs://<my-bucket>/'))  # or the subfolder that contains the CSV

     spark.read.format("csv") \
         .option("header", "true") \
         .option("inferSchema", "true") \
         .load('gs://<my-bucket>/<path-to>/file.csv') \
         .display()
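If you prefer SQL for step 4, a rough equivalent check, assuming your external location is named my_gcs_location (a placeholder):

     # Shows the URL, storage credential, and owner recorded for the external location.
     display(spark.sql("DESCRIBE EXTERNAL LOCATION my_gcs_location"))
     # Lists objects under the path; requires the privileges granted above.
     display(spark.sql("LIST 'gs://<my-bucket>/'"))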

Double-check the GCP side

  • Ensure you granted the roles to the exact Databricks-generated service account that appears when you created the storage credential (it looks like an email). Assign the following on the bucket where the CSV resides (a scripted sketch of these grants follows this list):
    • Storage Legacy Bucket Reader
    • Storage Object Admin
  • If you want Databricks to configure file events (optional but recommended), add the custom Pub/Sub and storage.buckets.update permissions described in the docs; otherwise skip this for now.
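Purely as a sketch of the GCP side (outside Databricks), the same two role grants can be scripted with the google-cloud-storage client; the bucket name and service account email below are placeholders:

    # Sketch using the google-cloud-storage Python client; all names are placeholders.
    from google.cloud import storage

    client = storage.Client()
    bucket = client.bucket("my-bucket-name")
    policy = bucket.get_iam_policy(requested_policy_version=3)

    # The Databricks-generated service account email shown on the storage credential.
    member = "serviceAccount:<databricks-generated-sa-email>"
    for role in ("roles/storage.legacyBucketReader", "roles/storage.objectAdmin"):
        policy.bindings.append({"role": role, "members": {member}})

    bucket.set_iam_policy(policy)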

About regions

  • Co-locating the workspace and bucket is recommended to avoid egress charges and reduce latency, but region differences alone aren't called out as a connection blocker in the docs.

If you still canโ€™t see the file

  • Confirm HNS is disabled on the bucket (not supported with UC external locations).
  • Recreate the storage credential via Catalog Explorer so Databricks generates the service account, then reโ€‘grant the two roles on the bucket to that service account; retest the external location.
  • Make sure you're accessing with Unity Catalog-enabled compute and using three-level namespaces elsewhere (catalog.schema.table) to avoid defaulting to the legacy Hive metastore when creating tables over the data.

Next step ideas for your 43GB CSV

  • Read the CSV once and write it to Delta for faster, reliable downstream reads:

    df = (spark.read.format("csv")
          .option("header", "true")
          .option("inferSchema", "true")
          .load('gs://<my-bucket>/<path-to>/file.csv'))
    df.write.format("delta").mode("overwrite").save('gs://<my-bucket>/<path-to>/delta')

    Then use an external location to govern that Delta path, or register an external table over it (a sketch of the table registration follows).
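If you then want to query the Delta output by name, a minimal sketch of registering an external table over it; the catalog and schema names are placeholders, and you need the CREATE EXTERNAL TABLE privilege on the external location:

    # Placeholders: main.default.big_csv_delta and the gs:// path.
    spark.sql("""
      CREATE TABLE IF NOT EXISTS main.default.big_csv_delta
      USING DELTA
      LOCATION 'gs://<my-bucket>/<path-to>/delta'
    """)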
 
Hope this helps, Louis.