Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-04-2022 05:26 AM
Hey Team!
All I'm trying is to download a csv file stored on S3 and read it using Spark.
Here's what I mean:
!wget https://s3.amazonaws.com/nyc-tlc/trip+data/yellow_tripdata_2020-01.csvIf i download this "yellow_tripdata_2020-01.csv" where exactly it would be stored?
The response to wget is as below:
--2022-01-04 12:38:48-- https://s3.amazonaws.com/nyc-tlc/trip+data/yellow_tripdata_2020-01.csv
Resolving s3.amazonaws.com (s3.amazonaws.com)... 54.231.193.8
Connecting to s3.amazonaws.com (s3.amazonaws.com)|54.231.193.8|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: 593610736 (566M) [text/csv]
Saving to: ‘yellow_tripdata_2020-01.csv’
yellow_tripdata_202 100%[===================>] 566.11M 14.9MB/s in 42s
2022-01-04 12:39:31 (13.5 MB/s) - ‘yellow_tripdata_2020-01.csv’ saved [593610736/593610736]Any help would be appreciated.
Tagging
@Kaniz Fatma , @Harikrishnan Kunhumveettil for better reach.
Riz
Labels:
- Labels:
-
Data Ingestion & connectivity