pandas.read_csv
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-13-2017 02:23 PM
HI all
i have uploaded a file on my cluster , at location
/FileStore/tables/qmwxhxvi1505337108590/PastHires.csv
However, whenever i try to read it using panda
df = pd.read_csv('dbfs:/FileStore/tables/qmwxhxvi1505337108590/PastHires.csv')
, i alwasy get a
File dbfs:/FileStore/tables/qmwxhxvi1505337108590/PastHires.csv does not exist
how can i get around it?
kind regards
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-24-2017 09:19 AM
Hi, i also struggled to get pandas read from csv. Use the below code with your path with a replacement of dbfs: with /dbfs and remove the header=True to make it works in databricks python notebook. you will end up with: pandas_df = pd.read_csv("/dbfs/FileStore/tables/2esy8tnj1455052720017/part_001-86465.tsv");
FYI reference Databricks Docs :https://docs.databricks.com/user-guide/importing-data.html Original statement not working : pandas_df = pd.read_csv("/dbfs/FileStore/tables/2esy8tnj1455052720017/part_001-86465.tsv", header=True)
Good Luck IT
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-24-2017 01:27 PM
Hello
thanks.. that helped
also for some unknown reason my notebook didnt display any output at all and i thought there was something going on withe code
Now i can see my original dataframe. many thanks
kr
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-07-2020 03:57 AM
I am also having same issue, I have uploaded file in DBFS and it gives some default code which itself is not working.
Is anyone has solved this issue ?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-07-2020 08:28 AM
I'm facing the same issue. However there is a workaround posted here: https://forums.databricks.com/questions/18254/unable-to-read-file-using-pandas.html
Basically read the csv using spark and then convert to pandas

