Where / how does DBFS store files?
02-05-2022 12:19 AM
I tried to use %fs head to print the contents of a CSV file used in a training:
%fs head "/mnt/path/file.csv"
but got an error saying it cannot head a directory.
Then I ran %fs ls on the same path and got a list of 4 files under a directory that has the CSV file's name.
I am a little confused about how DBFS stores files on the underlying storage and where they are stored. I am using Community Edition.
Any pointers appreciated. Thanks.
EDIT: Apache Iceberg has an illustration that shows the internal logical/storage structure well. Is there something similar for DBFS?
Labels: CSV, DBFS, Files, Mount Point, Storage
02-07-2022 07:47 AM
Is this a fresh cluster startup? I noticed that if you browse to the location using the UI, you get a prompt asking you to attach a cluster on the first startup. I would make sure that's set up and that you can see the file in the directory.
Also, have you tried loading the CSV into pandas to display it?
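Since %fs ls showed that the path is a directory holding several files, any local read has to target the individual files inside it rather than the directory itself. Here is a minimal sketch of that idea using only the Python standard library. It rests on a few assumptions that are not confirmed in this thread: that the directory is reachable as a local path (on Databricks this is usually done through a /dbfs/... prefix, which may not be available on Community Edition), that the data files are named part-* as Spark-style writers typically name them, and that the helper name preview_csv_dir is purely illustrative.

```python
import csv
import glob
import os


def preview_csv_dir(dir_path, max_rows=5):
    """Return up to max_rows rows read from the part files in dir_path."""
    rows = []
    # Assumption: data lives in files named part-*; marker files such as
    # _SUCCESS do not match this pattern and are skipped.
    for part in sorted(glob.glob(os.path.join(dir_path, "part-*"))):
        with open(part, newline="") as handle:
            for row in csv.reader(handle):
                rows.append(row)
                if len(rows) >= max_rows:
                    return rows
    return rows
```

With the path from the question, a call might look like preview_csv_dir("/dbfs/mnt/path/file.csv"), assuming that prefix works in your workspace. The same approach should carry over to pandas by passing each part file to pandas.read_csv and concatenating the results.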
03-02-2022 06:56 AM
Hi @Al Jo, are you still seeing the error while printing the contents of the CSV file?

