I have run the WordCount program and have saved the output into a directory as follows
counts.saveAsTextFile("/users/data/hobbit-out1")
subsequently I check that the output directory contains the expected number of files
%fs ls /users/data/hobbit-out1
and I see that three files exist
dbfs:/users/data/hobbit-out1/_SUCCESS_SUCCESSdbfs:/users/data/hobbit-out1/part-00000part-00000dbfs:/users/data/hobbit-out1/part-00001part-00001
now I want to get the file dbfs:/users/data/hobbit-out1/part-00000 into my local computer.
i understand that to access these files i have to point my browser to an URL like
https://community.cloud.databricks.com/files/my-stuff/my-file.txt?o=######
in my notebook URL i note that the URL contains o=7892876048313913 and so the URL to my file should be
https://community.cloud.databricks.com/files/users/data/hobbit-out1/part-00000?o=7892876048313913
but this is leading me to 404 file not found error.
Can someone please tell me what is my error? Either in the approach or in the construction of the URL.
I have not yet tried the S3 route and but will try that if that is the ONLY way to get files out of dbfs.
Thanks for any help or guidance.