โ11-30-2022 11:20 AM
I am doing the "Data Engineering with Databricks V2" learning path.
I cannot run "DE 4.2 - Providing Options for External Sources", as the first code cell does not run successful:
%run ../Includes/Classroom-Setup-04.2
Screenshot 1:
Inside the setup notebook, the code crashes at the following command (see screenshot 2):
df = pd.read_parquet(path = datasource_path.replace("dbfs:/", '/dbfs/'))
The error message is:
FileNotFoundError: [Errno 2] No such file or directory: '/dbfs/mnt/dbacademy-datasets/data-engineering-with-databricks/v02/ecommerce/raw/users-historical'
Screenshot 2:
There seems to be an issue with the path, even though it actually exists:
Screenshot 3:
I played around a little with the path specification, but nothing helped:
Screenshot 4:
โ11-30-2022 11:42 AM
Hi @John Bโ
Can you please try by removing the dbfs and starting with /mnt only.
Also, if this does not work, can you please upload that notebooks DBC archive, so that I would be able to check the details.
Cheers..
โ12-16-2022 03:47 AM
โ11-30-2022 11:54 AM
Also @John Bโ
Assuming this is an old training course, check the same using a community cluster with DBR version less than 7. Some old training courses mount points are disabled in DBR 7+.
Cheers...
โ12-03-2022 12:22 AM
@John Bโ
Did your issue get resolved?
If not through the above methods, do ping the fix you did.
Cheers..
โ12-16-2022 04:33 AM
@Uma Maheswara Rao Desulaโ I solved the issue using ss2's suggestion (see below). After reading in a Spark DataFrame I converted it into a pandas DataFrame using the ToPandas() method.
โ12-16-2022 03:35 AM
Hi!
I can only use Runtime 7.3, 9.1., ..., 12.0. Minimum is 7.3. I am using DBR commnunity edition.
Br.
โ12-03-2022 07:14 PM
Can u try like this.spark.read.parquet("dbfs:/mnt/.......")โ
โ12-16-2022 03:58 AM
Hi @S Sโ
Reading in the file was successful. However, I got a pyspark.sql.dataframe.DataFrame object. This is not the same as a pandas DataFrame, right?
Br.
โ12-16-2022 07:04 AM
Hey @S Sโ ,
I can understand your issue
so to solve this import that DBC file and instead of question one there will be a folder for all solutions so explore solution one it will work.
Please upvote if you got some hint from my answer
Thanks
Aviral Bhardwaj
โ03-29-2023 08:11 AM
Hello All,
I am getting the exact issue as motioned in the first pot here. I have tried all the solutions listed: -
Any ideas what else I can try please.
Thanks.
โ01-04-2024 01:35 AM
I used spark.read.parquet and then convereted that to pandas dataframe and it worked for me.
Upvote if it helped you.
โ06-28-2024 07:37 AM
Worked for me too! Thanks
โ08-29-2024 06:38 PM
Thanks it helped.
โ02-02-2024 06:34 AM
Thanks for sharing this helped me too ๐ค
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโt want to miss the chance to attend and share knowledge.
If there isnโt a group near you, start one and help create a community that brings people together.
Request a New Group