10-27-2021 08:14 AM
I am new to Spark and working through some practice exercises. I have uploaded a zip file to the DBFS /FileStore/tables directory and am trying to run Python code to unzip it. The code is:
from zipfile import *
with ZipFile("/FileStore/tables/flight_data.zip", "r") as zipObj:
    zipObj.extractall()
It throws an error:
FileNotFoundError: [Errno 2] No such file or directory: '/FileStore/tables/flight_data.zip'
When I check manually, and also via dbutils.fs.ls("/FileStore/tables/"), the file is there:
Out[13]: [ FileInfo(path='dbfs:/FileStore/tables/flight_data.zip', name='flight_data.zip', size=59082358)]
Can someone please review and advise? I am using Community Edition, running this on a cluster with the following configuration:
Databricks Runtime Version 8.3 (includes Apache Spark 3.1.1, Scala 2.12)
10-27-2021 10:16 AM
The file is on the DBFS mount, so in most scenarios you should prefix the path with /dbfs when using local Python libraries (Databricks-native functions use dbfs:/ instead, and many of them, such as dbutils, need no prefix at all since they only handle DBFS). So please try:
from zipfile import *
with ZipFile("/dbfs/FileStore/tables/flight_data.zip", "r") as zipObj:
    zipObj.extractall()
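If the /dbfs path still fails, one quick sanity check (a minimal sketch, not from the original thread) is to ask local Python whether the DBFS FUSE mount is visible at all:

import os

# If the DBFS FUSE mount is not exposed on this cluster (as can happen on
# Community Edition), both of these print False.
print(os.path.exists("/dbfs"))
print(os.path.exists("/dbfs/FileStore/tables/flight_data.zip"))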
10-27-2021 10:29 AM
@Hubert Dudek

Hello Sir,
I tried the way you suggested as well. However, no luck! It still gives the same error.

Thank you,
Goutam
10-27-2021 10:40 AM
Are you perhaps on a high-concurrency cluster or some limited trial version? (Trial/free tiers can cause problems when reading with non-native libraries.)
Also try exploring the filesystem with shell commands, by putting the %sh magic on the first line of a notebook cell, to see whether the /dbfs folder is there:
%sh
ls /
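If %sh is not available on the cluster, a possible alternative (a sketch, not verified on Community Edition) is to list the driver's local filesystem through dbutils using the file:/ scheme:

# dbutils.fs treats file:/ paths as the driver's local disk, so this lists
# the local root and shows whether a /dbfs folder exists there.
display(dbutils.fs.ls("file:/"))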
10-27-2021 10:48 AM
@Hubert Dudek

Hi Sir,

I am working in Community Edition. I tried the magic commands as well; no luck! It says the command is not recognized.
10-28-2021 01:50 AM
So it seems that in Community Edition you cannot access the filesystem directly. You only have access to DBFS storage, so you would need to upload the uncompressed files there. You need to prefix paths everywhere with dbfs:/; if that does not work for some function, that function simply will not work. As a last resort, you can try:
from zipfile import *
with ZipFile("dbfs:/FileStore/tables/flight_data.zip", "r") as zipObj:
    zipObj.extractall()
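If that also fails, note that Python's zipfile cannot open dbfs:/ URIs at all; it only understands real local paths. A common workaround (a sketch only, and not verified on Community Edition) is to copy the archive onto the driver's local disk with dbutils, extract it there, and copy the results back:

from zipfile import ZipFile

# Copy the archive out of DBFS onto the driver's local disk;
# dbutils.fs.cp understands both dbfs:/ and file:/ schemes.
dbutils.fs.cp("dbfs:/FileStore/tables/flight_data.zip", "file:/tmp/flight_data.zip")

# Extract with the standard library against the local copy.
with ZipFile("/tmp/flight_data.zip", "r") as zipObj:
    zipObj.extractall("/tmp/flight_data")

# Copy the extracted files back into DBFS so Spark can read them.
dbutils.fs.cp("file:/tmp/flight_data", "dbfs:/FileStore/tables/flight_data", recurse=True)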
10-28-2021 03:20 AM
@Hubert Dudek

Hi Sir, no luck with this way either. :(

Thank you for all the great suggestions though.
11-18-2021 09:55 AM
Hi @Goutam Pal,
Are you still having this issue? I think @Kaniz Fatma's example will work great to solve your issue.
11-18-2021 07:10 PM
@Jose Gonzalez @Kaniz Fatma: The issue still persists. Please find attached the screenshot of the error.
Thanks,
Goutam Pal
11-19-2021 08:47 AM
@Goutam Pal - Thank you for letting us know. I apologize for the inconvenience.
12-03-2021 10:51 AM
Hello Kaniz, I will try it and get back to you.
01-29-2022 11:20 AM
I changed the Databricks Runtime version on the cluster and it worked in my case. Thank you @Kaniz Fatma
07-19-2023 08:36 AM
What if changing the runtime is not an option? I'm experiencing a similar issue using the following:
%pip install -r /dbfs/path/to/file.txt
This worked for a while, but now I'm getting the Errno 2 mentioned above. I am still able to print the same file using dbutils.fs.head('dbfs:/path/to/file.txt')
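One possible workaround (a sketch, assuming dbutils can still reach the file, as the dbutils.fs.head call suggests) is to copy the requirements file to the driver's local disk first and then point pip at the local copy:

# Copy the requirements file from DBFS to the driver's local disk.
dbutils.fs.cp("dbfs:/path/to/file.txt", "file:/tmp/requirements.txt")

Then, in a separate cell:

%pip install -r /tmp/requirements.txt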