reading multiple csv files using pathos.multiprocessing

Prototype998
New Contributor III

I'm using PySpark and Pathos to read numerous CSV files and create many DF, but I keep getting this problem.

dbx_errorcode for the same:-

from pathos.multiprocessing import ProcessingPool

def readCsv(path):

  return spark.read.csv(path,header=True)

csv_file_list = [file[0][5:] for file in dbutils.fs.ls("/databricks-datasets/COVID/coronavirusdataset/") if file[1].endswith(".csv")]

pool = ProcessingPool(2)

results = pool.map(readCsv, csv_file_list)