Re: High Concurrency Pass Through Cluster : pyarro...

Hubert-Dudek · ‎01-20-2022

You need to use pandas library written on top of spark dataframes. Please use for example:

~~from pandas import read_csv~~

from pyspark.pandas import read_csv

pdf = read_csv("data.csv")

My blog: https://databrickster.medium.com/