Siebert_Looije
Contributor

Hi,

Thanks for your message.
You might want to load the Excel file directly into a Spark DataFrame instead of going through an intermediate dataframe. A couple of examples can be found in this Stack Overflow thread: https://stackoverflow.com/questions/56426069/how-to-read-xlsx-or-xls-files-as-spark-dataframe.
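
To give an idea of what that could look like, here is a minimal PySpark sketch of two common routes. It assumes the spark-excel package (format name `com.crealytics.spark.excel`) is installed on your cluster for the first route, and pandas with an Excel engine such as openpyxl for the second. The function names, the sheet name and the file path are placeholders of mine, so adjust them to your setup.

```python
def excel_reader_options(sheet_name="Sheet1", header=True, infer_schema=True):
    """Build reader options for the spark-excel data source (assumed to be installed)."""
    return {
        # Start reading at cell A1 of the given sheet.
        "dataAddress": f"'{sheet_name}'!A1",
        # Spark reader options are passed as strings.
        "header": str(header).lower(),
        "inferSchema": str(infer_schema).lower(),
    }


def read_excel_as_spark_df(spark, path, sheet_name="Sheet1"):
    """Route 1: read the Excel file straight into a Spark DataFrame."""
    return (
        spark.read.format("com.crealytics.spark.excel")
        .options(**excel_reader_options(sheet_name))
        .load(path)
    )


def read_excel_via_pandas(spark, path, sheet_name=0):
    """Route 2: read with pandas on the driver, then convert.

    Only suitable for files that fit in driver memory.
    """
    import pandas as pd  # imported here so route 1 works without pandas

    pdf = pd.read_excel(path, sheet_name=sheet_name)
    return spark.createDataFrame(pdf)
```

With an existing `spark` session you would call, for example, `df = read_excel_as_spark_df(spark, "/path/to/file.xlsx")`, where the path is a placeholder. The first route keeps the read distributed, while the second is simpler but loads the whole file on the driver.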

If this doesn't help, please let me know and I will dig deeper into it.

Kind regards,