Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-11-2024 02:11 AM
Hi @baert23 ,
To read Excel just use pandas and convert it to spark dataframe.
That's straightforward way to work with excel.
import pandas as pd
pandas_df = pd.read_excel("path_to_your_excel_file.xlsx")
spark_df = spark.createDataFrame(pandas_df)If you have still any issues, share the transformations you are doing. Let's see whether we can optimize further