- 4018 Views
- 7 replies
- 9 kudos
Today, many R packages come pre-installed on the standard Databricks clusters. Libraries like "tidyverse" and "ggplot2" are there, as is the great library "readxl" for loading Excel files. Unfortunately, its counterpart "writexl" is not pre-instal...
Latest Reply
I just need to figure out who decides which R packages are pre-installed on the cluster.
6 More Replies
by
Mado
• Valued Contributor II
- 12049 Views
- 4 replies
- 17 kudos
Hi, I want to read an Excel file by:
filepath_xlsx = "dbfs:/FileStore/data.xlsx"
sampleDF = (spark.read.format("com.crealytics.spark.excel")
    .option("Header", "true")
    .option("inferSchema", "false")
    .option("treatEmptyValuesAsNulls", ...
Latest Reply
Mado
Valued Contributor II
For this dataset, I also tried binary file reading as below:
xldf_xlsx = (
    spark.read.format("binaryFile")
    .option("pathGlobFilter", "*.xls*")
    .load(filepath_xlsx)
)
excel_content = xldf_xlsx.head(1)[0].content
file_like_obj = io.BytesIO(excel...
3 More Replies
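A side note on the binary-read approach: the glob "*.xls*" matches both legacy .xls and newer .xlsx files, and the two need different parsers. The sketch below is one possible way to check which kind of bytes came back before handing them to a reader. It uses only the Python standard library, so it runs without a cluster. The names sniff_excel_format and make_fake_xlsx are hypothetical helpers invented for this illustration, not part of Spark, pandas, or the original post, and the manifest check only shows that the bytes are an OOXML package (it cannot tell a workbook from, say, a .docx).

```python
import io
import zipfile

# Magic bytes that open every legacy .xls (OLE2 compound) file.
OLE2_MAGIC = bytes.fromhex("D0CF11E0A1B11AE1")


def sniff_excel_format(content: bytes) -> str:
    """Return "xlsx", "xls", or "unknown" for raw workbook bytes.

    Hypothetical helper: `content` stands in for the bytes taken from the
    `content` column of a binaryFile read.
    """
    if content.startswith(OLE2_MAGIC):
        return "xls"
    buffer = io.BytesIO(content)  # wrap the bytes as a file-like object
    if zipfile.is_zipfile(buffer):
        with zipfile.ZipFile(buffer) as archive:
            # OOXML packages keep this manifest at the archive root.
            if "[Content_Types].xml" in archive.namelist():
                return "xlsx"
    return "unknown"


def make_fake_xlsx() -> bytes:
    """Build a stand-in ZIP holding only the manifest (not a real workbook)."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("[Content_Types].xml", "<Types/>")
    return buffer.getvalue()


if __name__ == "__main__":
    print(sniff_excel_format(make_fake_xlsx()))
    print(sniff_excel_format(OLE2_MAGIC + b"\x00" * 8))
    print(sniff_excel_format(b"not a workbook"))
```

On a cluster, the same function could be called on the bytes read from the content column. Which reader to use afterwards (for example pandas.read_excel with a matching engine) is an assumption here, since the rest of the reply is cut off in the preview.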