I want to read an Excel xlsx file on DBR 17.3. On the Cluster the library dev.mauch:spark-excel_2.13:4.0.0_0.31.2 is installed.
V1 Implementation works fine:
df = spark.read.format("dev.mauch.spark.excel").schema(schema).load(excel_file)
display(df)
V2 throws [EXCEL_DATA_SOURCE_NOT_ENABLED] Excel data source is not enabled in this cluster. SQLSTATE: 56038
df = spark.read.format("excel").schema(schema).load(excel_file)
display(df)
https://learn.microsoft.com/en-gb/azure/databricks/error-messages/error-classes#excel_data_source_no...
https://docs.databricks.com/aws/en/error-messages/excel-file-error-error-class
I wonder, why do we have specific excel error codes and how do we enable the excel data source?