
EXCEL_DATA_SOURCE_NOT_ENABLED Excel data source is not enabled in this cluster

der
Contributor

I want to read an Excel .xlsx file on DBR 17.3. The library dev.mauch:spark-excel_2.13:4.0.0_0.31.2 is installed on the cluster.

The V1 implementation works fine:

df = spark.read.format("dev.mauch.spark.excel").schema(schema).load(excel_file)
display(df)

The V2 implementation throws [EXCEL_DATA_SOURCE_NOT_ENABLED] Excel data source is not enabled in this cluster. SQLSTATE: 56038

df = spark.read.format("excel").schema(schema).load(excel_file)
display(df)

https://learn.microsoft.com/en-gb/azure/databricks/error-messages/error-classes#excel_data_source_no...

https://docs.databricks.com/aws/en/error-messages/excel-file-error-error-class

Why does Databricks have Excel-specific error codes, and how do we enable the Excel data source?

 

 

1 REPLY

der
Contributor

If I build the spark-excel library with a different short name (for example, "excelv2"), everything works fine.

https://github.com/nightscape/spark-excel/issues/896#issuecomment-3486861693
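For anyone who cannot rebuild the library, a possible stopgap is to try the short name first and fall back to the fully qualified V1 name that works in the original post. The sketch below is only an illustration: the helper name read_excel and its parameters are hypothetical, and it assumes the error surfaces when load() is called and that the error class name appears in the exception message, as it does in the message quoted above.

```python
# Hypothetical helper (not part of spark-excel or Databricks): try each
# format name in turn and skip those rejected with the error class from
# this thread. Assumes the exception text contains the error class name.
ERROR_CLASS = "EXCEL_DATA_SOURCE_NOT_ENABLED"


def read_excel(spark, path, schema=None,
               formats=("excel", "dev.mauch.spark.excel")):
    last_error = None
    for fmt in formats:
        reader = spark.read.format(fmt)
        if schema is not None:
            reader = reader.schema(schema)
        try:
            return reader.load(path)
        except Exception as exc:
            # Re-raise anything unrelated (missing file, bad schema, ...).
            if ERROR_CLASS not in str(exc):
                raise
            last_error = exc
    if last_error is None:
        raise ValueError("no format names were given")
    # Every candidate format was rejected with the same error class.
    raise last_error


# Usage in a notebook, with the variables from the original post:
# df = read_excel(spark, excel_file, schema)
# display(df)
```

If the library is rebuilt with a custom short name such as "excelv2", that name can be passed first in formats instead.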