How to load xml files with spark-xml ?

leaw
Databricks Partner

Hello,

I cannot load xml files.

First, I tried to install Maven library com.databricks:spark-xml_2.12:0.14.0 as it is told in documentation, but I could not find it. I only have HyukjinKwon:spark-xml:0.1.1-s_2.10, with this one I have this error: DRIVER_LIBRARY_INSTALLATION_FAILURE. Error Message: Library resolution failed because unresolved dependency: com.databricks:spark-xml_2.12:0.17.0: not found

Then I tried to install library via dbfs using a JAR file. I tried spark_xml_2_12_0_15_0.jar and spark_xml_2_12_0_17_0.jar, doing this I progressed a little but I have now this error: java.lang.NoClassDefFoundError: scala/$less$colon$less

My cluster Runtime Version is: 13.3 LTS (includes Apache Spark 3.4.1, Scala 2.12)

I have to read my xml files via notebook, thank you in advance for your help.