01-12-2024 08:34 AM
Hello,
I cannot load XML files.
First, I tried to install the Maven library com.databricks:spark-xml_2.12:0.14.0 as described in the documentation, but I could not find it. The only one I can find is HyukjinKwon:spark-xml:0.1.1-s_2.10, and with that one I get this error: DRIVER_LIBRARY_INSTALLATION_FAILURE. Error Message: Library resolution failed because unresolved dependency: com.databricks:spark-xml_2.12:0.17.0: not found
Then I tried to install the library from DBFS using a JAR file. I tried spark_xml_2_12_0_15_0.jar and spark_xml_2_12_0_17_0.jar. This got me a little further, but now I get this error: java.lang.NoClassDefFoundError: scala/$less$colon$less
My cluster Runtime Version is: 13.3 LTS (includes Apache Spark 3.4.1, Scala 2.12)
I need to read my XML files from a notebook. Thank you in advance for your help.
01-12-2024 11:47 AM
Hi @leaw, you can install the Maven library on your cluster as below:
After that, you just need to follow the documentation: https://docs.databricks.com/en/query/formats/xml.html
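For reference, once the library is attached to the cluster, a read from a notebook might look like the minimal sketch below. It assumes the spark-xml data source is installed, that `spark` is the notebook's SparkSession, and that the helper name `read_xml`, the file path, and the `book` row tag are hypothetical placeholders for your own data.

```python
def read_xml(spark, path, row_tag):
    """Read an XML file into a DataFrame using the spark-xml data source.

    `row_tag` is the name of the XML element that should become one row.
    """
    return (
        spark.read.format("xml")    # short name registered by spark-xml
        .option("rowTag", row_tag)  # element treated as a single row
        .load(path)
    )


# Hypothetical usage in a Databricks notebook, where `spark` already exists:
# df = read_xml(spark, "dbfs:/tmp/books.xml", "book")
# df.printSchema()
```

If the library is not attached or does not match the cluster's Scala version, the `load` call is where the failure would usually surface.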
01-15-2024 04:15 AM
Thanks for your answer. I had already tried this, but I get an error because this library is not available in my Databricks workspace.
01-15-2024 04:23 AM
I think I have resolved my issue by downloading and adding the latest version of the JAR file for Scala 2.12, but I don't know if it is a long-term solution.
(Yesterday it was working, then it was not, then it was again; it is not very stable.)
If anybody else faces this problem, I would be grateful if you shared your experience with reading XML files in Databricks.
01-15-2024 05:49 AM
Hi @leaw, the option I suggested should have downloaded the JAR directly from Maven, but it seems that some issue is preventing the download.
01-15-2024 05:50 AM
Anyway, glad to know that you were able to find an alternate solution.
01-31-2024 04:03 AM
Hi All,
I installed spark-xml_2.13-0.17.0.jar on runtime 14.2 (Scala 2.12) and I am also getting this error when attempting to read XML. Any advice on how to resolve it would be appreciated.
"java.lang.NoClassDefFoundError: scala/$less$colon$less"
02-01-2024 01:29 AM
It was a Scala version mismatch (a 2.13 JAR on a Scala 2.12 runtime), my bad! Sorted.
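For anyone hitting the same error: `scala/$less$colon$less` is the encoded name of the class `scala.<:<`, which is a top-level class in Scala 2.13 but not in 2.12, so this error usually points to a 2.13 artifact running on a 2.12 runtime. A quick sanity check before attaching a JAR could look like the sketch below. The helper names are hypothetical, and it assumes the artifact follows the usual `name_<scala version>-<library version>.jar` or Maven coordinate naming.

```python
import re


def scala_suffix(artifact):
    """Return the Scala binary version embedded in an artifact name,
    e.g. '2.13' for 'spark-xml_2.13-0.17.0.jar', or None if absent."""
    match = re.search(r"_(\d+\.\d+)[-:]", artifact)
    return match.group(1) if match else None


def matches_cluster(artifact, cluster_scala):
    """True only when the artifact's Scala suffix equals the cluster's."""
    return scala_suffix(artifact) == cluster_scala


# The cluster's Scala version is listed in the runtime name,
# e.g. "13.3 LTS (includes Apache Spark 3.4.1, Scala 2.12)".
print(matches_cluster("spark-xml_2.13-0.17.0.jar", "2.12"))             # False
print(matches_cluster("com.databricks:spark-xml_2.12:0.17.0", "2.12"))  # True
```

Renamed files such as spark_xml_2_12_0_15_0.jar do not follow this pattern, so the check returns no suffix for them and the Scala version has to be confirmed from where the JAR was downloaded.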