<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to load xml files with spark-xml ? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/how-to-load-xml-files-with-spark-xml/m-p/57093#M30733</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I cannot load xml files.&lt;/P&gt;&lt;P&gt;First, I tried to install Maven library&amp;nbsp;&lt;SPAN&gt;com.databricks:spark-xml_2.12:0.14.0 as it is told in documentation, but I could not find it. I only have&amp;nbsp;HyukjinKwon:spark-xml:0.1.1-s_2.10, with this one I have this error: D&lt;SPAN class=""&gt;RIVER_LIBRARY_INSTALLATION_FAILURE. Error Message: Library resolution failed because unresolved dependency: com.databricks:spark-xml_2.12:0.17.0: not found&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Then I tried to install library via dbfs using a JAR file. I tried&amp;nbsp;spark_xml_2_12_0_15_0.jar and&amp;nbsp;spark_xml_2_12_0_17_0.jar, doing this I progressed a little but I have now this error:&amp;nbsp;&lt;STRONG&gt;java.lang.NoClassDefFoundError: scala/$less$colon$less&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;My cluster Runtime Version is:&amp;nbsp;&lt;SPAN&gt;13.3 LTS (includes Apache Spark 3.4.1, Scala 2.12)&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;I have to read my xml files via notebook, thank you in advance for your help.&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 12 Jan 2024 16:34:29 GMT</pubDate>
    <dc:creator>leaw</dc:creator>
    <dc:date>2024-01-12T16:34:29Z</dc:date>
    <item>
      <title>How to load xml files with spark-xml ?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-load-xml-files-with-spark-xml/m-p/57093#M30733</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I cannot load xml files.&lt;/P&gt;&lt;P&gt;First, I tried to install Maven library&amp;nbsp;&lt;SPAN&gt;com.databricks:spark-xml_2.12:0.14.0 as it is told in documentation, but I could not find it. I only have&amp;nbsp;HyukjinKwon:spark-xml:0.1.1-s_2.10, with this one I have this error: D&lt;SPAN class=""&gt;RIVER_LIBRARY_INSTALLATION_FAILURE. Error Message: Library resolution failed because unresolved dependency: com.databricks:spark-xml_2.12:0.17.0: not found&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Then I tried to install library via dbfs using a JAR file. I tried&amp;nbsp;spark_xml_2_12_0_15_0.jar and&amp;nbsp;spark_xml_2_12_0_17_0.jar, doing this I progressed a little but I have now this error:&amp;nbsp;&lt;STRONG&gt;java.lang.NoClassDefFoundError: scala/$less$colon$less&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;My cluster Runtime Version is:&amp;nbsp;&lt;SPAN&gt;13.3 LTS (includes Apache Spark 3.4.1, Scala 2.12)&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;I have to read my xml files via notebook, thank you in advance for your help.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 12 Jan 2024 16:34:29 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-load-xml-files-with-spark-xml/m-p/57093#M30733</guid>
      <dc:creator>leaw</dc:creator>
      <dc:date>2024-01-12T16:34:29Z</dc:date>
    </item>
    <item>
      <title>Re: How to load xml files with spark-xml ?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-load-xml-files-with-spark-xml/m-p/57195#M30734</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/92661"&gt;@leaw&lt;/a&gt;&amp;nbsp;, you can install the maven library in your cluster as below:&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screenshot 2024-01-13 at 1.15.45 AM.png" style="width: 999px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/5810iA163E578B9ED9A6E/image-size/large?v=v2&amp;amp;px=999" role="button" title="Screenshot 2024-01-13 at 1.15.45 AM.png" alt="Screenshot 2024-01-13 at 1.15.45 AM.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;After that you just need to follow the document:&amp;nbsp;&lt;A href="https://docs.databricks.com/en/query/formats/xml.html" target="_blank"&gt;https://docs.databricks.com/en/query/formats/xml.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 12 Jan 2024 19:47:50 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-load-xml-files-with-spark-xml/m-p/57195#M30734</guid>
      <dc:creator>Lakshay</dc:creator>
      <dc:date>2024-01-12T19:47:50Z</dc:date>
    </item>
    <item>
      <title>Re: How to load xml files with spark-xml ?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-load-xml-files-with-spark-xml/m-p/57355#M30761</link>
      <description>&lt;P&gt;Thanks for your answer. I had already tried this but I have an error as I don't have this library on my databricks.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="leaw_0-1705320875596.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/5828i9DBA701807F5DE7C/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400" role="button" title="leaw_0-1705320875596.png" alt="leaw_0-1705320875596.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 15 Jan 2024 12:15:48 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-load-xml-files-with-spark-xml/m-p/57355#M30761</guid>
      <dc:creator>leaw</dc:creator>
      <dc:date>2024-01-15T12:15:48Z</dc:date>
    </item>
    <item>
      <title>Re: How to load xml files with spark-xml ?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-load-xml-files-with-spark-xml/m-p/57357#M30762</link>
      <description>&lt;P&gt;I think I have resolved my issue by dowloading and adding last version jar file for scala 2.12, but I don't know if it is a long term solution.&lt;/P&gt;&lt;P&gt;(yesterday it was working then it was not, then yes, it is not very steady.)&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="leaw_1-1705321241586.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/5829i81B77206C3EA8396/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400" role="button" title="leaw_1-1705321241586.png" alt="leaw_1-1705321241586.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;If anybody faces this problem, I'll be grateful for sharing experience about reading xml files in databricks.&lt;/P&gt;</description>
      <pubDate>Mon, 15 Jan 2024 12:23:17 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-load-xml-files-with-spark-xml/m-p/57357#M30762</guid>
      <dc:creator>leaw</dc:creator>
      <dc:date>2024-01-15T12:23:17Z</dc:date>
    </item>
    <item>
      <title>Re: How to load xml files with spark-xml ?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-load-xml-files-with-spark-xml/m-p/57361#M30764</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/92661"&gt;@leaw&lt;/a&gt;&amp;nbsp;, The option I suggested should have downloaded the jar directly from maven but it seems like due to some issue it is unable to download.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 15 Jan 2024 13:49:56 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-load-xml-files-with-spark-xml/m-p/57361#M30764</guid>
      <dc:creator>Lakshay</dc:creator>
      <dc:date>2024-01-15T13:49:56Z</dc:date>
    </item>
    <item>
      <title>Re: How to load xml files with spark-xml ?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-load-xml-files-with-spark-xml/m-p/57362#M30765</link>
      <description>&lt;P&gt;Anyway, glad to know that you were able to find an alternate solution.&lt;/P&gt;</description>
      <pubDate>Mon, 15 Jan 2024 13:50:29 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-load-xml-files-with-spark-xml/m-p/57362#M30765</guid>
      <dc:creator>Lakshay</dc:creator>
      <dc:date>2024-01-15T13:50:29Z</dc:date>
    </item>
    <item>
      <title>Re: How to load xml files with spark-xml ?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-load-xml-files-with-spark-xml/m-p/58820#M31270</link>
      <description>&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;&amp;nbsp;Installed&amp;nbsp;&lt;SPAN&gt;spark-xml_2.13-0.17.0.jar on runtime 14.2&amp;nbsp; scala 2.12 and also receiving the error when attempting to read XML. Any advice would be appreciated around how to resolve.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;"&lt;STRONG&gt;java.lang.NoClassDefFoundError: scala/$less$colon$less"&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 31 Jan 2024 12:03:11 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-load-xml-files-with-spark-xml/m-p/58820#M31270</guid>
      <dc:creator>Frustrated_DE</dc:creator>
      <dc:date>2024-01-31T12:03:11Z</dc:date>
    </item>
    <item>
      <title>Re: How to load xml files with spark-xml ?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-load-xml-files-with-spark-xml/m-p/58966#M31304</link>
      <description>&lt;P&gt;Mismatch on Scala version, my bad! Sorted&lt;/P&gt;</description>
      <pubDate>Thu, 01 Feb 2024 09:29:39 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-load-xml-files-with-spark-xml/m-p/58966#M31304</guid>
      <dc:creator>Frustrated_DE</dc:creator>
      <dc:date>2024-02-01T09:29:39Z</dc:date>
    </item>
  </channel>
</rss>

