<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Getting spark/scala versioning issues while running the spark jobs through Jar in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/getting-spark-scala-versioning-issues-while-running-the-spark/m-p/31915#M23267</link>
    <description>&lt;P&gt;indeed you have to take into account the spark version.  Spark2 is not supported anymore by databricks.&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.microsoft.com/en-us/azure/databricks/release-notes/runtime/7.x-migration" alt="https://docs.microsoft.com/en-us/azure/databricks/release-notes/runtime/7.x-migration" target="_blank"&gt;This link might help&lt;/A&gt;.&lt;/P&gt;&lt;P&gt;About the toList method: it works fine here.  I think the issue here is that the clicked_at column is not considered to be a valid date column.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
    <pubDate>Tue, 13 Sep 2022 09:10:33 GMT</pubDate>
    <dc:creator>-werners-</dc:creator>
    <dc:date>2022-09-13T09:10:33Z</dc:date>
    <item>
      <title>Getting spark/scala versioning issues while running the spark jobs through Jar</title>
      <link>https://community.databricks.com/t5/data-engineering/getting-spark-scala-versioning-issues-while-running-the-spark/m-p/31914#M23266</link>
      <description>&lt;P&gt;&amp;nbsp;We tried moving our scala script from standalone cluster to databricks platform.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Our script is compatible with following version:&lt;/P&gt;&lt;P&gt;Spark: 2.4.8&amp;nbsp;Scala: 2.11.12&lt;/P&gt;&lt;P&gt;The databricks cluster has spark/scala following with version:&lt;/P&gt;&lt;P&gt;Spark: 3.2.1.&amp;nbsp;Scala: 2.12&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;1: we are not able to run the script through Jar in "Yarn" mode &lt;/P&gt;&lt;P&gt;2: Getting issues while parsing dates &lt;/P&gt;&lt;P&gt;Few date functions were not working on the current version of spark so we had to change the&amp;nbsp;timeParserPolicy in configuration when passed.&lt;/P&gt;&lt;P&gt;&lt;I&gt;Corresponding Error: Fail to parse '2022-07-07 02:14:26.233' in the new parser. You can set spark.sql.legacy.timeParserPolicy to LEGACY to restore the behavior before Spark 3.0, or set to CORRECTED and treat it as an invalid datetime string.&lt;/I&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;3: Getting issue in using ".toList" function.&lt;/P&gt;&lt;P&gt;var df2=df.select("clicked_at").distinct().map(f=&amp;gt;f.getDate(0)).collect().&lt;B&gt;toList&lt;/B&gt;&lt;/P&gt;&lt;P&gt;(this toList function is causing error)&lt;/P&gt;&lt;P&gt;&lt;I&gt;Corresponding Error: Exception in thread "main" java.lang.NoSuchMethodError: scala.Predef$.refArrayOps([Ljava/lang/Object;)Lscala/collection/mutable/ArrayOps&lt;/I&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Any help on this will be really helpful&lt;/P&gt;</description>
      <pubDate>Tue, 13 Sep 2022 07:18:06 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/getting-spark-scala-versioning-issues-while-running-the-spark/m-p/31914#M23266</guid>
      <dc:creator>Monika8991</dc:creator>
      <dc:date>2022-09-13T07:18:06Z</dc:date>
    </item>
    <item>
      <title>Re: Getting spark/scala versioning issues while running the spark jobs through Jar</title>
      <link>https://community.databricks.com/t5/data-engineering/getting-spark-scala-versioning-issues-while-running-the-spark/m-p/31915#M23267</link>
      <description>&lt;P&gt;indeed you have to take into account the spark version.  Spark2 is not supported anymore by databricks.&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.microsoft.com/en-us/azure/databricks/release-notes/runtime/7.x-migration" alt="https://docs.microsoft.com/en-us/azure/databricks/release-notes/runtime/7.x-migration" target="_blank"&gt;This link might help&lt;/A&gt;.&lt;/P&gt;&lt;P&gt;About the toList method: it works fine here.  I think the issue here is that the clicked_at column is not considered to be a valid date column.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 13 Sep 2022 09:10:33 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/getting-spark-scala-versioning-issues-while-running-the-spark/m-p/31915#M23267</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-09-13T09:10:33Z</dc:date>
    </item>
    <item>
      <title>Re: Getting spark/scala versioning issues while running the spark jobs through Jar</title>
      <link>https://community.databricks.com/t5/data-engineering/getting-spark-scala-versioning-issues-while-running-the-spark/m-p/31916#M23268</link>
      <description>&lt;P&gt;Hi @Monika Samant​&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;We'd love to hear from you.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 27 Sep 2022 07:29:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/getting-spark-scala-versioning-issues-while-running-the-spark/m-p/31916#M23268</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2022-09-27T07:29:27Z</dc:date>
    </item>
  </channel>
</rss>

