<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: I am getting ParseException: error while running the spark SQL query in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/i-am-getting-parseexception-error-while-running-the-spark-sql/m-p/34950#M25643</link>
    <description>&lt;P&gt;@Agha Zair Ali Thanks for looking into this. The error screenshot is below. I also tried adding backticks (`), but with no success.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="image.png"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/1632i03688EFAD425C0B9/image-size/large?v=v2&amp;amp;px=999" role="button" title="image.png" alt="image.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
    <pubDate>Wed, 10 Aug 2022 10:01:12 GMT</pubDate>
    <dc:creator>AJ270990</dc:creator>
    <dc:date>2022-08-10T10:01:12Z</dc:date>
    <item>
      <title>I am getting ParseException: error while running the spark SQL query</title>
      <link>https://community.databricks.com/t5/data-engineering/i-am-getting-parseexception-error-while-running-the-spark-sql/m-p/34948#M25641</link>
      <description>&lt;P&gt;I am using the code below to create the Spark session and load the CSV file. Both steps run fine, but the SQL query raises a ParseException.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;%python&lt;/P&gt;&lt;P&gt;from pyspark.sql import SparkSession&lt;/P&gt;&lt;P&gt;# Create a SparkSession&lt;/P&gt;&lt;P&gt;spark = (SparkSession&lt;/P&gt;&lt;P&gt;&amp;nbsp;.builder&lt;/P&gt;&lt;P&gt;&amp;nbsp;.appName("SparkSQLExampleApp")&lt;/P&gt;&lt;P&gt;&amp;nbsp;.getOrCreate())&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;# Path to data set&lt;/P&gt;&lt;P&gt;csv_file = "dbfs:/mnt/Testing.csv"&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;# Read and create a temporary view&lt;/P&gt;&lt;P&gt;# Infer schema (note that for larger files you&lt;/P&gt;&lt;P&gt;# may want to specify the schema)&lt;/P&gt;&lt;P&gt;df = (spark.read.format("csv")&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;.option("inferSchema", "true")&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;.option("header", "true")&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;.load(csv_file))&lt;/P&gt;&lt;P&gt;df.createOrReplaceTempView("US_CPSC_AEP_TBL")&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;spark.sql("""select sum(cast(enrollment as float)), sum(cast(growth as float)), [plan type], [Parent Organization], state, [Special Needs Plan], [Plan Name Sec A],&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;CASE when [Plan ID] between '800' and '899' then '899'&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;else '1'&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;END as plan_id&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;FROM US_CPSC_AEP_TBL&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;WHERE [Plan Name Sec A] is not null&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;group by [Parent Organization],[plan type], state, [Special Needs Plan], [Plan Name Sec A],&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;CASE when [Plan ID] between '800' and '899' then '899'&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;else '1'&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;END&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;having sum(cast(enrollment as float)) = 0 and sum(cast(growth as float)) = 0""")&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2022 06:54:30 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/i-am-getting-parseexception-error-while-running-the-spark-sql/m-p/34948#M25641</guid>
      <dc:creator>AJ270990</dc:creator>
      <dc:date>2022-08-10T06:54:30Z</dc:date>
    </item>
    <item>
      <title>Re: I am getting ParseException: error while running the spark SQL query</title>
      <link>https://community.databricks.com/t5/data-engineering/i-am-getting-parseexception-error-while-running-the-spark-sql/m-p/34949#M25642</link>
      <description>&lt;P&gt;Hi @Abhishek Jain,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Can you post the exact error as well? One thing to try: enclose your field names in backticks (`) rather than square brackets, e.g., `plan type`.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2022 07:38:47 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/i-am-getting-parseexception-error-while-running-the-spark-sql/m-p/34949#M25642</guid>
      <dc:creator>Zair</dc:creator>
      <dc:date>2022-08-10T07:38:47Z</dc:date>
    </item>
    <item>
      <title>Re: I am getting ParseException: error while running the spark SQL query</title>
      <link>https://community.databricks.com/t5/data-engineering/i-am-getting-parseexception-error-while-running-the-spark-sql/m-p/34950#M25643</link>
      <description>&lt;P&gt;@Agha Zair Ali Thanks for looking into this. The error screenshot is below. I also tried adding backticks (`), but with no success.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="image.png"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/1632i03688EFAD425C0B9/image-size/large?v=v2&amp;amp;px=999" role="button" title="image.png" alt="image.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 10 Aug 2022 10:01:12 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/i-am-getting-parseexception-error-while-running-the-spark-sql/m-p/34950#M25643</guid>
      <dc:creator>AJ270990</dc:creator>
      <dc:date>2022-08-10T10:01:12Z</dc:date>
    </item>
    <item>
      <title>Re: I am getting ParseException: error while running the spark SQL query</title>
      <link>https://community.databricks.com/t5/data-engineering/i-am-getting-parseexception-error-while-running-the-spark-sql/m-p/34951#M25644</link>
      <description>&lt;P&gt;This is resolved. The query below, which quotes the column names with backticks instead of square brackets, works fine now:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;sqldf = spark.sql("select sum(cast(enrollment as float)), sum(cast(growth as float)),`plan type`,`Parent Organization`,state,`Special Needs Plan`,`Plan Name Sec A`, CASE when `Plan ID` between '800' and '899' then '899' else '1' END as plan_id from US_CPSC_AEP_TBL WHERE `Plan Name Sec A` is not null group by `Parent Organization`,`plan type`, state, `Special Needs Plan`, `Plan Name Sec A`, CASE when `Plan ID` between '800' and '899' then '899' else '1' END having sum(cast(enrollment as float)) = 0 and sum(cast(growth as float)) = 0")&lt;/P&gt;</description>
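      <!--
A minimal sketch, not taken from the thread: the fix in this post amounts to
replacing T-SQL style [name] identifiers with the backtick-quoted `name` form
that Spark SQL expects. The helper name brackets_to_backticks is hypothetical,
and the regex is deliberately naive. It assumes the query contains no array
subscripts such as arr[0] and no bracket characters inside string literals,
since both would be rewritten incorrectly.

```python
import re

# Naive pattern: any text between [ and ] is treated as a quoted identifier.
# Assumption: no array subscripts and no brackets inside string literals.
_BRACKETED = re.compile(r"\[([^\[\]]+)\]")


def brackets_to_backticks(query: str) -> str:
    """Rewrite [name] identifiers as `name` identifiers (hypothetical helper)."""
    return _BRACKETED.sub(lambda m: "`" + m.group(1) + "`", query)


original = (
    "select [plan type], state from US_CPSC_AEP_TBL "
    "where [Plan Name Sec A] is not null"
)
print(brackets_to_backticks(original))
```

The rewritten string could then be passed to spark.sql(...) as in the post
above. For a handful of columns, editing the query by hand is probably simpler.
      -->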
      <pubDate>Thu, 11 Aug 2022 05:49:33 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/i-am-getting-parseexception-error-while-running-the-spark-sql/m-p/34951#M25644</guid>
      <dc:creator>AJ270990</dc:creator>
      <dc:date>2022-08-11T05:49:33Z</dc:date>
    </item>
  </channel>
</rss>

