<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic If I write pandas code using koalas and have photon enabled, will my pandas code run on photon? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/if-i-write-pandas-code-using-koalas-and-have-photon-enabled-will/m-p/26238#M18346</link>
    <description />
    <pubDate>Fri, 04 Jun 2021 19:35:28 GMT</pubDate>
    <dc:creator>User16752240150</dc:creator>
    <dc:date>2021-06-04T19:35:28Z</dc:date>
    <item>
      <title>If I write pandas code using koalas and have photon enabled, will my pandas code run on photon?</title>
      <link>https://community.databricks.com/t5/data-engineering/if-i-write-pandas-code-using-koalas-and-have-photon-enabled-will/m-p/26238#M18346</link>
      <description />
      <pubDate>Fri, 04 Jun 2021 19:35:28 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/if-i-write-pandas-code-using-koalas-and-have-photon-enabled-will/m-p/26238#M18346</guid>
      <dc:creator>User16752240150</dc:creator>
      <dc:date>2021-06-04T19:35:28Z</dc:date>
    </item>
    <item>
      <title>Re: If I write pandas code using koalas and have photon enabled, will my pandas code run on photon?</title>
      <link>https://community.databricks.com/t5/data-engineering/if-i-write-pandas-code-using-koalas-and-have-photon-enabled-will/m-p/65786#M32920</link>
      <description>&lt;P&gt;Hi there! Appreciate this reply is 3 years later than it was originally asked, but people might be coming across it still. A few things:&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;Koalas was deprecated in spark 3.2 (runtime 10.4). Instead, the recommendation is to use pandas on spark with `import pyspark.pandas as ps` You can find a link &lt;A href="https://spark.apache.org/docs/latest/api/python/migration_guide/koalas_to_pyspark.html" target="_blank" rel="noopener"&gt;here&lt;/A&gt; to the spark migration guide, and here for &lt;A href="https://spark.apache.org/docs/3.5.0/api/python/user_guide/pandas_on_spark/index.html" target="_blank" rel="noopener"&gt;more usage&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;As of writing, photon works with SQL and equivalent &lt;A href="https://docs.databricks.com/en/compute/photon.html#what-is-photon-used-for" target="_blank" rel="noopener"&gt;DataFrame API&lt;/A&gt; statements. So SQL-ish statements like filter, join, and aggregates will work, but more complex ones for analytics or data science it won't.&lt;/LI&gt;
&lt;LI&gt;In the future, there may be more functionality bought out, but keep in mind that &lt;A href="https://docs.databricks.com/en/compute/photon.html#limitations" target="_blank" rel="noopener"&gt;UDFs and RDDs&lt;/A&gt; are unlikely to ever work with photon as they bypass sparks catalyst optimizer which is needed for it to work.&amp;nbsp;&lt;/LI&gt;
&lt;/OL&gt;</description>
      <pubDate>Mon, 08 Apr 2024 10:30:16 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/if-i-write-pandas-code-using-koalas-and-have-photon-enabled-will/m-p/65786#M32920</guid>
      <dc:creator>holly</dc:creator>
      <dc:date>2024-04-08T10:30:16Z</dc:date>
    </item>
  </channel>
</rss>

