
If I write pandas code using koalas and have photon enabled, will my pandas code run on photon?

User16752240150
New Contributor II
 
1 ACCEPTED SOLUTION

Accepted Solutions

holly
New Contributor III

Hi there! Appreciate this reply is 3 years later than it was originally asked, but people might be coming across it still. A few things:

  1. Koalas was deprecated as of Spark 3.2 (runtime 10.4). Instead, the recommendation is to use the pandas API on Spark with `import pyspark.pandas as ps`. You can find a link here to the Spark migration guide, and here for more usage.
  2. As of writing, Photon works with SQL and the equivalent DataFrame API statements. So SQL-ish operations like filter, join, and aggregate will run on Photon, but more complex ones for analytics or data science won't.
  3. In the future, more functionality may be brought out, but keep in mind that UDFs and RDDs are unlikely to ever work with Photon, as they bypass Spark's Catalyst optimizer, which Photon needs in order to work.
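
If you want to check this for your own code, one option is to look at the physical plan: as far as I know, operators that run on Photon show up there with names starting with `Photon`. Below is a minimal sketch of that check, not an official recipe. The helper names (`photon_operators`, `plan_text_of`, `check_example`) and the sample columns are made up for illustration, and it assumes that `explain()` prints its plan from Python so the text can be captured from stdout.

```python
import contextlib
import io
import re


def photon_operators(plan_text):
    """Return the distinct operator names in a plan that start with 'Photon'."""
    return sorted(set(re.findall(r"\bPhoton\w*", plan_text)))


def plan_text_of(psdf):
    """Capture the text that psdf.spark.explain() prints for a pandas-on-Spark DataFrame."""
    buffer = io.StringIO()
    with contextlib.redirect_stdout(buffer):
        psdf.spark.explain()
    return buffer.getvalue()


def check_example():
    # Imported lazily so the helpers above can be used without Spark installed.
    import pyspark.pandas as ps

    # Hypothetical data; the column names are illustrative only.
    psdf = ps.DataFrame({"team": ["a", "b", "a"], "score": [1, 2, 3]})

    # A SQL-ish filter plus aggregate, written in pandas style.
    totals = psdf[psdf["score"] > 1].groupby("team").sum()
    return photon_operators(plan_text_of(totals))
```

Calling `check_example()` on a cluster returns the list of Photon operator names found in the plan. An empty list only means none appeared for that particular query on that cluster (for example, Photon is not enabled, or the operation is not supported), so treat it as a hint rather than proof.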

