
Converting between Pandas and Koalas

Anonymous

When and why should I convert between a pandas DataFrame and a Koalas DataFrame? What are the implications?

1 ACCEPTED SOLUTION


Ryan_Chynoweth
Honored Contributor III
Koalas DataFrames are distributed across a Databricks cluster, the same way Spark DataFrames are. pandas DataFrames live only in memory on the Spark driver. If you are a pandas user working on a multi-node cluster, you should use Koalas to process the data. If you can use a single-node Databricks cluster, pandas could fit your needs, since the data likely fits on a single machine.
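
To make the rule of thumb above concrete, here is a minimal sketch. The helper names (`choose_dataframe_library`, `pandas_to_koalas`, `koalas_to_pandas`) are hypothetical and only for illustration. The conversion calls assume the `databricks.koalas` package is installed; on Spark 3.2 and later the same API ships as `pyspark.pandas`, so the import may need to change on newer runtimes.

```python
def choose_dataframe_library(num_worker_nodes: int, data_fits_on_driver: bool) -> str:
    """Hypothetical helper encoding the rule of thumb from the answer.

    A single-node cluster is modeled here as one with zero worker nodes.
    """
    if num_worker_nodes == 0 and data_fits_on_driver:
        # Everything runs on one machine anyway, so plain pandas is enough.
        return "pandas"
    # Multi-node cluster: use Koalas so the workers share the work.
    # Returning "koalas" when the data does not fit on the driver is an
    # assumption of this sketch, not something the answer states.
    return "koalas"


def pandas_to_koalas(pdf):
    """Distribute a driver-local pandas DataFrame across the cluster."""
    # Imported lazily so this module loads even where Spark is not installed.
    import databricks.koalas as ks

    return ks.from_pandas(pdf)


def koalas_to_pandas(kdf):
    """Collect a Koalas DataFrame back into a pandas DataFrame.

    This pulls every row into driver memory, so it is only safe when the
    result is small enough to fit there.
    """
    return kdf.to_pandas()


if __name__ == "__main__":
    print(choose_dataframe_library(num_worker_nodes=0, data_fits_on_driver=True))
    print(choose_dataframe_library(num_worker_nodes=4, data_fits_on_driver=True))
```

The main implication to keep in mind is the direction of the conversion: going from pandas to Koalas spreads data out over the cluster, while going from Koalas to pandas collects it all onto the driver, which can run out of memory on large datasets.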

