by Qarol • New Contributor
- 686 Views
- 2 replies
- 0 kudos
I have a database with two columns: name (str) and probability (float). I am running this command: df[['name','probability']].groupby('name').prod() on a Databricks (runtime 7.3) notebook, and df is a Koalas DataFrame. The error I get is: PandasNotImpleme...
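If the grouped product is not available in that Koalas version (the truncated error name suggests so, but that is an assumption), one common workaround is to replace the product with a sum of logarithms and exponentiate the result, since grouped sums are widely supported. The sketch below shows only the idea, in plain Python rather than Koalas; the sample rows and the `grouped_product` helper are hypothetical, and it assumes every probability is strictly positive.

```python
import math
from collections import defaultdict


def grouped_product(rows):
    """Return {name: product of probabilities} via a sum of logs.

    Assumes every probability is strictly positive; math.log raises
    ValueError for 0, so zeros would need separate handling.
    """
    log_sums = defaultdict(float)
    for name, probability in rows:
        # prod(p_i) == exp(sum(log(p_i))) for positive p_i
        log_sums[name] += math.log(probability)
    return {name: math.exp(total) for name, total in log_sums.items()}


# Hypothetical sample data, not taken from the original post.
rows = [("a", 0.5), ("a", 0.5), ("b", 0.2), ("b", 0.5), ("b", 1.0)]
result = grouped_product(rows)
print(result)
```

On a distributed DataFrame the same identity would translate to taking the log of the probability column, grouping by name, summing, and exponentiating. Results can differ from a direct product by floating-point rounding.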
- 1553 Views
- 4 replies
- 1 kudos
I have a Delta Live Table that I'm trying to run GroupBy on, but I'm getting an error: "RuntimeError: Query function must return either a Spark or Koalas DataFrame". Here is my code:
@dlt.table
def groups_hierarchy():
    df = dlt.read_stream("groups_h...
Latest Reply
Hi @Preben Olsen, does @Debayan Mukherjee's response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly? We'd love to hear from you. Thanks!
- 653 Views
- 1 reply
- 0 kudos
When and why should I convert between a pandas and a Koalas DataFrame? What are the implications?
Latest Reply
Koalas DataFrames are distributed across a Databricks cluster, just as Spark DataFrames are. pandas DataFrames live only in memory on the Spark driver. If you are a pandas user and are using a multi-node cluster then you should use koalas to p...