by
Qarol
• New Contributor
- 1503 Views
- 2 replies
- 0 kudos
0I have a database with two columns: name (str) and probability (float).I am running this command:df[['name','probability']].groupby('name').prod()on a Databricks (runtime 7.3) notebook and df is a koalas dataframe.The error I get is:PandasNotImpleme...
- 1503 Views
- 2 replies
- 0 kudos
- 3384 Views
- 4 replies
- 1 kudos
I have a delta live table that I'm trying to run GroupBy on, but getting an error: "RuntimeError: Query function must return either a Spark or Koalas DataFrame". Here is my code:@dlt.table
def groups_hierarchy():
df = dlt.read_stream("groups_h...
- 3384 Views
- 4 replies
- 1 kudos
Latest Reply
Hi @Preben Olsen Does @Debayan Mukherjee response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly?We'd love to hear from you.Thanks!
3 More Replies
- 1197 Views
- 1 replies
- 0 kudos
When and why should I convert b/w a Pandas to Koalas dataframe? What are the implications?
- 1197 Views
- 1 replies
- 0 kudos
Latest Reply
Koalas is distributed on a Databricks cluster similar to how Spark dataframes are also distributed. Pandas dataframes only live on the spark driver in memory. If you are a pandas user and are using a multi-node cluster then you should use koalas to p...