Community Platform Discussions
Connect with fellow community members to discuss general topics related to the Databricks platform, industry trends, and best practices. Share experiences, ask questions, and foster collaboration within the community.

Pivot on multiple columns

memo
New Contributor II

I want to pass multiple columns as arguments when pivoting a DataFrame in PySpark, something like:

mydf.groupBy("id").pivot("day", "city").agg(F.sum("price").alias("price"), F.sum("units").alias("units")).show()
 
One way I found is to create a separate DataFrame for each pivot and join them, but that results in multiple scans of the data. Is there any other way to do this?
3 REPLIES

Kaniz_Fatma
Community Manager

Hi @memo, certainly! You can achieve a more compact solution for pivoting multiple columns in PySpark. Instead of creating a separate DataFrame for each pivot by hand, you can use a function to handle the pivoting process.

memo
New Contributor II

But how would I pass multiple values to the pivot function? It only takes one argument. I tried passing an array and a list, but both throw errors.

Kaniz_Fatma
Community Manager

Hi @memo, let's call this function pivot_udf. Here's how you can implement it:

from pyspark.sql import functions as F

def pivot_udf(df, *cols):
    # Start from the distinct ids; each loop iteration joins on one
    # pivoted value column.
    mydf = df.select('id').drop_duplicates()
    for c in cols:
        mydf = mydf.join(
            # Build a combined label such as 'price_1', then pivot on it.
            df.withColumn('combcol', F.concat(F.lit(f'{c}_'), df['day']))
            .groupby('id')
            .pivot('combcol')
            .agg(F.first(c)),
            'id'
        )
    return mydf

# Example usage:
d = [
    (100, 1, 23, 10),
    (100, 2, 45, 11),
    # ... other data ...
]
mydf = spark.createDataFrame(d, ['id', 'day', 'price', 'units'])

# Pivot on 'price' and 'units'
result_df = pivot_udf(mydf, 'price', 'units')
result_df.show()

The resulting DataFrame will have columns for each combination of day and column (e.g., price_1, price_2, units_1, etc.). 

This approach wraps the per-column pivots and joins in a single function, giving a more compact solution.

Feel free to adapt the pivot_udf function to your specific use case by adding more columns as needed. Happy pivoting! 🚀
