โ02-21-2024 11:57 AM
Hi guys,
You have any idea how can I do a groupBy without aggregation (Pyspark API)
like:
df.groupBy('field1', 'field2', 'field3')
My target is make a group but in this case is not necessary count records or aggregation
Thank you
โ02-21-2024 04:28 PM
df.select("field1","field2","field3").distinct()
do you mean get distinct rows for selected column?
never-displayed
Passionate about hosting events and connecting people? Help us grow a vibrant local communityโsign up today to get started!