groupBy without aggregation (Pyspark API)
Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-21-2024 11:57 AM
Hi guys,
You have any idea how can I do a groupBy without aggregation (Pyspark API)
like:
df.groupBy('field1', 'field2', 'field3')
My target is make a group but in this case is not necessary count records or aggregation
Thank you
Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-21-2024 04:28 PM
df.select("field1","field2","field3").distinct()do you mean get distinct rows for selected column?