I have a dataframe with a schema similar to the following:
id: string
array_field: array
element: struct
field1: string
field2: string
array_field2: array
element: struct
nested_field: string
I am trying to flatten this into rows.
The issue that I am having is that when I do something like e_df = df.select("id", F.explode("array_field")) it is only returning the exploded values for the first id. I am not sure if this is something simple, but I have wasted a lot of time trying to sort out what the issue is. When I look every id has an associated array field and I would think that the result should be something like:
id, col
1, first element struct
1, second element struct
2, first element struct
2, second element struct
and so on. Any insight here would be very helpful.