Re: Created nested struct schema SPARK - Schema Ji...

weldermartins · ‎10-10-2022

It stayed the same.

-werners- · ‎10-11-2022

If columns are missing, that particular data is not present in the JSON. I am not aware of Spark skipping columns when reading JSON with inferSchema. There is an option, dropFieldIfAllNull, but it is false by default.

That makes me think: you might want to look into the options of read.json:

https://spark.apache.org/docs/latest/sql-data-sources-json.html

weldermartins · ‎10-11-2022

@Werner Stinckens Thanks for the support.

Now it's working. When the message came back saying the input was not parallelized, I searched and found the answer. When creating the DataFrame, I changed it to:

df = spark.read.json(sc.parallelize([answer.text]))
