I don't understand why from_utc_time(col("original_time"), "Europe/Berlin") changes the timestamp instead of just setting the timezone. That's a non-intuitive behaviour. spark.conf.set("spark.sql.session.timeZone", "UTC")from pyspark.sql import Row...