Panda
Valued Contributor

@阳光彩虹小白马

The issue you're encountering seems to involve inconsistent behavior in partition overwrites using PySpark with ADLS.

Can you validate the below along with what @Himanshu6 mentioned.

  1. Force Spark to refresh the metadata of the data lake directory.
  2. Ensure that the mode(partitionOverwriteMode) is set properly before executing the overwrite operation.