I am trying to read a folder with partition files where each partition is date/hour/timestamp.csv where timestamp is the exact timestamp in ISO format, e.g. 09-2022-12-05T20:35:15.2786966Z
It seems like spark having issues with reading files with colons, which is quiet ridiculous.
The issue was raised in 2019 https://issues.apache.org/jira/browse/SPARK-28841 but it seems like the issue remains.
What is the solution, rather than now renaming zillions of file names, which in s3 requires copy...
I am using Spark 3.2.1