Hi aleksandra_ch,
Thank you so much for the detailed explanation! I feel a bit embarrassed realizing I hadn't thoroughly checked the documentation before asking.
As you pointed out, since my Auto Loader runs as an hourly batch, the "7 incremental + 1 full listing" logic perfectly explains why I was seeing performance spikes every 8 hours. After discovering that the default for useIncrementalListing was changed to false in the March 10 release, I explicitly set it to true, and the issue has been resolved.
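For anyone who finds this thread later, here is roughly the shape of the change as a minimal sketch. The paths, file format, and function names are placeholders I made up for illustration, not our real pipeline, and the cadence helper simply encodes the "7 incremental + 1 full listing" pattern as it was explained above, so treat it as an assumption rather than a statement of how Auto Loader is implemented.

```python
# Sketch only: paths, file format, and helper names are hypothetical placeholders.

def autoloader_options(schema_location: str, file_format: str = "json") -> dict:
    """Build Auto Loader options with incremental listing explicitly enabled."""
    return {
        "cloudFiles.format": file_format,
        "cloudFiles.schemaLocation": schema_location,
        # Set explicitly instead of relying on whatever the default is.
        "cloudFiles.useIncrementalListing": "true",
    }


def is_full_listing_run(run_number: int, incremental_runs: int = 7) -> bool:
    """Return True when a 1-based run number lands on the full listing,
    assuming a repeating cycle of `incremental_runs` incremental listings
    followed by one full listing."""
    return run_number % (incremental_runs + 1) == 0


def run_hourly_batch(spark, source_path: str, target_table: str, checkpoint: str):
    """Start one batch-style Auto Loader run; `spark` is an existing SparkSession."""
    reader = spark.readStream.format("cloudFiles")
    for key, value in autoloader_options(checkpoint + "/schema").items():
        reader = reader.option(key, value)
    return (
        reader.load(source_path)
        .writeStream
        .option("checkpointLocation", checkpoint)
        .trigger(availableNow=True)  # process what is available, then stop
        .toTable(target_table)
    )
```

With an hourly schedule, `is_full_listing_run` is true for runs 8, 16, 24 and so on, which lines up with the 8-hour spikes I was seeing.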
I am aware that relying on incremental listing alone carries a risk of missing files. However, our S3 data is Hive-partitioned (year/month/day/hour) and the filenames themselves include timestamps, so the risk seems low, though I agree it's not 100% foolproof.
Also, your P.S. was a real eye-opener! I had always associated "File Events" mode exclusively with real-time streaming, so I hadn't even explored using it for our hourly batches. I'll definitely look into implementing that to see if it provides better stability for our pipeline.
Thank you again for your help and for sharing such great insights.
Best regards, Jiwon