How to increase Auto Loader speed while working with S3 (AWS)
12-04-2024 08:10 AM
Hey everyone!
I am trying to switch from a batch processing job to Auto Loader (directory listing mode) on an S3 bucket that holds millions of files. I am using modifiedAfter to create my initial checkpoint for Auto Loader and want to speed up the process. Does anyone have suggestions for tuning the workflow?
I read that Auto Loader retrieves 1,000 files per list call on S3. Does that bottleneck the process?
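For a rough sense of scale, here is a back-of-the-envelope sketch in Python. It assumes every listing call returns at most 1,000 keys (the page size of the S3 list API) and that calls run strictly one after another. The function name and the 100 ms per-call latency are made up for illustration; they are not measured values or a description of Auto Loader internals.

```python
import math

# S3 list requests return at most 1,000 keys per call.
S3_MAX_KEYS_PER_LIST_CALL = 1000


def estimate_listing(num_files: int, ms_per_call: int = 100) -> tuple[int, float]:
    """Estimate sequential list calls and total listing time in seconds.

    ms_per_call is a hypothetical round-trip latency, not a measured figure.
    """
    calls = math.ceil(num_files / S3_MAX_KEYS_PER_LIST_CALL)
    seconds = calls * ms_per_call / 1000
    return calls, seconds


# Example: a bucket prefix holding 5 million objects.
calls, seconds = estimate_listing(5_000_000)
print(f"{calls} list calls, about {seconds:.0f} s if done sequentially")
```

Under these assumptions the listing alone takes thousands of round trips, so the page size would matter mostly when the whole prefix has to be listed on every run.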
Labels: Delta Lake, Spark, Workflows
1 REPLY
12-04-2024 08:41 AM
What mode are you using? File notification?

