That makes sense @Anand Laddaโ !
One major improvement that will have a direct impact on the performance is the architectural difference. S3-SQS uses an internal implementation of the Delta table to store the checkpoint details about the source files. From customers we have seen the usage of Delta table for checkpointing the source files details is not the efficient way. A database that ensures faster retrieval and insertion is needed. Hence the Auto-loader has improved checkpointing using RocksBD. This will have direct performance improvement on the streaming queries.
Some of the issues faced in S3-SQS and addressed in Auto loader are below:
- Latency in starting the streaming query
- Streaming query pause every one hour for a long time
- Synchronous fetching and deleting causing issues.