- Subscribe to RSS Feed
- Mark as New
- Mark as Read
- Bookmark
- Subscribe
- Printer Friendly Page
- Report Inappropriate Content
In this video, Vikas Reddy, a Staff Backline Engineer at Databricks, goes over the use and importance of using checkpointing in streaming applications. Asynchronous state checkpointing maintains exactly-once guarantees for streaming queries but can reduce overall latency for some Structured Streaming stateful workloads bottlenecked on state updates. This is accomplished by beginning to process the next micro-batch as soon as the computation of the previous micro-batch has been completed without waiting for state checkpointing to complete. This is explained in a lot more detail (with various examples and demos!) in the 1 hour long video here. Target Audience - Data Engineers, Spark Engineers, Data/Solutions Architects
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.