- Spark Streaming
- Structured Streaming
- micro-batch processing
- continuous processing
- Distributed Snapshots: Determining Global States of Distributed Systems
- MillWheel: Fault-Tolerant Stream Processing at Internet Scale
- The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing
- Lightweight Asynchronous Snapshots for Distributed Dataflows
- Discretized Streams: Fault-Tolerant Streaming Computation at Scale