Slide 24
Slide 24 text
state.backend.local-recovery: true
Local recovery: Only re-download the state on
failed machines
After a failure without local recovery:
All TaskManagers download the state
TM1 TM2 TM3 TM4
1 TM4 fails
TM1 TM2 TM3 TM4
2 Recovery
With local recovery: Most machines use local
disks, only one needs to download
TM1 TM2 TM3 TM4
1 TM4 fails
TM1 TM2 TM3 TM4
2 Recovery
Reliable, Fast Checkpointing