Slide 27
© 2016 Mesosphere, Inc. All Rights Reserved. 27
● Don’t have “backup” scheduler instances running all the time
○ Instead, use Marathon to launch scheduler instances, detect when they
have failed, and launch replacements
● Similar to the approach described before
○ Still achieve HA via redundancy
○ Still at most one leading scheduler at any given time
● Tradeoffs:
○ Simpler to write
○ Depends on Marathon's behavior for deciding when to launch
replacement schedulers, how to handle network partitions, etc.
● Recommended: still use ZooKeeper for leader election and to ensure mutual
exclusion on updates to scheduler state
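The mutual-exclusion recommendation can be illustrated with a versioned (compare-and-set) write: an update succeeds only if the state has not changed since it was read. ZooKeeper offers this through the expected-version argument on writes to a znode. The sketch below does not use a ZooKeeper client; `VersionedStore` is a hypothetical in-memory stand-in, shown only to make clear why a stale scheduler's write is rejected once a replacement has taken over.

```python
class VersionedStore:
    """Hypothetical in-memory stand-in for a versioned store (not a ZooKeeper client)."""

    def __init__(self):
        self._data = None
        self._version = 0

    def read(self):
        # Return the current state together with its version.
        return self._data, self._version

    def write(self, data, expected_version):
        # Reject the write if the state changed since the caller's read.
        if expected_version != self._version:
            return False
        self._data = data
        self._version += 1
        return True


store = VersionedStore()
_, v_old = store.read()  # old scheduler A reads the state
_, v_new = store.read()  # replacement scheduler B reads the same state

ok_b = store.write({"tasks": ["t1"]}, v_new)  # B writes first and succeeds
ok_a = store.write({"tasks": []}, v_old)      # A's stale write is rejected
print(ok_b, ok_a)
```

With a real ZooKeeper ensemble the same check happens on the server, so it still holds when Marathon has started a replacement while the old scheduler is alive but partitioned.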
Alternative: Use Marathon
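As a rough illustration of the approach on this slide, the sketch below builds a Marathon app definition that runs a single scheduler instance with an HTTP health check, so that Marathon can detect failure and launch a replacement. The app id, command line, `--zk` flag, ZooKeeper address, resource sizes, and `/health` path are all assumptions made for the example, not values from the talk; the field names follow the Marathon app definition format.

```python
import json


def scheduler_app_definition(app_id, cmd, health_path="/health"):
    """Build a Marathon app definition for one scheduler instance (illustrative values)."""
    return {
        "id": app_id,
        "cmd": cmd,
        "cpus": 0.5,      # assumed resource sizes
        "mem": 512,
        "instances": 1,   # no standby schedulers; Marathon relaunches on failure
        "ports": [0],     # ask Marathon to assign one port for the health check
        "healthChecks": [
            {
                "protocol": "HTTP",
                "path": health_path,  # hypothetical health endpoint
                "portIndex": 0,
                "gracePeriodSeconds": 60,
                "intervalSeconds": 10,
                "timeoutSeconds": 5,
                "maxConsecutiveFailures": 3,
            }
        ],
        # On redeploys, stop the old instance before starting the new one.
        "upgradeStrategy": {
            "minimumHealthCapacity": 0,
            "maximumOverCapacity": 0,
        },
    }


app = scheduler_app_definition(
    "/my-framework/scheduler",  # hypothetical app id
    "./bin/scheduler --zk zk://zk.example:2181/my-framework",  # hypothetical command
)
payload = json.dumps(app, indent=2)
print(payload)
```

The resulting JSON could be submitted to Marathon's `/v2/apps` endpoint. Marathon settings alone do not rule out two live schedulers during a network partition, which is why the slide still recommends ZooKeeper for leader election.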