When Worst is Best in Distributed Systems Design

pbailis
September 25, 2015

When Worst is Best in Distributed Systems Design

StrangeLoop 2015
25 September 2015
St. Louis, MO

Talk video: https://www.youtube.com/watch?v=ZGIAypUUwoQ
More information: http://bailis.org/

In many areas of systems design, provisioning for worst-case behavior (e.g., load spikes and anomalous user activity) incurs sizable penalties (e.g., performance and operational overheads) in the typical and best cases. However, in distributed systems, building software that is resilient to worst-case network behavior can -- perhaps paradoxically -- lead to improved behavior in typical and best-case scenarios. That is, systems that don't rely on synchronous communication (or coordination) in the worst case frequently aren't forced to wait in any case -- improving latency, scalability, and performance via increased concurrency.

In this talk, we'll explore how to use this worst-case analysis as a more general design principle for scalable systems design. As developers, we are increasingly interacting with and building our own distributed systems, and we tend to fixate only on failure scenarios (e.g., "partition tolerance" in the CAP Theorem); this is an important first step, but it's not the whole story. To illustrate why, I'll present practical lessons learned from applying this principle to both web and transaction processing applications as well as database internals such as integrity constraints and indexes. We've found considerable evidence that many of these common tasks and workloads can benefit substantially (e.g., regular order-of-magnitude speedups) from this analysis. In all likelihood, you can too.


Transcript

  1. WHEN WORST IS BEST Peter Bailis Stanford CS @pbailis in

    distributed systems design StrangeLoop 2015 25 September, St. Louis
  2. What if we designed computer systems for worst case scenarios?

  3. Cluster provisioning: 7.3B simultaneous users many idle resources! What if

    we designed computer systems for worst case scenarios?
  4. Cluster provisioning: 7.3B simultaneous users many idle resources! Hardware: chips

    for the next Mars rover hugely expensive packaging! What if we designed computer systems for worst case scenarios?
  5. Cluster provisioning: 7.3B simultaneous users many idle resources! Hardware: chips

    for the next Mars rover hugely expensive packaging! Security: all our developers are malicious expensive code deployment! What if we designed computer systems for worst case scenarios?
  6. Designing for the worst case often penalizes the average case

  7. Designing for the worst case often penalizes the average case

    Average case performance Worst case performance
  8. Designing for the worst case often penalizes the average case

    Average case performance Worst case performance ??? this talk
  9. This talk: When can designing for the worst case improve

    the average case? Structure Distributed systems and the network Beyond the network Lessons
  10. This talk: When can designing for the worst case improve

    the average case? Structure Distributed systems and the network Beyond the network Lessons
  11. Almost every non-trivial application today is (or is becoming) distributed

    Distributed Systems Matter Distribution happens over a network
  12. Almost every non-trivial application today is (or is becoming) distributed

    Corollary: Almost every non-trivial application today needs to worry about the network Distributed Systems Matter Distribution happens over a network
  13. Networks make design hard Many things can go wrong:

  14. Networks make design hard Many things can go wrong: Packets

    may be delayed Packets may be dropped Sometimes called an asynchronous network
  15. any replica can respond to any request Handling Worst-Case Net

    Behavior availability addresses delays, drops:
  16. any replica can respond to any request Handling Worst-Case Net

    Behavior availability addresses delays, drops:
  17. any replica can respond to any request Handling Worst-Case Net

    Behavior availability addresses delays, drops:
  18. any replica can respond to any request Handling Worst-Case Net

    Behavior availability addresses delays, drops:
  19. any replica can respond to any request Handling Worst-Case Net

    Behavior availability addresses delays, drops:
  20. any replica can respond to any request Handling Worst-Case Net

    Behavior availability addresses delays, drops:
  21. any replica can respond to any request Handling Worst-Case Net

    Behavior if our system is available, then even when network is fine, we still don’t have to talk! availability addresses delays, drops:
  22. any replica can respond to any request Handling Worst-Case Net

    Behavior if our system is available, then even when network is fine, we still don’t have to talk! NO COORDINATION availability addresses delays, drops:
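The "no coordination" idea behind these slides is concretely realized by CRDTs, one of the keywords from the closing summary. As a hedged illustration (not code from the talk), here is a minimal grow-only counter: any replica can accept an increment locally, and replicas converge by merging state, with no synchronous communication on the write path.

```python
# Illustrative sketch of a grow-only counter (G-Counter) CRDT: every
# replica accepts increments locally (coordination-free writes) and
# replicas converge by merging per-replica counts with element-wise max.

class GCounter:
    def __init__(self, replica_id):
        self.replica_id = replica_id
        self.counts = {}  # replica_id -> count contributed by that replica

    def increment(self, n=1):
        # Local write: no network round trip, always available.
        self.counts[self.replica_id] = self.counts.get(self.replica_id, 0) + n

    def value(self):
        # The global count is the sum of every replica's contribution.
        return sum(self.counts.values())

    def merge(self, other):
        # Merge is commutative, associative, and idempotent, so replicas
        # can exchange state in any order, any number of times, and still
        # converge to the same value.
        for rid, c in other.counts.items():
            self.counts[rid] = max(self.counts.get(rid, 0), c)

a, b = GCounter("a"), GCounter("b")
a.increment(3)   # served entirely at replica a
b.increment(2)   # served entirely at replica b
a.merge(b)
b.merge(a)
assert a.value() == b.value() == 5
```

Because merge is idempotent and order-insensitive, even a misbehaving network that delays or redelivers messages cannot prevent convergence; the same design that survives the worst-case network never blocks in the best case.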
  23. Coordination-free systems What if we don’t have to talk?

  24. Coordination-free systems: 1.) Enable infinite scale-out What if we don’t

    have to talk?
  25. Coordination-free systems: 1.) Enable infinite scale-out What if we don’t

    have to talk?
  26. Coordination-free systems: 1.) Enable infinite scale-out What if we don’t

    have to talk?
  27. Coordination-free systems: 1.) Enable infinite scale-out What if we don’t

    have to talk?
  28. Coordination-free systems: 1.) Enable infinite scale-out What if we don’t

    have to talk?
  29. [Chart: throughput (txns/s) vs. number of servers (items) accessed per transaction, for distributed transactions on EC2]

  30. [Chart: adds coordinated in-memory locking to the previous plot for comparison]

  31. [Chart: same comparison on a log scale; coordination costs roughly 398x in throughput]

  32. [Chart: coordinated (distributed transactions on EC2) vs. coordination-free (in-memory locking); the ~398x throughput gap]
  33. Coordination-free systems: 1.) Enable infinite scale-out 2.) Improve throughput What

    if we don’t have to talk?
  34. 133.7+ ms RTT

  35. 133.7+ ms RTT

  36. 133.7+ ms RTT 85.1+ ms RTT

  37. What if we don’t have to talk? Coordination-free systems: 1.)

    Enable infinite scale-out 2.) Improve throughput 3.) Ensure low latency 4.) Guarantee “always on" response
  38. What if we don’t have to talk? Coordination-free systems: 1.)

    Enable infinite scale-out 2.) Improve throughput 3.) Ensure low latency 4.) Guarantee “always on" response
  39. Coordination-free systems: 1.) Enable infinite scale-out 2.) Improve throughput 3.)

    Ensure low latency 4.) Guarantee “always on" response What if we don’t have to talk?
  40. But wait! What about CAP?!?! • CAP Thm.: Famous result

    from Eric Brewer, Inktomi • Takeaway (+ related results): properties like serializability require unavailability (or require coordination) • Common (incorrect) conclusion: availability is too expensive, only matters during failures, so forget about it
  41. But wait! What about CAP?!?! • CAP Thm.: Famous result

    from Eric Brewer, Inktomi • Takeaway (+ related results): properties like serializability require unavailability (or require coordination) • Common (incorrect) conclusion: availability is too expensive, only matters during failures, so forget about it surprise: many useful guarantees don’t require coordination (or unavailability)!
  42. “Worst” is a Design Tool legacy implementations: designed for

    single-node context, use coordination research question: what if we built systems that didn’t have to coordinate? result: new designs that avoid coordination unless strictly necessary Example: Coordination-Avoiding Databases
  43. Simple Example: Read Committed legacy implementation: lock records during access

    research question: is coordination necessary? goal: never read from uncommitted transactions
  44. Simple Example: Read Committed legacy implementation: lock records during access

    research question: is coordination necessary? result: no! for example, buffer writes until commit result: OOM speedups over classic implementations goal: never read from uncommitted transactions VLDB 2014, SIGMOD 2015
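The slide's "buffer writes until commit" idea can be sketched in a few lines. This is a hedged, single-threaded illustration of the principle, not the paper's implementation: a transaction stages writes privately and installs them only at commit, so concurrent readers can never observe uncommitted data and no record locks are needed.

```python
# Sketch of lock-free Read Committed via buffered writes: uncommitted
# writes live in a per-transaction buffer, invisible to everyone else;
# commit installs them into the shared store.

class Store:
    def __init__(self):
        self.committed = {}  # key -> last committed value

    def read(self, key):
        # Readers only ever see committed state; no locks taken.
        return self.committed.get(key)

class Txn:
    def __init__(self, store):
        self.store = store
        self.buffer = {}  # private write buffer

    def write(self, key, value):
        self.buffer[key] = value  # invisible to other transactions

    def read(self, key):
        # Read your own buffered writes first, else committed state.
        return self.buffer.get(key, self.store.read(key))

    def commit(self):
        self.store.committed.update(self.buffer)
        self.buffer = {}

store = Store()
t1, t2 = Txn(store), Txn(store)
t1.write("x", 10)
assert t2.read("x") is None   # t1's uncommitted write is not visible
t1.commit()
assert t2.read("x") == 10     # visible only after commit
```

A real engine must install the buffered writes atomically under concurrency (the cited VLDB 2014 / SIGMOD 2015 work addresses this); the point here is that the goal, never reading uncommitted data, does not inherently require locking records during access.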
  45. What if we don’t have to talk? Coordination-free systems: 1.)

    Enable infinite scale-out 2.) Improve throughput 3.) Ensure low latency 4.) Guarantee “always on" response
  46. Coordination-free systems: 1.) Enable infinite scale-out 2.) Improve throughput 3.)

    Ensure low latency 4.) Guarantee “always on" response What if we don’t have to talk?
  47. Coordination-free systems: 1.) Enable infinite scale-out 2.) Improve throughput 3.)

    Ensure low latency 4.) Guarantee “always on" response What if we don’t have to talk? Accounting for worst case improves average case
  48. Punchline: Distributed Systems & Networks • Systems that behave well

    during network faults can behave better in non-faulty environments too • With good designs, popular guarantees from today’s RDBMSs can benefit! (see also Martin’s talk, 11AM Sat) • Research on coordination-avoiding systems highlights potential for huge speedups (see bailis.org) • Keywords: CRDTs, I-confluence, RAMP, HAT, Bloom^L
  49. This talk: When can designing for the worst case improve

    the average case? Structure Distributed systems and the network Beyond the network Lessons
  50. Replication for fault tolerance can increase request capacity Replication helps

    Capacity
  51. Fail-over helps (Dev)Ops

  52. Fail-over helps (Dev)Ops

  53. If services can auto-fail-over… can kill processes: to perform upgrades

    to manage stragglers to revoke resources Fail-over helps (Dev)Ops
  54. 99.9th %ile latency: 100ms avg latency: 1.2ms YOUR SERVICE HERE

    Tail Latency in (Micro)services
  55. 99.9th %ile latency: 100ms avg latency: 1.2ms YOUR SERVICE HERE

    10ms Tail Latency in (Micro)services
  56. 99.9th %ile latency: 100ms avg latency: 1.2ms YOUR SERVICE HERE

    10ms 1.09ms Tail Latency in (Micro)services
  57. 99.9th %ile latency: 100ms Tail Latency in (Micro)services

  58. front-end avg. latency: 64ms at 100x fan-out, 99.9th %ile latency:

    100ms Tail Latency in (Micro)services
  59. front-end avg. latency: 64ms at 100x fan-out, 99.9th %ile latency:

    100ms 10ms Tail Latency in (Micro)services
  60. front-end avg. latency: 64ms at 100x fan-out, 6.7ms 99.9th %ile

    latency: 100ms 10ms Tail Latency in (Micro)services
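The fan-out arithmetic behind these slides follows from a simple probability: if each backend call independently lands in its 99.9th-percentile tail with probability p, a front-end request that fans out to n backends and waits for all of them hits at least one tail with probability 1 - (1 - p)^n. The sketch below uses assumed round numbers for illustration; the slides' exact figures depend on the real latency distribution.

```python
# Back-of-the-envelope tail-latency fan-out calculation. Assumption for
# illustration: each backend call independently exceeds its 99.9th
# percentile (~100 ms) with probability p = 0.001.

def p_hit_tail(p_tail, fan_out):
    """Probability that at least one of fan_out parallel calls is slow."""
    return 1 - (1 - p_tail) ** fan_out

p = 0.001  # 1 request in 1000 is slow at a single backend
for n in (1, 10, 100):
    print(f"fan-out {n:>3}: P(at least one tail request) = {p_hit_tail(p, n):.1%}")
```

At fan-out 100 this comes to roughly 9.5%: about one front-end request in ten is as slow as the slowest backend's tail, which is how a per-service corner case (0.1% of requests) becomes the consumer's common case.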
  61. YOUR SERVICE’S CORNER CASE MAY BE ITS CONSUMER’S AVERAGE CASE

  62. Universal Design

  63. Universal Design

  64. None
  65. There is also a strong business case for accessibility. Accessibility

    overlaps with other best practices such as mobile web design, device independence, multi-modal interaction, usability, design for older users, and search engine optimization (SEO). Case studies show that accessible websites have better search results, reduced maintenance costs, and increased audience reach, among other benefits.
  66. [Plot of f(x) vs. x] When “Best” Is Brittle: an idealized function with a clear optimum

  67. [Plot: the idealized function alongside a less well-behaved f(x), each with its optimum]

  68. [Plot: targeting the idealized optimum on the less well-behaved function misses the target]

  69. [Plot: a “stable” solution sits in a flat region rather than at the sharp optimum]

  70. [Plot: the same figure] Robust Optimization studies finding the stable solution
  71. This talk: When can designing for the worst case improve

    the average case? Structure Distributed systems and the network Beyond the network Lessons
  72. This talk: When can designing for the worst case improve

    the average case?
  73. When does this apply? When corner cases are common When

    environmental conditions are variable When “normal” isn’t what we think This talk: When can designing for the worst case improve the average case?
  74. DEFINING “NORMAL” DEFINES OUR DESIGNS

  75. “Worst” raises tough questions

  76. Cluster provisioning: what’s our scale-out strategy? “Worst” raises tough questions

  77. Cluster provisioning: what’s our scale-out strategy? Hardware: what happens during

    bit flips? do we need ECC? “Worst” raises tough questions
  78. Cluster provisioning: what’s our scale-out strategy? Hardware: what happens during

    bit flips? do we need ECC? Security: how do we manage internal data accesses? “Worst” raises tough questions
  79. EXAMINE YOUR BIASES

  80. Reasoning about worst-case scenarios can be a powerful design tool

    Key to coordination-avoiding distributed systems designs Can often improve performance and robustness, also combat bias @PBAILIS // bailis.org
  81. Special thanks to David Andersen, Ali Ghodsi, Joe Hellerstein, Eddie

    Kohler, Phil Levis, Alex Miller, Oscar Moll, Barzan Mozafari, Ion Stoica, Eugene Wu, Jean Yang, Matei Zaharia
  82. Reasoning about worst-case scenarios can be a powerful design tool

    Key to coordination-avoiding distributed systems designs Can often improve performance and robustness, also combat bias @PBAILIS // bailis.org