Harvest, Yield, and Scalable Tolerant Systems

Pedro Tavares

October 30, 2018

Transcript

  1. Harvest, Yield, and Scalable Tolerant Systems

  2. @ordepdev @ordepdev

  3. 1999 https://users.ece.cmu.edu/~adrian/731-sp04/readings/FB-cap.pdf

  4. “We propose two strategies for improving overall availability using simple mechanisms that scale over large applications whose output behavior tolerates graceful degradation.”

  5. Tolerate partial failures and provide smoothly-degrading functionality.

  6. “We characterize this degradation in terms of harvest and yield.”

  7. 1975 https://tools.ietf.org/html/rfc677

  8. “[…] originally proposed as a rule of thumb, without precise definitions, with the goal of starting a discussion about trade-offs in databases.”

  9. The CAP Principle

  10. It was originally called the CAP Principle by Fox and Brewer.

  11. After the principle was formalized by Gilbert and Lynch (2002) it became known as the CAP Theorem.

  12. All nodes should see the same data at the same time. Consistency (C)

  13. Consistency (C) x=1 x=1

  14. When a failure occurs, the system should keep going, switching over to a replica if required. Availability (A)

  15. Availability (A) ✅

  16. The system should continue to operate despite arbitrary message loss or failure of part of the system. Partition resilience (P)

  17. Partition resilience (P) ⚡ ✅ ✅

  18. ⚡ A network partition is a communication fault that splits the network into subsets of nodes that cannot communicate with each other. ⚡

  19. Partition resilience (P) ⚡ ✅ ✅

  20. Strong CAP Principle

  21. Strong Consistency, High Availability, Partition-resilience: Pick at most 2.

  22. x=? x=?

  23. ⚡ x=? x=?

  24. ⚡ 1. set(‘x’,1) x=? x=?

  25. ⚡ 1. set(‘x’,1) 2. send(‘x’) x=1 x=?

  26. ⚡ 1. set(‘x’,1) 2. send(‘x’) x=1 x=? ⚡

  27. ⚡ 1. set(‘x’,1) 2. send(‘x’) x=1 x=? ⚡ 1. set(‘x’,2)

  28. ⚡ 1. set(‘x’,1) 2. send(‘x’) x=1 x=2 ⚡ 1. set(‘x’,2) 2. send(‘x’)

  29. ⚡ 1. set(‘x’,1) 2. send(‘x’) x=1 x=2 ⚡ 1. set(‘x’,2) 2. send(‘x’) Both nodes are available, although there’s no consistency!
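
A minimal sketch of the scenario above (Python, with hypothetical Replica objects standing in for the two nodes): each side keeps accepting writes while the link between them is down, so both stay available but disagree on x.

    # Two replicas keep accepting writes while the link between them is down,
    # so both remain "available" but end up with different values for x.
    class Replica:
        def __init__(self, name):
            self.name = name
            self.data = {}
            self.outbox = []  # replication messages that could not be delivered

        def set(self, key, value):
            self.data[key] = value            # 1. set('x', ...)
            self.outbox.append((key, value))  # 2. send('x') -- queued, link is down

        def get(self, key):
            return self.data.get(key)

    a, b = Replica("A"), Replica("B")
    a.set("x", 1)   # one client writes x=1 on A
    b.set("x", 2)   # another client writes x=2 on B
    print(a.get("x"), b.get("x"))  # -> 1 2: both replicas answered, values diverge
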
  30. It’s all about trade-offs

  31. Partition Tolerance Consistency Availability

  32. Partition tolerance is mandatory in distributed systems. You cannot opt out of it!

  33. Partition Tolerance Consistency Availability

  34. To achieve atomic reads and writes we must wait for a response from the partitioned node. CP without A

  35. Partition Tolerance Consistency Availability

  36. To achieve maximum availability, the system should return the most recent version of the data it has, even if it is stale. AP without C
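
A rough sketch of the CP-versus-AP choice described in the last two items, using a hypothetical read() helper: a CP-style read refuses to answer when it cannot reach its peer, while an AP-style read returns the local, possibly stale copy right away.

    # During a partition, a CP-style read blocks/fails rather than answer without
    # its peer, while an AP-style read returns the local (possibly stale) value.
    def read(key, local_store, peer_reachable, mode):
        if mode == "CP":
            if not peer_reachable:
                raise TimeoutError("peer unreachable: refusing to answer (gives up A)")
            return local_store[key]    # value confirmed with the peer (not shown)
        elif mode == "AP":
            return local_store[key]    # answer immediately, may be stale (gives up C)

    store = {"x": 1}               # last value this node saw before the partition
    print(read("x", store, peer_reachable=False, mode="AP"))   # -> 1 (maybe stale)
    # read("x", store, peer_reachable=False, mode="CP")        # -> TimeoutError
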
  37. “The stronger the guarantees made about any two, the weaker the guarantees that can be made about the third.”

  38. Harvest & Yield

  39. Probability of completing a request. Yield

  40. The fraction of the data reflected in the response. Harvest
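
Expressed as simple ratios (a sketch based only on the two definitions above):

    # Yield and harvest as ratios, following the definitions on slides 39-40.
    def yield_(completed_requests, total_requests):
        # fraction of requests that complete (~ probability of completing a request)
        return completed_requests / total_requests

    def harvest(data_reflected, data_total):
        # fraction of the full data set reflected in the response
        return data_reflected / data_total

    print(yield_(999, 1000))  # 0.999 -> 99.9% of requests got an answer
    print(harvest(3, 4))      # 0.75  -> each answer covered 3 of the 4 shards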

  41. “In the presence of faults there is typically a tradeoff between providing no answer and providing an imperfect answer.”

  42. ⚡ COUNT WHERE x = 1; ? x=1 x=1

  43. ⚡ COUNT WHERE x = 1; No answer. x=1 x=1

  44. ⚡ COUNT WHERE x = 1; 1. x=1 x=1
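
A sketch of the COUNT example, assuming two hypothetical shards of which one sits behind the partition: the system can either give no answer (preserving harvest, hurting yield) or answer with the partial count 1 (preserving yield, hurting harvest).

    # The query fans out to two shards; one is unreachable because of the partition.
    shards = {
        "shard-A": {"reachable": True,  "count_x_eq_1": 1},
        "shard-B": {"reachable": False, "count_x_eq_1": 1},  # behind the partition
    }

    def count_x_eq_1(degrade_harvest):
        total, answered = 0, 0
        for shard in shards.values():
            if shard["reachable"]:
                total += shard["count_x_eq_1"]
                answered += 1
            elif not degrade_harvest:
                raise RuntimeError("shard unreachable: no answer")
        return total, answered / len(shards)   # partial count + harvest fraction

    print(count_x_eq_1(degrade_harvest=True))   # -> (1, 0.5): imperfect answer
    # count_x_eq_1(degrade_harvest=False)       # -> RuntimeError: no answer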

  45. “Instead of CAP, you should think about your availability in terms of yield and harvest and which of these two your system will sacrifice when failures happen.” https://codahale.com/you-cant-sacrifice-partition-tolerance

  46. Problems with CAP

  47. Asymmetry between A & C http://dbmsmusings.blogspot.com/2010/04/problems-with-cap-and-yahoos-little.html

  48. CP reads like… “consistent and tolerant of network partitions, but not available.”

  49. Of course, that’s not the case… Availability is only sacrificed when there is a network partition.

  50. While an AP system… sacrifices consistency all the time, not just when there is a network partition.

  51. Lack of latency considerations

  52. “In its classic interpretation, CAP theorem ignores latency…”

  53. “… although in practice, latency and partitions are deeply related.”

  54. “Systems that tend to give up consistency for availability when there is a partition also tend to give up consistency for latency when there is no partition.” http://dbmsmusings.blogspot.com/2010/04/problems-with-cap-and-yahoos-little.html

  55. Guarantee            | Consistency | Performance | Availability
      Strong Consistency   | Excellent   | Poor        | Poor
      Eventual Consistency | Poor        | Excellent   | Excellent

  56. “When you go with an AP system, you choose latency over consistency.”
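
A toy sketch of that point, with made-up replica latencies: waiting for every replica before acknowledging a write ties request latency to the slowest replica, while acknowledging after the local write keeps latency low at the cost of stale reads elsewhere.

    # Hypothetical per-replica acknowledgement latencies for one write.
    REPLICA_LATENCIES_MS = [5, 8, 120]   # the last replica is slow or far away

    def write_latency_ms(mode):
        if mode == "sync":                   # strong consistency: wait for everyone
            return max(REPLICA_LATENCIES_MS)
        return REPLICA_LATENCIES_MS[0]       # eventual consistency: ack the local write

    print(write_latency_ms("sync"))    # -> 120
    print(write_latency_ms("async"))   # -> 5
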
  57. “Pick 2 of 3” is misleading http://cs609.cs.ua.edu/CAP12.pdf

  58. “The CAP theorem asserts that any networked shared-data system can have only two of three desirable properties.” https://www.infoq.com/articles/cap-twelve-years-later-how-the-rules-have-changed

  59. ⚡ Network faults: you don’t have a choice — they will happen whether you like it or not! ⚡

  60. ⚡ Consistency Availability ?

  61. “A better way of phrasing CAP would be either Consistent or Available when Partitioned.”

  62. CRDTs: we can write safely and consistently even when the cluster is totally partitioned.
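
One way to make that claim concrete is a grow-only counter (G-Counter); this is a minimal sketch, not taken from the deck: each replica increments only its own slot, and merging states once the partition heals converges both sides to the same value.

    # G-Counter CRDT: per-replica counts, merge = element-wise max, value = sum.
    class GCounter:
        def __init__(self, replica_id):
            self.replica_id = replica_id
            self.counts = {}                    # increments recorded per replica

        def increment(self, n=1):
            self.counts[self.replica_id] = self.counts.get(self.replica_id, 0) + n

        def merge(self, other):
            for rid, c in other.counts.items():
                self.counts[rid] = max(self.counts.get(rid, 0), c)

        def value(self):
            return sum(self.counts.values())

    a, b = GCounter("A"), GCounter("B")
    a.increment(); a.increment()   # writes accepted on A during the partition
    b.increment()                  # writes accepted on B during the partition
    a.merge(b); b.merge(a)         # partition heals, states are exchanged
    print(a.value(), b.value())    # -> 3 3: both replicas agree
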
  63. “[…] by explicitly handling partitions, designers can optimize consistency and availability, thereby achieving some tradeoff of all three.” https://www.infoq.com/articles/cap-twelve-years-later-how-the-rules-have-changed

  64. Improving Knowledge

  65. “Whatever way you choose to learn, I encourage you to be curious and patient – this stuff doesn’t come easy.” https://martin.kleppmann.com/2015/05/11/please-stop-calling-databases-cp-or-ap.html

  66. “But whatever you do, please stop talking about CP and AP, because they just don’t make any sense.” https://martin.kleppmann.com/2015/05/11/please-stop-calling-databases-cp-or-ap.html

  67. Harvest, Yield, and Scalable Tolerant Systems