Neha Narula on The Scalable Commutativity Rule

The Scalable Commutativity Rule
by Austin Clements, Frans Kaashoek, Nickolai Zeldovich, Robert Morris, and Eddie Kohler
Papers We Love NYC
April 1, 2015

Neha Narula
Ph.D. candidate at MIT
– Working on high performance concurrency control in databases and distributed systems
– How do we get high performance and strong consistency?
Formerly @Google
http://nehanaru.la
@neha

A Few Caveats

Talk Outline
•  Problem
•  Scalable Commutativity Rule
•  Applying the Rule
•  Speculation

CPU Trends

A Scalability Bottleneck: one contended cache line

Cost of One Contended Cache Line

Current Software Development
•  Benchmark, re-design, test
•  Hard to know what problems might arise in the future
•  The real bottlenecks might be in the interface design, not just the implementation

What Scales on Today’s Multicores?
•  Cache coherence: the MESI protocol
•  Reads do not conflict; a read and a write, or two writes, to the same cache line do
•  Conflict-free is a good proxy for scalability
Two operations are scalable if they are conflict-free.
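A minimal sketch of the conflict rule on this slide, assuming each operation can be summarized by the set of cache lines it reads and writes. The `Access` record and the line names are hypothetical, chosen only for illustration.

```python
from dataclasses import dataclass
from typing import FrozenSet


@dataclass(frozen=True)
class Access:
    """Cache lines one operation reads and writes (hypothetical record)."""
    reads: FrozenSet[str] = frozenset()
    writes: FrozenSet[str] = frozenset()


def conflict_free(a: Access, b: Access) -> bool:
    """True when neither operation writes a line the other reads or writes."""
    a_touched = a.reads | a.writes
    b_touched = b.reads | b.writes
    return a.writes.isdisjoint(b_touched) and b.writes.isdisjoint(a_touched)


reader1 = Access(reads=frozenset({"refcount"}))
reader2 = Access(reads=frozenset({"refcount"}))
writer = Access(writes=frozenset({"refcount"}))
other_writer = Access(writes=frozenset({"percore[1]"}))

print(conflict_free(reader1, reader2))      # True: two reads of one line
print(conflict_free(reader1, writer))       # False: read and write, same line
print(conflict_free(writer, writer))        # False: two writes, same line
print(conflict_free(writer, other_writer))  # True: writes to different lines
```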

Interface Scalability

Interface Scalability
Change the interface?

The Scalable Commutativity Rule
Whenever interface operations commute, they can be implemented in a way that scales.
Commutes -> (rule) -> Scalable implementation exists
•  creat with lowest fd? creat -> 3, creat -> 4
•  creat with any fd? creat -> 13, creat -> 47
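To make the lowest-fd versus any-fd contrast concrete, here is a toy sketch of two descriptor allocators. Both classes are hypothetical and start numbering at 0 for simplicity; they are not how any real kernel allocates descriptors. The lowest-fd version has to consult one shared table on every call, while the any-fd version lets each core hand out numbers from its own disjoint range.

```python
class LowestFdTable:
    """Returns the lowest free fd, so every call inspects shared state."""

    def __init__(self):
        self.used = set()          # shared by all cores

    def alloc(self, core):
        fd = 0
        while fd in self.used:     # scan the shared table for the minimum
            fd += 1
        self.used.add(fd)
        return fd


class AnyFdTable:
    """Returns any free fd: core i owns fds i, i + ncores, i + 2*ncores, ..."""

    def __init__(self, ncores):
        self.ncores = ncores
        self.next_free = list(range(ncores))   # one private slot per core

    def alloc(self, core):
        fd = self.next_free[core]              # touches only this core's slot
        self.next_free[core] += self.ncores
        return fd


lowest = LowestFdTable()
print(lowest.alloc(core=0), lowest.alloc(core=1))   # 0 1

anyfd = AnyFdTable(ncores=4)
print(anyfd.alloc(core=1), anyfd.alloc(core=3), anyfd.alloc(core=1))   # 1 3 5
```

With the lowest-fd interface the value each caller sees depends on who ran first; with the any-fd interface each caller's result is the same in either order.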

Intuition Behind Rule
When operations commute
– The results are independent of order
– Communication is unnecessary
– And without communication, no conflicts

Example: Reference Counter
Timeline across threads T1 to T5: iszero() -> F, iszero() -> F, dec() -> 2, dec() -> 1, dec() -> 0, with regions R1 and R2 marked
•  R1 commutes; conflict-free implementation: shared counter
•  R2 does not commute because dec() returns counter value

Example: Reference Counter
Timeline across threads T1 to T5: iszero() -> F, iszero() -> F, dec() -> ok, dec() -> ok, dec() -> ok, with regions R1, R2’, and R3 marked
•  R1 commutes; conflict-free implementation: shared counter
•  R2 does not commute because dec() returns counter value
•  R2’ does commute; conflict-free implementation: per-core counter
•  R3 depends on state: initial value > 3 versus initial value ≤ 3
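A rough sketch of the two counter designs named on this slide. The class names and the choice to seed slot 0 with the initial value are assumptions made for illustration; real per-core counters live in per-CPU memory, which plain Python lists only imitate.

```python
class SharedCounter:
    """One shared integer: every dec() writes the same location."""

    def __init__(self, initial):
        self.value = initial

    def dec(self, core):
        self.value -= 1
        return self.value          # returning the value exposes the order

    def iszero(self):
        return self.value == 0     # read-only, so two iszero() calls do not conflict


class PerCoreCounter:
    """One slot per core: dec() writes only the caller's slot."""

    def __init__(self, initial, ncores):
        self.slots = [0] * ncores
        self.slots[0] = initial    # assumption: the initial count sits in slot 0

    def dec(self, core):
        self.slots[core] -= 1      # no other core's slot is touched
        return "ok"                # nothing about global order leaks out

    def iszero(self):
        return sum(self.slots) == 0   # must read every slot


counter = PerCoreCounter(initial=3, ncores=5)
print(counter.dec(core=2), counter.dec(core=3), counter.dec(core=4))  # ok ok ok
print(counter.iszero())                                               # True
```

The trade-off shows up in iszero(): it reads every slot, so it is no longer conflict-free with a concurrent dec(), even though the dec() calls are conflict-free with each other.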

Formalizing the Rule
•  History
•  Specification
•  Reordering
•  Commutativity

Histories and Specifications
A history H is a sequence of invocations and responses on threads.
A specification ζ defines an interface: ζ is the set of legal histories given the allowed behavior of the interface.

Reordering
A reordering H’ of H is a permutation of H that preserves the order of operations on each individual thread (H|t = H’|t for all t).

Commutativity
A region Y of a legal history XY SIM-commutes if every reordering Y’ of Y also yields a legal history XY’ and every legal extension Z of XY is also a legal extension of XY’. (And this must be true for every prefix of every reordering of Y.)
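The definitions above can be explored by brute force on a small model. The sketch below is an approximation, not the formal definition: it enumerates every reordering of a region that preserves per-thread order, replays each against a sequential model, and compares per-thread responses plus the final state (equal final state stands in for "same legal extensions"). It does not check prefixes of reorderings. The function names and the two counter models are hypothetical.

```python
def reorderings(region):
    """Yield every permutation of `region` that keeps each thread's own order."""
    if not region:
        yield []
        return
    seen_threads = set()
    for i, (thread, op) in enumerate(region):
        if thread in seen_threads:
            continue                       # only a thread's first pending op may go next
        seen_threads.add(thread)
        rest = region[:i] + region[i + 1:]
        for tail in reorderings(rest):
            yield [(thread, op)] + tail


def run(model_step, state, region):
    """Replay `region` sequentially; return per-thread responses and final state."""
    responses = {}
    for thread, op in region:
        state, response = model_step(state, op)
        responses.setdefault(thread, []).append(response)
    return responses, state


def commutes(model_step, state, region):
    """Approximate check: every reordering gives the same responses and state."""
    baseline = run(model_step, state, region)
    return all(run(model_step, state, r) == baseline for r in reorderings(region))


def counter_returning_value(state, op):
    if op == "dec":
        return state - 1, state - 1
    if op == "iszero":
        return state, state == 0
    raise ValueError(op)


def counter_returning_ok(state, op):
    if op == "dec":
        return state - 1, "ok"
    if op == "iszero":
        return state, state == 0
    raise ValueError(op)


decs = [(3, "dec"), (4, "dec"), (5, "dec")]
print(commutes(counter_returning_value, 3, decs))                # False
print(commutes(counter_returning_ok, 3, decs))                   # True
print(commutes(counter_returning_ok, 4, [(1, "iszero")] + decs)) # True
print(commutes(counter_returning_ok, 3, [(1, "iszero")] + decs)) # False
```

The last two lines use a region that mixes one iszero() with three dec() calls (my own choice of region): whether it commutes depends on the initial value, since with a start of 3 the answer to iszero() changes once all three decrements have run.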

The Formal Rule
Let ζ be a specification with a reference implementation M. Consider a history XY where Y commutes in XY and M can generate XY. There exists a correct implementation of ζ whose execution of XY is conflict-free in the commutative region Y.

Commuter
•  Input: Symbolic Model
•  Analyzer computes commutativity conditions
•  Testgen computes test cases
•  Mtrace detects conflict

Example: rename()
rename(a, b) and rename(c, d) commute if:
•  Both source files exist and all names are different
•  Neither source file exists
•  a xor c exists, and it is not the other rename's destination
•  One call is a self-rename of an existing file and a ≠ c
•  a and c are hard links to the same inode, a ≠ c, and b = d
•  Both calls are self-renames
Important to have discriminating commutativity conditions
•  ∀states, rename almost never commutes
•  More commutative cases ⇒ more opportunities to scale
•  Captures more operations applications usually do
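Some of the conditions above can be spot-checked by brute force on concrete states. The sketch below assumes a deliberately simplified file system: a flat dict from name to inode number, no directories, no link counts, and none of POSIX's special handling when source and destination are different links to the same file. Commuter derives its conditions symbolically from a model; this toy only runs two calls in both orders and compares return values and final state. All names here are hypothetical.

```python
import errno


def rename(fs, src, dst):
    """Apply a simplified rename to `fs` (name -> inode number) in place."""
    if src not in fs:
        return -errno.ENOENT       # missing source: fail, state unchanged
    if src == dst:
        return 0                   # self-rename: succeed, state unchanged
    fs[dst] = fs[src]              # destination now names the source's inode
    del fs[src]
    return 0


def renames_commute(fs, first, second):
    """Run both calls in both orders; compare per-call results and final state."""
    calls = (first, second)
    outcomes = []
    for order in ((0, 1), (1, 0)):
        state = dict(fs)           # fresh copy of the starting state
        results = [None, None]
        for i in order:
            results[i] = rename(state, *calls[i])
        outcomes.append((results, state))
    return outcomes[0] == outcomes[1]


# Both sources exist, all names different.
print(renames_commute({"a": 1, "c": 2}, ("a", "b"), ("c", "d")))   # True
# Neither source exists.
print(renames_commute({}, ("a", "b"), ("c", "d")))                 # True
# Two different files renamed to the same destination.
print(renames_commute({"a": 1, "c": 2}, ("a", "b"), ("c", "b")))   # False
# Hard links to one inode renamed to the same destination.
print(renames_commute({"a": 1, "c": 1}, ("a", "b"), ("c", "b")))   # True
# A missing source that is the other call's destination.
print(renames_commute({"a": 1}, ("a", "b"), ("b", "d")))           # False
```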

Commuter Finds Non-scalable Cases in Linux
•  Directory-wide locking
•  File descriptor reference counts
•  Address space-wide locking

sv6: A Scalable OS
•  POSIX-like operating system
•  File system and virtual memory system follow commutativity rule
•  Implementation using standard parallel programming techniques, but guided by Commuter

Remaining 1%: Idempotent Updates
•  Two lseeks of same FD to the same offset
•  Two pwrites of same data to same offset

Refining POSIX with the Rule
•  Lowest FD versus any FD
•  stat versus xstat
•  Unordered sockets
•  Delayed munmap
•  fork+exec versus posix_spawn

What Can We Learn?
•  Embrace non-determinism
•  Decompose compound operations
•  Permit weak ordering
•  Release resources asynchronously

Commutative Operations Matter

Limitations of the Rule
•  Rule says a scalable implementation exists.
– It might not have the best raw performance
– You might need different scalable implementations for different regions
– How do I find this implementation?
•  The non-scalable non-commutativity rule
•  Synchronized clocks

Distributed Systems and Databases
•  Reads still don’t conflict, but no cache coherence for invalidations
•  Rule should still apply to message passing systems
•  Commutative concurrency control

Thanks!
The Scalable Commutativity Rule
http://pdos.csail.mit.edu/commuter/
@neha
