Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Berlin 2013 - Session - Ryan Smith
Search
Monitorama
September 19, 2013
0
270
Berlin 2013 - Session - Ryan Smith
Monitorama
September 19, 2013
Tweet
Share
More Decks by Monitorama
See All by Monitorama
Monitorama PDX 2017 - Ian Bennett
monitorama
1
520
PDX 2017 - Pedro Andrade
monitorama
0
560
PDX 2017 - Roy Rapoport
monitorama
4
830
PDX 2017 - Julia Evans
monitorama
0
370
Berlin 2013 - Session - Brad Lhotsky
monitorama
5
650
Berlin 2013 - Session - Alex Petrov
monitorama
6
630
Berlin 2013 - Session - Jeff Weinstein
monitorama
2
560
Berlin 2013 - Session - Oliver Hankeln
monitorama
1
480
Berlin 2013 - Session - David Goodlad
monitorama
0
350
Featured
See All Featured
10 Git Anti Patterns You Should be Aware of
lemiorhan
648
58k
YesSQL, Process and Tooling at Scale
rocio
164
13k
Practical Orchestrator
shlominoach
182
9.7k
Designing for humans not robots
tammielis
248
25k
Product Roadmaps are Hard
iamctodd
44
9.7k
Building an army of robots
kneath
300
41k
StorybookのUI Testing Handbookを読んだ
zakiyama
13
4.6k
Building Flexible Design Systems
yeseniaperezcruz
319
37k
4 Signs Your Business is Dying
shpigford
175
21k
RailsConf & Balkan Ruby 2019: The Past, Present, and Future of Rails at GitHub
eileencodes
125
32k
Exploring the Power of Turbo Streams & Action Cable | RailsConf2023
kevinliebholz
2
3.4k
Building Better People: How to give real-time feedback that sticks.
wjessup
355
18k
Transcript
Predictable Failure Building systems that fail in predictable ways.
Failure is the inability to handle failure.
Southern Airways Flight 242
None
What went wrong • Radar misguided pilots into the storm
• Pilots applied thrust to a stalling engine • Close landing field was not suggested
How will the system fail?
How will Redis fail?
None
British Airways Flight 009
What went wrong • All engines failed • St. Elmos
fire but nothing on the radar • Radio to tower was not 100% • 1st officers oxygen mask broke
What happens when the system fails
No Redundancy - Simple func Query(c Conn, query string) Result
{ return c.DoQuery(query) }
Redundancy - Complex func Query(conns []Conn, query string) Result {
ch := make(chan Result, len(conns)) for _, conn := range conns { go func(c Conn) { ch <- c.DoQuery(query): }(conn) } return <-ch }
Independence
Same function Different implementation
How many redundancies?
A component that just works
Dormant Failures
Don't wait until disaster strikes to find out that Your
secondary RDMS has a full disk
Propagation
Danke! @ryandotsmith