Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Berlin 2013 - Session - Ryan Smith
Search
Monitorama
September 19, 2013
0
320
Berlin 2013 - Session - Ryan Smith
Monitorama
September 19, 2013
Tweet
Share
More Decks by Monitorama
See All by Monitorama
Monitorama PDX 2017 - Ian Bennett
monitorama
1
620
PDX 2017 - Pedro Andrade
monitorama
0
790
PDX 2017 - Roy Rapoport
monitorama
4
980
PDX 2017 - Julia Evans
monitorama
0
510
Berlin 2013 - Session - Brad Lhotsky
monitorama
5
750
Berlin 2013 - Session - Alex Petrov
monitorama
6
720
Berlin 2013 - Session - Jeff Weinstein
monitorama
2
650
Berlin 2013 - Session - Oliver Hankeln
monitorama
1
570
Berlin 2013 - Session - David Goodlad
monitorama
0
500
Featured
See All Featured
It's Worth the Effort
3n
188
29k
Digital Ethics as a Driver of Design Innovation
axbom
PRO
1
170
Designing for Performance
lara
610
70k
Darren the Foodie - Storyboard
khoart
PRO
2
2.3k
Hiding What from Whom? A Critical Review of the History of Programming languages for Music
tomoyanonymous
2
410
Getting science done with accelerated Python computing platforms
jacobtomlinson
2
110
Visualizing Your Data: Incorporating Mongo into Loggly Infrastructure
mongodb
49
9.8k
HDC tutorial
michielstock
1
360
How to Grow Your eCommerce with AI & Automation
katarinadahlin
PRO
0
100
Optimizing for Happiness
mojombo
379
71k
Navigating the moral maze — ethical principles for Al-driven product design
skipperchong
2
240
How to audit for AI Accessibility on your Front & Back End
davetheseo
0
180
Transcript
Predictable Failure Building systems that fail in predictable ways.
Failure is the inability to handle failure.
Southern Airways Flight 242
None
What went wrong • Radar misguided pilots into the storm
• Pilots applied thrust to a stalling engine • Close landing field was not suggested
How will the system fail?
How will Redis fail?
None
British Airways Flight 009
What went wrong • All engines failed • St. Elmos
fire but nothing on the radar • Radio to tower was not 100% • 1st officers oxygen mask broke
What happens when the system fails
No Redundancy - Simple func Query(c Conn, query string) Result
{ return c.DoQuery(query) }
Redundancy - Complex func Query(conns []Conn, query string) Result {
ch := make(chan Result, len(conns)) for _, conn := range conns { go func(c Conn) { ch <- c.DoQuery(query): }(conn) } return <-ch }
Independence
Same function Different implementation
How many redundancies?
A component that just works
Dormant Failures
Don't wait until disaster strikes to find out that Your
secondary RDMS has a full disk
Propagation
Danke! @ryandotsmith