Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Four years of breaking things in production, on purpose.
Search
Eric Sigler
November 09, 2017
Technology
0
45
Four years of breaking things in production, on purpose.
Presented at Chaos Day Twin Cities, November 2017.
Eric Sigler
November 09, 2017
Tweet
Share
More Decks by Eric Sigler
See All by Eric Sigler
Instrumenting The Rest Of The Company: Hunting For Metrics
esigler
0
290
A Brief Introduction To DevOps
esigler
0
97
Humans are terrible compilers: A User's Guide
esigler
0
110
Do You Know If Your Service Is Working Properly? A Guide To Being Paranoid.
esigler
0
160
"Is there any strong objection?"
esigler
0
200
Fear, Uncertainty, and Continuous Deployment
esigler
1
110
3AM, a survey.
esigler
0
210
Strategies For Being On Call & Keeping Your Sanity At The Same Time
esigler
0
160
Engineering for Engineers
esigler
0
81
Other Decks in Technology
See All in Technology
.NET Profiler in 2024.
kkamegawa
2
2.2k
Building a RAG-poweredAI chat appwith Python and VS Code
pamelafox
0
170
Handling focus in 2024
tahia910
0
480
Cloud Service Mesh に触れ合う
phaya72
1
270
Building Dashboards as a Hobby
egmc
0
410
ゼロから始めるVue.jsコミュニティ貢献 / first-vuejs-community-contribution-link-and-motivation
lmi
1
150
コードファーストの考え方。 Amplify Gen2から学ぶAWS次世代のWeb開発体験
yoshiitaka
2
510
DX企業CTOとして考える技術への向き合い方
shoheitai
0
100
M&A戦略を支えるデータマネジメント (MIDAS Tech Study #16 GENDA Komiyama)
kommy339
1
150
One engineer company with Ruby on Rails
rstankov
2
450
非同期推論システムによるコスト削減と信頼性向上
koki_nishihara
1
380
認知症フレンドリーテックとスタックチャン
naokiuc
0
340
Featured
See All Featured
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
22
1.4k
Mobile First: as difficult as doing things right
swwweet
217
8.6k
The World Runs on Bad Software
bkeepers
PRO
61
6.7k
The Illustrated Children's Guide to Kubernetes
chrisshort
32
47k
Being A Developer After 40
akosma
67
580k
Java REST API Framework Comparison - PWX 2021
mraible
PRO
18
6.9k
Designing with Data
zakiwarfel
96
4.8k
Scaling GitHub
holman
457
140k
A designer walks into a library…
pauljervisheath
201
23k
How to train your dragon (web standard)
notwaldorf
75
5.2k
StorybookのUI Testing Handbookを読んだ
zakiyama
13
4.6k
Creatively Recalculating Your Daily Design Routine
revolveconf
211
11k
Transcript
Eric Sigler, Head of DevOps, PagerDuty @esigler Four years of
breaking things in production, on purpose.
@esigler Obligatory disclaimer: This is what works for us. Take
away ideas, not dogmas.
@esigler
@esigler 2013: Every Friday, 1 hour. 2013 2014 2015 2016
2017
@esigler 2013 2014 2015 2016 2017
None
@esigler 2014: Expanding Scope 2013 2014 2015 2016 2017
@esigler 2013 2014 2015 2016 2017
@esigler 2015: Automation 2013 2014 2015 2016 2017
@esigler 2013 2014 2015 2016 2017
@esigler 2013 2014 2015 2016 2017
@esigler 2016: Adding In Randomness 2013 2014 2015 2016 2017
@esigler 2013 2014 2015 2016 2017
@esigler Also 2016: Putting It All Together 2013 2014 2015
2016 2017
@esigler 2013 2014 2015 2016 2017
@esigler 2017: Distributing Knowledge 2013 2014 2015 2016 2017
@esigler 2013 2014 2015 2016 2017
@esigler Failure Friday sessions: 133 Faults injected: 708 Fault injections
resulting in a public postmortem: 3
@esigler Simulated full AZ failures: 4 Simulated full Region failures:
3 Simulated partial Disaster Recovery: 2
@esigler Tickets created from Failure Friday: over 225 Distinct services
that had faults injected: 49
@esigler
@esigler Optimized for learning first, tooling second Built the toolchain
to enable other teams Distributed chaos engineering knowledge
@esigler