Leon Fayer - Ignite - Oncall for developers
devopsdaysraleigh
October 07, 2016
Technology
Transcript
1. ON CALL FOR DEVELOPERS @papa_fire
2. HELLO, MY NAME IS LEON and I am a developer
3. user:/$~ sudo -s
   bash: Permission denied
   user is not in the sudoers file. This incident will be reported.
4. user:/$~ sudo -s
   root:/#~ F&*# YEAH!
5. WITH GREAT POWER COMES GREAT RESPONSIBILITY (and more work)
6. Should Developers be On Call?
7. Things that can go wrong: hardware, network, application, performance, process, security
8. alert / escalation / resolution. ONLY ONE HAS TO SUFFER
9. ACTIONABLE ALERTS: (1) can I fix it? (2) can I fix it tomorrow? (3) do I care?
10. ACTIONABLE ALERTS: (1) can I fix it? (2) can I fix it tomorrow? (3) do I care? (4) can someone else fix it?
11. …AND?
12. documentation, documentation, documentation
13. SAY NO TO UNDOCUMENTED ALERTS
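A rule like "no undocumented alerts" can be enforced mechanically, for example as a check that runs before alert definitions are deployed. The following is a minimal sketch under stated assumptions: the alert definitions are plain dictionaries, and the `name` and `runbook` keys are a hypothetical schema, not one taken from the talk or from any particular monitoring tool.

```python
def undocumented(alerts: list[dict]) -> list[str]:
    """Return the names of alerts that have no non-empty 'runbook' entry."""
    return [a["name"] for a in alerts if not str(a.get("runbook") or "").strip()]


def check_alerts(alerts: list[dict]) -> None:
    """Raise ValueError if any alert definition lacks documentation."""
    missing = undocumented(alerts)
    if missing:
        raise ValueError("undocumented alerts: " + ", ".join(missing))
```

Calling `check_alerts` in a build or deploy step would reject a change that adds an alert without saying what the person on call should do about it.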
14. DEEP INSTRUMENTATION, top-down approach: (1) understand business (2) monitor business (3) correlate data
15. MONITOR EVERYTHING - ALERT ON WHAT’S IMPORTANT: network latency, conversions, database load, revenue, email bounce rate, performance, CPU load, cache hit ratio, API responsiveness
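The split between monitoring and alerting can be shown with a small collector that stores every sample but only raises an alert for metrics that have an explicit rule. This is a sketch, not the speaker's implementation: the `Monitor` class, its method names, and any thresholds passed to it are hypothetical.

```python
from typing import Callable, Dict, List


class Monitor:
    """Records every metric; alerts only on metrics with a registered rule."""

    def __init__(self) -> None:
        self.samples: Dict[str, List[float]] = {}
        self.rules: Dict[str, Callable[[float], bool]] = {}

    def alert_when(self, metric: str, predicate: Callable[[float], bool]) -> None:
        """Mark a metric as important by attaching an alert condition to it."""
        self.rules[metric] = predicate

    def record(self, metric: str, value: float) -> bool:
        """Store the sample; return True only if an alert rule fires."""
        self.samples.setdefault(metric, []).append(value)
        rule = self.rules.get(metric)
        return bool(rule and rule(value))
```

With a rule registered only for a business metric such as conversions, a spike in CPU load is still recorded and available for later correlation, but it does not wake anyone up.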
16. CONSTANT INSTRUMENTATION: monitoring is NOT a feature
17. CONTINUOUS IMPROVEMENT
18. AVAILABILITY: availability (determine the need) (deploys, special events). Which one?
19. BE A GOOD CITIZEN
@papa_fire