Leon Fayer - Ignite - Oncall for developers
devopsdaysraleigh
October 07, 2016
Technology
Transcript
1. ON CALL FOR DEVELOPERS @papa_fire
2. HELLO MY NAME IS LEON and I am a developer
3. user:/$~ sudo -s
   bash: Permission denied
   user is not in the sudoers file. This incident will be reported.
4. user:/$~ sudo -s
   root:/#~ F&*# YEAH!
5. WITH GREAT POWER COMES GREAT RESPONSIBILITY (and more work)
6. Should Developers be On Call?
7. Things that can go wrong: hardware, network, application, performance, process, security
8. ONLY ONE HAS TO SUFFER: alert, escalation, resolution
9. ACTIONABLE ALERTS: (1) can I fix it? (2) can I fix it tomorrow? (3) do I care?
10. ACTIONABLE ALERTS: (1) can I fix it? (2) can I fix it tomorrow? (3) do I care? (4) can someone else fix it?
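The four questions on the ACTIONABLE ALERTS slide read like a triage checklist. A minimal sketch of that checklist as code follows, assuming one possible mapping from answers to actions; the `Alert` fields, the `triage` function, the order of the checks, and the action names are all hypothetical and do not come from the talk.

```python
from dataclasses import dataclass


@dataclass
class Alert:
    name: str
    can_fix: bool         # 1. can I fix it?
    can_wait: bool        # 2. can I fix it tomorrow?
    matters: bool         # 3. do I care?
    someone_else: bool    # 4. can someone else fix it?


def triage(alert: Alert) -> str:
    """Return a hypothetical action for an alert (assumed policy, not the speaker's)."""
    if not alert.matters:
        return "drop"       # nobody cares: the alert should not exist
    if alert.can_wait:
        return "ticket"     # it can wait until tomorrow: do not wake anyone up
    if alert.can_fix:
        return "page"       # urgent and fixable by the person on call
    if alert.someone_else:
        return "reroute"    # urgent, but another person or team owns the fix
    return "escalate"       # urgent and nobody obvious can fix it


if __name__ == "__main__":
    print(triage(Alert("checkout errors", True, False, True, False)))
```

Under this assumed policy, only an alert that matters, cannot wait, and can be fixed by the person on call results in a page.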
11. …AND?
12. documentation documentation documentation
13. SAY NO TO UNDOCUMENTED ALERTS
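One way to act on "say no to undocumented alerts" is to reject alert definitions that carry no documentation link before they are deployed. The sketch below assumes alerts are plain dictionaries with an optional `runbook` field; that field name, the `undocumented` helper, and the sample alerts are hypothetical illustrations, not something shown in the slides.

```python
def undocumented(alerts: list[dict]) -> list[str]:
    """Return the names of alerts whose 'runbook' field is missing or blank."""
    return [a["name"] for a in alerts if not (a.get("runbook") or "").strip()]


# Hypothetical alert definitions, for illustration only.
ALERTS = [
    {"name": "api_responsiveness", "runbook": "docs/runbooks/api.md"},
    {"name": "cache_hit_ratio_low", "runbook": ""},
]

if __name__ == "__main__":
    missing = undocumented(ALERTS)
    if missing:
        print("undocumented alerts:", ", ".join(missing))
```

A check like this could run in code review or a build step, so that an alert without documentation never reaches the person on call.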
14. DEEP INSTRUMENTATION, top-down approach: (1) understand business (2) monitor business (3) correlate data
15. MONITOR EVERYTHING - ALERT ON WHAT'S IMPORTANT: network latency, conversions, database load, revenue, email bounce rate, performance, CPU load, cache hit ratio, API responsiveness
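The "monitor everything, alert on what's important" idea can be sketched as two separate concerns: every metric is recorded, but only a short list of metrics has alert rules. The `Monitor` class, the metric names, and the thresholds below are hypothetical and chosen only for illustration; which metrics count as important is a decision the slides leave to the business.

```python
from collections import defaultdict

# Hypothetical alert rules: only business-critical metrics get one.
ALERT_RULES = {
    "revenue_per_minute": lambda v: v < 100.0,  # assumed revenue floor
    "api_response_ms": lambda v: v > 500.0,     # assumed latency ceiling
}


class Monitor:
    def __init__(self):
        # Every metric is stored, whether or not it has an alert rule.
        self.samples = defaultdict(list)

    def record(self, name: str, value: float) -> None:
        self.samples[name].append(value)

    def firing(self) -> list[str]:
        """Names of alert rules breached by the latest sample of their metric."""
        return sorted(
            name
            for name, breached in ALERT_RULES.items()
            if self.samples.get(name) and breached(self.samples[name][-1])
        )


if __name__ == "__main__":
    m = Monitor()
    m.record("cpu_load", 0.97)           # recorded, never alerts
    m.record("cache_hit_ratio", 0.42)    # recorded, never alerts
    m.record("api_response_ms", 120.0)   # has a rule, within threshold
    m.record("revenue_per_minute", 35.0) # has a rule, breached
    print(m.firing())
```

In this sketch a CPU spike is still available for later correlation, but only the drop in revenue produces an alert.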
16. CONSTANT INSTRUMENTATION: monitoring is NOT a feature
17. CONTINUOUS IMPROVEMENT
18. AVAILABILITY: which one? availability (determine the need) (deploys, special events)
19. BE A GOOD CITIZEN
@papa_fire