Lock in $30 Savings on PRO—Offer Ends Soon! ⏳
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Production Debugging
Search
josh_robb
April 09, 2014
Technology
0
50
Production Debugging
What to do when shits on fire
josh_robb
April 09, 2014
Tweet
Share
More Decks by josh_robb
See All by josh_robb
Wellington Codecamp 2016
josh_robb
0
140
Codemania - Coupling, Cohesion, Connascence
josh_robb
1
590
Blameless Postmortems - Security by Inclusion
josh_robb
0
290
Effective Unit Testing
josh_robb
0
75
Coupling, Cohesion, Connascence
josh_robb
1
3k
Brown Field ASP.NET MVC with Webforms
josh_robb
1
150
Other Decks in Technology
See All in Technology
プロダクトマネジメントの分業が生む「デリバリーの渋滞」を解消するTPMの越境
recruitengineers
PRO
3
710
ログ管理の新たな可能性?CloudWatchの新機能をご紹介
ikumi_ono
0
470
Kubernetes Multi-tenancy: Principles and Practices for Large Scale Internal Platforms
hhiroshell
0
110
直接メモリアクセス
koba789
0
280
小さな判断で育つ、大きな意思決定力 / 20251204 Takahiro Kinjo
shift_evolve
PRO
1
580
第4回 「メタデータ通り」 リアル開催
datayokocho
0
120
「Managed Instances」と「durable functions」で広がるAWS Lambdaのユースケース
lamaglama39
0
260
ML PM Talk #1 - ML PMの分類に関する考察
lycorptech_jp
PRO
1
700
法人支出管理領域におけるソフトウェアアーキテクチャに基づいたテスト戦略の実践
ogugu9
1
210
AWS Bedrock AgentCoreで作る 1on1支援AIエージェント 〜Memory × Evaluationsによる実践開発〜
yusukeshimizu
6
360
生成AI時代の自動E2Eテスト運用とPlaywright実践知_引持力哉
legalontechnologies
PRO
0
210
Debugging Edge AI on Zephyr and Lessons Learned
iotengineer22
0
110
Featured
See All Featured
Cheating the UX When There Is Nothing More to Optimize - PixelPioneers
stephaniewalter
285
14k
How Fast Is Fast Enough? [PerfNow 2025]
tammyeverts
3
390
RailsConf 2023
tenderlove
30
1.3k
No one is an island. Learnings from fostering a developers community.
thoeni
21
3.5k
Put a Button on it: Removing Barriers to Going Fast.
kastner
60
4.1k
Principles of Awesome APIs and How to Build Them.
keavy
127
17k
The Straight Up "How To Draw Better" Workshop
denniskardys
239
140k
Raft: Consensus for Rubyists
vanstee
141
7.2k
Product Roadmaps are Hard
iamctodd
PRO
55
12k
The Cost Of JavaScript in 2023
addyosmani
55
9.3k
Measuring & Analyzing Core Web Vitals
bluesmoon
9
700
Large-scale JavaScript Application Architecture
addyosmani
515
110k
Transcript
Production Debugging
Production Debugging What to do when shit’s on fire
Me @josh_robb Code Nanny @ Pushpay.com “We’re Hiring” (tm)
Overview Context Tools Demos Wrap-up
Context OODA - John Boyd
OODA - Observe - Orient - Decide - Act
Stressful situations Getting shot at in an aeroplane Fuck that
Getting shot at full stop! Flying a helicopter with no engine
How to train for stress?
How to train for stress?
None
Emotions Denial Fear Anxiety Fatigue (not an emotion - related)
Bad Judgement All of these things lead to poor quality
decision making.
What would NPH Do? First do no harm!
Don’t make things worse Evaluate your options - Can you
roll back? - Can you get a new job? - Can you roll forward?
MTTR Optimize for Mean Time To Recovery!
Don’t roll back in fear Roll forward to victory -
Etsy
Failure If you work somewhere failure is unacceptable (Apart from
Avionics or medical gear) Get a new job Seriously Failure in Tech is unavoidable Learn from it
Postmortems Postmortems are important Blameless ones are best http://codeascraft.com/2012/05/22/blameless- postmortems/
Demo Scenarios - Two of them - In one app
- Introducing
BrokenApp
BrokenApp Two Scenarios - Hang - Hi CPU
Tools - Process tools - Tracing Tools - Dump analysis
Tools - Process tools - Orient - Tracing Tools -
Observe - Dump analysis - Decide - ACT?
Process Tools Windows Server 2012/2008 - Resource Monitor! Also -
Task Manager - Process Explorer - Procmon
Process Tools Demo
Tracing Tools Perfview - Process sampling tool - Great for
what's happening over time? - Live profiling
Perfview XCopy deployable Offline analysis #FTW NOTE: Enable ASP.NET tracing
(DSIM) Demo!
Tracing Tools Message analyzer - Network traffic - Packet sniffing
Dump Analysis - Windbg (for masochists these days)
windbg
windbg - yeah - no - just say no -
debugdiag FTW!