Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Production Debugging
Search
josh_robb
April 09, 2014
Technology
0
54
Production Debugging
What to do when shits on fire
josh_robb
April 09, 2014
Tweet
Share
More Decks by josh_robb
See All by josh_robb
Wellington Codecamp 2016
josh_robb
0
150
Codemania - Coupling, Cohesion, Connascence
josh_robb
1
590
Blameless Postmortems - Security by Inclusion
josh_robb
0
290
Effective Unit Testing
josh_robb
0
81
Coupling, Cohesion, Connascence
josh_robb
1
3k
Brown Field ASP.NET MVC with Webforms
josh_robb
1
150
Other Decks in Technology
See All in Technology
なぜ今、コスト最適化(倹約)が必要なのか? ~AWSでのコスト最適化の進め方「目的編」~
htan
1
110
データ民主化のための LLM 活用状況と課題紹介(IVRy の場合)
wxyzzz
2
680
Meshy Proプラン課金した
henjin0
0
250
FinTech SREのAWSサービス活用/Leveraging AWS Services in FinTech SRE
maaaato
0
130
Embedded SREの終わりを設計する 「なんとなく」から計画的な自立支援へ
sansantech
PRO
3
2.2k
名刺メーカーDevグループ 紹介資料
sansan33
PRO
0
1k
10Xにおける品質保証活動の全体像と改善 #no_more_wait_for_test
nihonbuson
PRO
2
220
Stately
mu7889yoon
1
110
AzureでのIaC - Bicep? Terraform? それ早く言ってよ会議
torumakabe
1
440
プロダクト成長を支える開発基盤とスケールに伴う課題
yuu26
4
1.3k
制約が導く迷わない設計 〜 信頼性と運用性を両立するマイナンバー管理システムの実践 〜
bwkw
3
890
Data Hubグループ 紹介資料
sansan33
PRO
0
2.7k
Featured
See All Featured
Put a Button on it: Removing Barriers to Going Fast.
kastner
60
4.2k
Skip the Path - Find Your Career Trail
mkilby
0
53
Improving Core Web Vitals using Speculation Rules API
sergeychernyshev
21
1.4k
How To Stay Up To Date on Web Technology
chriscoyier
791
250k
Connecting the Dots Between Site Speed, User Experience & Your Business [WebExpo 2025]
tammyeverts
11
820
WCS-LA-2024
lcolladotor
0
450
KATA
mclloyd
PRO
34
15k
Building Experiences: Design Systems, User Experience, and Full Site Editing
marktimemedia
0
410
Embracing the Ebb and Flow
colly
88
5k
Build The Right Thing And Hit Your Dates
maggiecrowley
38
3k
How to build a perfect <img>
jonoalderson
1
4.9k
How to Grow Your eCommerce with AI & Automation
katarinadahlin
PRO
0
100
Transcript
Production Debugging
Production Debugging What to do when shit’s on fire
Me @josh_robb Code Nanny @ Pushpay.com “We’re Hiring” (tm)
Overview Context Tools Demos Wrap-up
Context OODA - John Boyd
OODA - Observe - Orient - Decide - Act
Stressful situations Getting shot at in an aeroplane Fuck that
Getting shot at full stop! Flying a helicopter with no engine
How to train for stress?
How to train for stress?
None
Emotions Denial Fear Anxiety Fatigue (not an emotion - related)
Bad Judgement All of these things lead to poor quality
decision making.
What would NPH Do? First do no harm!
Don’t make things worse Evaluate your options - Can you
roll back? - Can you get a new job? - Can you roll forward?
MTTR Optimize for Mean Time To Recovery!
Don’t roll back in fear Roll forward to victory -
Etsy
Failure If you work somewhere failure is unacceptable (Apart from
Avionics or medical gear) Get a new job Seriously Failure in Tech is unavoidable Learn from it
Postmortems Postmortems are important Blameless ones are best http://codeascraft.com/2012/05/22/blameless- postmortems/
Demo Scenarios - Two of them - In one app
- Introducing
BrokenApp
BrokenApp Two Scenarios - Hang - Hi CPU
Tools - Process tools - Tracing Tools - Dump analysis
Tools - Process tools - Orient - Tracing Tools -
Observe - Dump analysis - Decide - ACT?
Process Tools Windows Server 2012/2008 - Resource Monitor! Also -
Task Manager - Process Explorer - Procmon
Process Tools Demo
Tracing Tools Perfview - Process sampling tool - Great for
what's happening over time? - Live profiling
Perfview XCopy deployable Offline analysis #FTW NOTE: Enable ASP.NET tracing
(DSIM) Demo!
Tracing Tools Message analyzer - Network traffic - Packet sniffing
Dump Analysis - Windbg (for masochists these days)
windbg
windbg - yeah - no - just say no -
debugdiag FTW!