Resilient By Design
Smit Shah
April 21, 2015
How to build fault-tolerant systems that can withstand failure and stay responsive under load.
Transcript
RESILIENT BY DESIGN SMIT SHAH github.com/who828 twitter.com/who828
WHY DO WE CARE ABOUT RESILIENCE?
HOW DO WE BUILD A RESILIENT SYSTEM?
WE MUST DESIGN FOR RESILIENCE FROM THE START
RESILIENT DESIGN PATTERNS ▸ Fail fast ▸ Bounding ▸ Circuit breakers ▸ SteadyState ▸ Bulkheads
FAIL FAST
LITTLE’S LAW: L = λ * W
L = Length of the Queue
λ = Message Arrival Rate
W = Mean Time in System
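As a worked illustration of why Little's Law argues for failing fast, the sketch below plugs in hypothetical numbers (they are not from the talk). At a fixed arrival rate, the number of requests held in the system grows in proportion to the time each one spends there, so a slow dependency quietly inflates queues.

```python
def items_in_system(arrival_rate_per_s: float, mean_time_in_system_s: float) -> float:
    """Little's Law: L = lambda * W."""
    return arrival_rate_per_s * mean_time_in_system_s


# Hypothetical service receiving 100 requests per second.
healthy = items_in_system(100, 0.05)   # each request takes 50 ms
degraded = items_in_system(100, 2.0)   # a dependency slows requests to 2 s

print(f"healthy: about {healthy:.0f} requests in flight")
print(f"degraded: about {degraded:.0f} requests in flight")
```

With the same traffic, the degraded case holds roughly forty times as many requests in flight, which is the pressure that failing fast is meant to relieve.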
BOUNDING ▸ Use Timeouts ▸ Limit Memory/CPU ▸ Use Explicit Bounded Queues
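The bounding ideas above can be sketched with Python's standard `queue` module. This is a minimal illustration, not the speaker's implementation: the capacity `MAX_PENDING` and the helper names `submit` and `take` are assumptions made up for the example. The queue has an explicit upper bound, producers are rejected immediately when it is full, and consumers wait only for a bounded time.

```python
import queue

# Hypothetical capacity, chosen only for illustration.
MAX_PENDING = 2
pending = queue.Queue(maxsize=MAX_PENDING)


def submit(job: str) -> bool:
    """Accept the job if there is room; otherwise reject it immediately."""
    try:
        pending.put_nowait(job)
        return True
    except queue.Full:
        return False


def take(timeout_s: float = 0.1):
    """Wait a bounded time for work instead of blocking forever."""
    try:
        return pending.get(timeout=timeout_s)
    except queue.Empty:
        return None


# The third job exceeds the bound and is rejected rather than queued.
results = [submit(f"job-{i}") for i in range(3)]
print(results)
```

Rejecting at the door keeps memory use and waiting time bounded; the caller can then retry later or degrade gracefully.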
CIRCUIT BREAKERS
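A circuit breaker can be sketched in a few lines. The version below is a simplified, single-threaded illustration under assumed names (`CircuitBreaker`, `CircuitOpenError`, `failure_threshold`, `reset_timeout_s` are all hypothetical, not from the talk or any particular library). After a run of consecutive failures it rejects calls immediately, and once the reset timeout has elapsed it lets a trial call through.

```python
import time


class CircuitOpenError(Exception):
    """Raised when a call is rejected because the breaker is open."""


class CircuitBreaker:
    # Note: this sketch is not thread-safe.
    def __init__(self, failure_threshold=3, reset_timeout_s=30.0, clock=time.monotonic):
        self.failure_threshold = failure_threshold
        self.reset_timeout_s = reset_timeout_s
        self.clock = clock
        self.failures = 0
        self.opened_at = None  # None means the breaker is closed

    def call(self, func, *args, **kwargs):
        if self.opened_at is not None:
            if self.clock() - self.opened_at < self.reset_timeout_s:
                raise CircuitOpenError("circuit open; failing fast")
            # Timeout elapsed: half-open, so let this trial call through.
        try:
            result = func(*args, **kwargs)
        except Exception:
            self.failures += 1
            if self.opened_at is not None or self.failures >= self.failure_threshold:
                # Trip the breaker, or re-trip it after a failed trial call.
                self.opened_at = self.clock()
            raise
        # A success closes the breaker and clears the failure count.
        self.failures = 0
        self.opened_at = None
        return result
```

A caller would wrap each call to a remote dependency, for example `breaker.call(fetch_profile, user_id)` where `fetch_profile` is a hypothetical function, and treat `CircuitOpenError` as a signal to return a fallback.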
BULKHEADS
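One common way to build a bulkhead is to give each dependency its own small, fixed pool of concurrency slots, so that one saturated dependency cannot consume capacity needed by the others. The sketch below uses a semaphore per compartment; the class name `Bulkhead`, the error type, and the `payments` and `search` compartments are hypothetical names chosen for the example.

```python
import threading


class BulkheadFullError(Exception):
    """Raised when a compartment has no free slots."""


class Bulkhead:
    def __init__(self, max_concurrent: int):
        self._slots = threading.BoundedSemaphore(max_concurrent)

    def run(self, func, *args, **kwargs):
        # Do not wait for a slot: reject immediately when the compartment is full.
        if not self._slots.acquire(blocking=False):
            raise BulkheadFullError("compartment is full")
        try:
            return func(*args, **kwargs)
        finally:
            self._slots.release()


# Separate compartments: exhausting one leaves the other untouched.
payments = Bulkhead(max_concurrent=1)
search = Bulkhead(max_concurrent=10)
```

If every `payments` slot is busy, further payment calls are rejected while `search` calls continue to run, which is the isolation the pattern is after.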
STEADYSTATE ▸ Avoid fiddling ▸ Roll your logs ▸ Have a data archiving strategy
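For "roll your logs", Python's standard library offers `logging.handlers.RotatingFileHandler`, which caps both the size of each log file and the number of old files kept. The sketch below is only an illustration: the temporary directory, the logger name, and the small limits are assumptions chosen so the rotation is visible in a short run.

```python
import logging
import os
import tempfile
from logging.handlers import RotatingFileHandler

# Hypothetical location and limits, for demonstration only.
log_dir = tempfile.mkdtemp()
log_path = os.path.join(log_dir, "app.log")

# Keep the active file under about 1 KiB and retain at most 3 rotated files.
handler = RotatingFileHandler(log_path, maxBytes=1024, backupCount=3)
logger = logging.getLogger("steady_state_demo")
logger.setLevel(logging.INFO)
logger.addHandler(handler)
logger.propagate = False

for i in range(500):
    logger.info("request %d handled", i)

handler.close()

# Disk usage stays bounded no matter how many messages are written.
files = sorted(os.listdir(log_dir))
total_bytes = sum(os.path.getsize(os.path.join(log_dir, f)) for f in files)
print(files, total_bytes)
```

Because the oldest file is discarded on each rollover, the logs reach a steady state on disk without anyone having to log in and clean up by hand.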
Software design today only talks about what systems should do.
It doesn’t address the things systems should not do. — Michael T. Nygard
PUTTING IT TOGETHER ▸ Fail fast ▸ Bound your resources ▸ Use circuit breakers ▸ Isolate failures
QUESTIONS?
REFERENCES ▸ Release It! ▸ The Art of Capacity Planning ▸ Thinking for Programmers ▸ Responding in Timely Manner ▸ Reactive Service Levels ▸ Queuing Theory Calculator