Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Monitoring is dead
Search
Sebastian Montini
September 21, 2018
Technology
0
220
Monitoring is dead
Sebastian Montini
September 21, 2018
Tweet
Share
More Decks by Sebastian Montini
See All by Sebastian Montini
AWS Community Day BA 2019
sebamontini
0
60
Giraffe: our journey to support 1 million metrics per second
sebamontini
0
170
Nomad-PyCon2017
sebamontini
0
75
Atlas, a PaaS with batteries included
sebamontini
0
69
Nomad: The sequel
sebamontini
1
150
Nomad, a love story
sebamontini
0
150
Aurora: 5 Tb later ...
sebamontini
0
73
Ansible 202 - Sysarmy Meetup
sebamontini
0
88
Cloud Computing: All that glitters is not AWS - Nerdear.la 2016
sebamontini
0
57
Other Decks in Technology
See All in Technology
開発組織のための セキュアコーディング研修の始め方
flatt_security
3
2.4k
『衛星データ利用の方々にとって近いようで触れる機会のなさそうな小話 ~ 衛星搭載ソフトウェアと衛星運用ソフトウェア (実物) を動かしながらわいわいする編 ~』 @日本衛星データコミニティ勉強会
meltingrabbit
0
150
Developers Summit 2025 浅野卓也(13-B-7 LegalOn Technologies)
legalontechnologies
PRO
1
740
2.5Dモデルのすべて
yu4u
2
880
白金鉱業Meetup Vol.17_あるデータサイエンティストのデータマネジメントとの向き合い方
brainpadpr
6
770
関東Kaggler会LT: 人狼コンペとLLM量子化について
nejumi
3
600
利用終了したドメイン名の最強終活〜観測環境を育てて、分析・供養している件〜 / The Ultimate End-of-Life Preparation for Discontinued Domain Names
nttcom
2
200
あれは良かった、あれは苦労したB2B2C型SaaSの新規開発におけるCloud Spanner
hirohito1108
2
640
Developer Summit 2025 [14-D-1] Yuki Hattori
yuhattor
19
6.3k
30分でわかる『アジャイルデータモデリング』
hanon52_
9
2.7k
抽象化をするということ - 具体と抽象の往復を身につける / Abstraction and concretization
soudai
20
8.3k
2/18/25: Java meets AI: Build LLM-Powered Apps with LangChain4j
edeandrea
PRO
0
130
Featured
See All Featured
Designing Experiences People Love
moore
140
23k
[Rails World 2023 - Day 1 Closing Keynote] - The Magic of Rails
eileencodes
33
2.1k
Optimising Largest Contentful Paint
csswizardry
34
3.1k
Making the Leap to Tech Lead
cromwellryan
133
9.1k
Making Projects Easy
brettharned
116
6k
The Web Performance Landscape in 2024 [PerfNow 2024]
tammyeverts
4
410
Building an army of robots
kneath
303
45k
Rebuilding a faster, lazier Slack
samanthasiow
80
8.8k
Why Our Code Smells
bkeepers
PRO
336
57k
Visualizing Your Data: Incorporating Mongo into Loggly Infrastructure
mongodb
45
9.4k
Agile that works and the tools we love
rasmusluckow
328
21k
個人開発の失敗を避けるイケてる考え方 / tips for indie hackers
panda_program
100
18k
Transcript
@sebamontini MEDALLIA Monitoring is Dead And why you’re (probably) doing
it wrong
@sebamontini MEDALLIA
@sebamontini MEDALLIA Why?
@sebamontini MEDALLIA The big 5
@sebamontini MEDALLIA The big 5 ✓ CPU → uptime |
mailx -s “cpu” root ✓ MEM → free | mailx -s “mem” root ✓ DISK → (df -h; du -sh /home/*) | mailx -s “disk” root ✓ PROC → (ps -ef | grep important) | mailx -s root ✓ SYS → ping -c 4 google.com | mailx -s root
@sebamontini MEDALLIA OK: x < something
@sebamontini MEDALLIA WARN: something < x < something
@sebamontini MEDALLIA CRITICAL: x > something
@sebamontini MEDALLIA
@sebamontini MEDALLIA Observability
@sebamontini MEDALLIA A system is observable if you can determine
the behavior of the system based on it’s outputs.
@sebamontini MEDALLIA A system is observable if you can determine
the behavior of the system based on it’s outputs.
@sebamontini MEDALLIA A system is a set of connected components.
@sebamontini MEDALLIA A system is observable if you can determine
the behavior of the system based on it’s outputs.
@sebamontini MEDALLIA The manner in which a system acts is
it’s behavior.
@sebamontini MEDALLIA A system is observable if you can determine
the behavior of the system based on it’s outputs.
@sebamontini MEDALLIA The outputs of a system are the concrete
results of it’s behaviors.
@sebamontini MEDALLIA Monitoring is the action of observing and checking
the behavior and outputs of a system and it’s components over time.
@sebamontini MEDALLIA The (real) big 5
@sebamontini MEDALLIA Instrumentation Collection Storage Alerting Visualization
@sebamontini MEDALLIA Instrumentation
@sebamontini MEDALLIA Gauges Counters Histogram Timers
@sebamontini MEDALLIA Gauges A gauge is an instantaneous measurement of
a value. For example, we may want to measure the number of pending jobs in a queue
@sebamontini MEDALLIA Counters A counter is just a gauge that
you can increment or decrement its value. For example, we may want a more efficient way of measuring the pending job in a queue
@sebamontini MEDALLIA Histogram A histogram measures the statistical distribution of
values in a stream of data like median or percentiles
@sebamontini MEDALLIA Timers A timer measures both the rate that
a particular piece of code is called and the distribution of its duration.
@sebamontini MEDALLIA Collection
@sebamontini MEDALLIA
@sebamontini MEDALLIA Storage
@sebamontini MEDALLIA Storage
@sebamontini MEDALLIA Alerting
@sebamontini MEDALLIA Thresholds Dead man Delta Anomaly detection
@sebamontini MEDALLIA Visualization
@sebamontini MEDALLIA
@sebamontini MEDALLIA The big 5 ✓ Instrumentation → gauges, histograms,
timers, counters ✓ Collection → pull vs push ✓ Storage → Time Series DB ✓ Alerting → threshold, flatline, delta, anomaly ✓ Visualization → dashboards
@sebamontini MEDALLIA
@sebamontini MEDALLIA The Four Golden Signals
@sebamontini MEDALLIA Latency Traffic Errors Saturation
@sebamontini MEDALLIA Thanks