Monitoring is dead
Sebastian Montini
September 21, 2018
Transcript
@sebamontini MEDALLIA
Monitoring is Dead
And why you’re (probably) doing it wrong
Why?
The big 5
✓ CPU → uptime | mailx -s "cpu" root
✓ MEM → free | mailx -s "mem" root
✓ DISK → (df -h; du -sh /home/*) | mailx -s "disk" root
✓ PROC → (ps -ef | grep important) | mailx -s "proc" root
✓ SYS → ping -c 4 google.com | mailx -s "sys" root
OK: x < something
WARN: something < x < something
CRITICAL: x > something
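The three states above amount to comparing one reading against two fixed limits. A minimal sketch, assuming hypothetical `warn` and `crit` limits and assuming a reading that sits exactly on a limit counts as the worse state (the slides leave the boundaries open):

```python
def classify(value: float, warn: float, crit: float) -> str:
    """Map one reading to OK / WARN / CRITICAL using two static limits."""
    if warn > crit:
        raise ValueError("warn limit must not exceed crit limit")
    if value >= crit:   # assumption: the boundary value counts as CRITICAL
        return "CRITICAL"
    if value >= warn:   # assumption: the boundary value counts as WARN
        return "WARN"
    return "OK"


# Hypothetical load-average limits, for illustration only.
for load in (0.7, 4.2, 9.5):
    print(load, classify(load, warn=4.0, crit=8.0))
```

The check knows nothing about what the number means for users, which is the weakness the talk goes on to address.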
Observability
A system is observable if you can determine the behavior of the system based on its outputs.
A system is a set of connected components.
The manner in which a system acts is its behavior.
The outputs of a system are the concrete results of its behaviors.
Monitoring is the action of observing and checking the behavior and outputs of a system and its components over time.
The (real) big 5
Instrumentation, Collection, Storage, Alerting, Visualization
Instrumentation
Gauges, Counters, Histograms, Timers
Gauges
A gauge is an instantaneous measurement of a value. For example, we may want to measure the number of pending jobs in a queue.
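As a rough illustration of the queue example, a gauge can be modelled as a named callback that is sampled whenever someone asks for the value. The `Gauge` class, the metric name, and the `pending_jobs` queue below are all hypothetical and not taken from any particular metrics library:

```python
from collections import deque
from typing import Callable


class Gauge:
    """Reads the current value on demand; it keeps no history of its own."""

    def __init__(self, name: str, read: Callable[[], float]) -> None:
        self.name = name
        self._read = read

    def value(self) -> float:
        # Every call samples the underlying source again.
        return self._read()


# Hypothetical job queue, used only for illustration.
pending_jobs = deque(["job-1", "job-2", "job-3"])
queue_depth = Gauge("jobs.pending", lambda: len(pending_jobs))

print(queue_depth.value())  # 3
pending_jobs.popleft()
print(queue_depth.value())  # 2
```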
Counters
A counter is just a gauge whose value you can increment or decrement. For example, we may want a more efficient way of measuring the pending jobs in a queue.
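One way to read "more efficient" here: instead of asking the queue for its length on every sample, the code bumps a number when a job is enqueued and lowers it when a job is taken. A minimal sketch with a hypothetical `Counter` class; the lock is an assumption that producers and consumers run on different threads:

```python
import threading


class Counter:
    """An integer that can be incremented or decremented from any thread."""

    def __init__(self, name: str) -> None:
        self.name = name
        self._value = 0
        self._lock = threading.Lock()

    def inc(self, n: int = 1) -> None:
        with self._lock:
            self._value += n

    def dec(self, n: int = 1) -> None:
        self.inc(-n)

    @property
    def value(self) -> int:
        with self._lock:
            return self._value


pending = Counter("jobs.pending")  # hypothetical metric name
pending.inc()  # a job was enqueued
pending.inc()  # another job was enqueued
pending.dec()  # a worker picked one up
print(pending.value)  # 1
```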
Histogram
A histogram measures the statistical distribution of values in a stream of data, such as the median or percentiles.
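To make the percentile idea concrete, here is a deliberately naive sketch that keeps every observed value and computes nearest-rank percentiles. The `Histogram` class and the sample latencies are hypothetical; production libraries typically bound memory with sampling or fixed buckets, which this sketch does not attempt:

```python
import math
from typing import List


class Histogram:
    """Keeps every observation in memory and answers percentile queries."""

    def __init__(self) -> None:
        self._values: List[float] = []

    def update(self, value: float) -> None:
        self._values.append(value)

    def percentile(self, p: float) -> float:
        """Nearest-rank percentile for 0 < p <= 100."""
        if not self._values:
            raise ValueError("no observations recorded")
        if not 0 < p <= 100:
            raise ValueError("p must be in (0, 100]")
        ordered = sorted(self._values)
        rank = math.ceil(p * len(ordered) / 100)  # 1-based rank
        return ordered[rank - 1]


# Hypothetical request latencies in milliseconds.
latency = Histogram()
for ms in [12, 15, 11, 250, 14, 13, 16, 12, 18, 900]:
    latency.update(ms)

print(latency.percentile(50))  # 14
print(latency.percentile(90))  # 250
print(latency.percentile(99))  # 900
```

The gap between the median and the upper percentiles is exactly what a single average would hide.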
Timers
A timer measures both the rate at which a particular piece of code is called and the distribution of its duration.
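A timer can be sketched as a context manager that records how long each call took and how many calls happened since it was created. The `Timer` class below is hypothetical; the injectable `clock` parameter is an assumption added so the behaviour can be checked without sleeping:

```python
import time
from contextlib import contextmanager
from typing import Callable, Iterator, List


class Timer:
    """Records per-call durations and the overall call rate."""

    def __init__(self, clock: Callable[[], float] = time.perf_counter) -> None:
        self._clock = clock
        self._created = clock()
        self.durations: List[float] = []

    @contextmanager
    def time(self) -> Iterator[None]:
        start = self._clock()
        try:
            yield
        finally:
            # Record the duration even if the timed code raised.
            self.durations.append(self._clock() - start)

    @property
    def count(self) -> int:
        return len(self.durations)

    def rate(self) -> float:
        """Calls per second since the timer was created."""
        elapsed = self._clock() - self._created
        return self.count / elapsed if elapsed > 0 else 0.0


handler_timer = Timer()
with handler_timer.time():
    sum(range(100_000))  # stand-in for the code being measured
print(handler_timer.count, handler_timer.durations[0] >= 0)
```

The recorded durations could then be fed into a histogram to get percentiles rather than a single average.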
Collection
Storage
Alerting
Thresholds, Dead man, Delta, Anomaly detection
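Beyond static thresholds, the other three alert types can each be reduced to a small predicate. This is a sketch under stated assumptions: the function names and limits are hypothetical, and the z-score test stands in for anomaly detection only as the simplest possible example, not as a recommendation:

```python
import statistics
from typing import Sequence


def dead_man(last_seen: float, now: float, max_silence: float) -> bool:
    """Fire when no data point has arrived for more than max_silence seconds."""
    return now - last_seen > max_silence


def delta(previous: float, current: float, max_change: float) -> bool:
    """Fire when the value moved by more than max_change between two samples."""
    return abs(current - previous) > max_change


def anomaly(history: Sequence[float], current: float, z_limit: float = 3.0) -> bool:
    """Fire when current is more than z_limit standard deviations from the mean."""
    if len(history) < 2:
        return False  # not enough data to estimate spread
    mean = statistics.mean(history)
    stdev = statistics.stdev(history)
    if stdev == 0:
        return current != mean
    return abs(current - mean) / stdev > z_limit


recent = [10, 11, 9, 10, 10, 11, 9]  # hypothetical recent samples
print(dead_man(last_seen=100.0, now=400.0, max_silence=120.0))  # True
print(delta(previous=10, current=50, max_change=20))            # True
print(anomaly(recent, 50))                                      # True
print(anomaly(recent, 10.5))                                    # False
```

A dead man alert is the one that catches a collector that has silently stopped reporting, which a threshold on the metric itself never will.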
Visualization
The big 5
✓ Instrumentation → gauges, histograms, timers, counters
✓ Collection → pull vs push
✓ Storage → Time Series DB
✓ Alerting → threshold, flatline, delta, anomaly
✓ Visualization → dashboards
The Four Golden Signals
Latency, Traffic, Errors, Saturation
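The slide only names the four signals, so the following is one possible reading rather than the speaker's definition: latency as the median request duration, traffic as requests per second, errors as the share of 5xx responses, and saturation as in-flight requests over capacity. The `Request` record and every parameter name are hypothetical:

```python
import statistics
from dataclasses import dataclass
from typing import Dict, Sequence


@dataclass
class Request:
    duration_ms: float
    status: int  # HTTP status code


def golden_signals(
    requests: Sequence[Request],
    window_s: float,
    in_flight: int,
    capacity: int,
) -> Dict[str, float]:
    """Summarise one observation window into the four signals."""
    n = len(requests)
    errors = sum(1 for r in requests if r.status >= 500)
    return {
        "latency_ms_p50": statistics.median(r.duration_ms for r in requests) if n else 0.0,
        "traffic_rps": n / window_s,
        "error_rate": errors / n if n else 0.0,
        "saturation": in_flight / capacity,
    }


window = [Request(10, 200), Request(20, 200), Request(30, 500), Request(40, 200)]
print(golden_signals(window, window_s=2.0, in_flight=5, capacity=10))
```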
Thanks