Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Monitoring is dead
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Sebastian Montini
September 21, 2018
Technology
250
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Monitoring is dead
Sebastian Montini
September 21, 2018
More Decks by Sebastian Montini
See All by Sebastian Montini
AWS Community Day BA 2019
sebamontini
0
88
Giraffe: our journey to support 1 million metrics per second
sebamontini
0
190
Nomad-PyCon2017
sebamontini
0
100
Atlas, a PaaS with batteries included
sebamontini
0
83
Nomad: The sequel
sebamontini
1
180
Nomad, a love story
sebamontini
0
170
Aurora: 5 Tb later ...
sebamontini
0
92
Ansible 202 - Sysarmy Meetup
sebamontini
0
110
Cloud Computing: All that glitters is not AWS - Nerdear.la 2016
sebamontini
0
72
Other Decks in Technology
See All in Technology
クレデンシャル流出 ― 攻撃 3 時間 vs 復旧 10 時間。この非対称性にどう備えるか
kazzpapa3
2
150
20260619 私の日常業務での生成 AI 活用
masaruogura
1
230
iAEONの段階的リアーキテクト戦略 / iAEON's_Gradual_Re-architecture_Strategy
aeonpeople
0
230
2026 TECHFRESH 畢業分享會 - AI-Native 重塑軟體工程與虛擬講師
line_developers_tw
PRO
0
1.3k
SteampipeとExcel Power QueryでAWS構成定義書の作成を自動化する
jhashimoto
0
160
データサイエンスを価値につなげるプロジェクト設計 〜 DS一年目が現場で得た気づき 〜
ysd113
1
290
Lightning近況報告
kozy4324
0
210
AWS Security Hub CSPMの成功・失敗体験
cmusudakeisuke
0
310
IaC コードを資産へ:AWS CDK 社内ライブラリと横断展開 / aws-summit-japan-2026
gotok365
9
1.3k
SONiCで構築・運用する生成AI向けパブリッククラウドネットワーク ~実装編~
sonic
0
290
あなたの知らないPDFのアクセシビリティ
lycorptech_jp
PRO
0
220
200個のGitHubリポジトリを横断調査したかった
icck
0
140
Featured
See All Featured
[RailsConf 2023 Opening Keynote] The Magic of Rails
eileencodes
31
10k
Code Review Best Practice
trishagee
74
20k
Leveraging Curiosity to Care for An Aging Population
cassininazir
1
270
Everyday Curiosity
cassininazir
0
230
Documentation Writing (for coders)
carmenintech
77
5.4k
The Psychology of Web Performance [Beyond Tellerrand 2023]
tammyeverts
49
3.5k
Technical Leadership for Architectural Decision Making
baasie
3
420
Paper Plane (Part 1)
katiecoart
PRO
0
9.1k
[SF Ruby Conf 2025] Rails X
palkan
2
1.1k
Let's Do A Bunch of Simple Stuff to Make Websites Faster
chriscoyier
508
140k
Mind Mapping
helmedeiros
PRO
1
260
Tips & Tricks on How to Get Your First Job In Tech
honzajavorek
1
540
Transcript
@sebamontini MEDALLIA Monitoring is Dead And why you’re (probably) doing
it wrong
@sebamontini MEDALLIA
@sebamontini MEDALLIA Why?
@sebamontini MEDALLIA The big 5
@sebamontini MEDALLIA The big 5 ✓ CPU → uptime |
mailx -s “cpu” root ✓ MEM → free | mailx -s “mem” root ✓ DISK → (df -h; du -sh /home/*) | mailx -s “disk” root ✓ PROC → (ps -ef | grep important) | mailx -s root ✓ SYS → ping -c 4 google.com | mailx -s root
@sebamontini MEDALLIA OK: x < something
@sebamontini MEDALLIA WARN: something < x < something
@sebamontini MEDALLIA CRITICAL: x > something
@sebamontini MEDALLIA
@sebamontini MEDALLIA Observability
@sebamontini MEDALLIA A system is observable if you can determine
the behavior of the system based on it’s outputs.
@sebamontini MEDALLIA A system is observable if you can determine
the behavior of the system based on it’s outputs.
@sebamontini MEDALLIA A system is a set of connected components.
@sebamontini MEDALLIA A system is observable if you can determine
the behavior of the system based on it’s outputs.
@sebamontini MEDALLIA The manner in which a system acts is
it’s behavior.
@sebamontini MEDALLIA A system is observable if you can determine
the behavior of the system based on it’s outputs.
@sebamontini MEDALLIA The outputs of a system are the concrete
results of it’s behaviors.
@sebamontini MEDALLIA Monitoring is the action of observing and checking
the behavior and outputs of a system and it’s components over time.
@sebamontini MEDALLIA The (real) big 5
@sebamontini MEDALLIA Instrumentation Collection Storage Alerting Visualization
@sebamontini MEDALLIA Instrumentation
@sebamontini MEDALLIA Gauges Counters Histogram Timers
@sebamontini MEDALLIA Gauges A gauge is an instantaneous measurement of
a value. For example, we may want to measure the number of pending jobs in a queue
@sebamontini MEDALLIA Counters A counter is just a gauge that
you can increment or decrement its value. For example, we may want a more efficient way of measuring the pending job in a queue
@sebamontini MEDALLIA Histogram A histogram measures the statistical distribution of
values in a stream of data like median or percentiles
@sebamontini MEDALLIA Timers A timer measures both the rate that
a particular piece of code is called and the distribution of its duration.
@sebamontini MEDALLIA Collection
@sebamontini MEDALLIA
@sebamontini MEDALLIA Storage
@sebamontini MEDALLIA Storage
@sebamontini MEDALLIA Alerting
@sebamontini MEDALLIA Thresholds Dead man Delta Anomaly detection
@sebamontini MEDALLIA Visualization
@sebamontini MEDALLIA
@sebamontini MEDALLIA The big 5 ✓ Instrumentation → gauges, histograms,
timers, counters ✓ Collection → pull vs push ✓ Storage → Time Series DB ✓ Alerting → threshold, flatline, delta, anomaly ✓ Visualization → dashboards
@sebamontini MEDALLIA
@sebamontini MEDALLIA The Four Golden Signals
@sebamontini MEDALLIA Latency Traffic Errors Saturation
@sebamontini MEDALLIA Thanks