Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
AWS Community Day BA 2019
Search
Sponsored
·
Ship Features Fearlessly
Turn features on and off without deploys. Used by thousands of Ruby developers.
→
Sebastian Montini
June 18, 2019
Technology
84
0
Share
AWS Community Day BA 2019
Sebastian Montini
June 18, 2019
More Decks by Sebastian Montini
See All by Sebastian Montini
Giraffe: our journey to support 1 million metrics per second
sebamontini
0
180
Monitoring is dead
sebamontini
0
250
Nomad-PyCon2017
sebamontini
0
92
Atlas, a PaaS with batteries included
sebamontini
0
80
Nomad: The sequel
sebamontini
1
170
Nomad, a love story
sebamontini
0
170
Aurora: 5 Tb later ...
sebamontini
0
88
Ansible 202 - Sysarmy Meetup
sebamontini
0
110
Cloud Computing: All that glitters is not AWS - Nerdear.la 2016
sebamontini
0
70
Other Decks in Technology
See All in Technology
こんなアーキテクチャ図はいやだ / Anti-pattern in AWS Architecture Diagrams
naospon
1
450
名刺メーカーDevグループ 紹介資料
sansan33
PRO
0
1.1k
AI駆動1on1〜AIに自分を育ててもらう〜
yoshiakiyasuda
0
120
ワールドカフェI /チューターを改良する / World Café I and Improving the Tutors
ks91
PRO
0
320
Choose your own adventure in agentic design patterns
glaforge
0
140
#jawsugyokohama 100 LT11, "My AWS Journey 2011-2026 - kwntravel"
shinichirokawano
0
350
AIが書いたコードを信じられない問題 〜レビュー負荷を下げるために変えたこと〜 / The AI Code Trust Gap: Reducing the Review Burden
bitkey
PRO
7
1.3k
Azure Static Web Apps の自動ビルドがタイムアウトしやすくなった状況に対応した件/global-azure2026
thara0402
0
410
M5Stack CoreS3とZephyr(RTOS)で Edge AIっぽいことしてみた
iotengineer22
0
200
小説執筆のハーネスエンジニアリング
yoshitetsu
0
690
AIでAIをテストする - 音声AIエージェントの品質保証戦略
morix1500
1
120
サイボウズ 開発本部採用ピッチ / Cybozu Engineer Recruit
cybozuinsideout
PRO
10
78k
Featured
See All Featured
Game over? The fight for quality and originality in the time of robots
wayneb77
1
160
Neural Spatial Audio Processing for Sound Field Analysis and Control
skoyamalab
0
260
Learning to Love Humans: Emotional Interface Design
aarron
275
41k
Lessons Learnt from Crawling 1000+ Websites
charlesmeaden
PRO
1
1.2k
Chrome DevTools: State of the Union 2024 - Debugging React & Beyond
addyosmani
10
1.1k
Visualizing Your Data: Incorporating Mongo into Loggly Infrastructure
mongodb
49
9.9k
The Straight Up "How To Draw Better" Workshop
denniskardys
239
140k
The Myth of the Modular Monolith - Day 2 Keynote - Rails World 2024
eileencodes
27
3.4k
Build your cross-platform service in a week with App Engine
jlugia
234
18k
Into the Great Unknown - MozCon
thekraken
41
2.4k
Product Roadmaps are Hard
iamctodd
PRO
55
12k
Hiding What from Whom? A Critical Review of the History of Programming languages for Music
tomoyanonymous
2
760
Transcript
@sebamontini Project Giraffe our struggle journey to support anomaly detection
in over 1MM metrics per second
@sebamontini Sebastian Montini Sr. Manager @ Medallia AWS Buenos Aires
Community Leader
@sebamontini Why?
@sebamontini Observability
@sebamontini A system is observable if you can determine the
behavior of the system based on its outputs.
@sebamontini A system is observable if you can determine the
behavior of the system based on its outputs.
@sebamontini Monitoring is the action of observing and checking the
behavior of a system and its components over time.
@sebamontini What?
@sebamontini Instrumentation Collection Storage Alerting Visualization
@sebamontini The Four Golden Signals
@sebamontini Latency Traffic Errors Saturation
@sebamontini How?
@sebamontini Giraffe
@sebamontini “Giraffe is a scalable metrics platform, designed with reliability
and scalability in mind. Giraffe is based in Prometheus and Thanos, running on top of EKS within Amazon Web Services” What is it?
@sebamontini Thanos
@sebamontini Thanos
@sebamontini
@sebamontini So what?
@sebamontini
@sebamontini “Something that for an experienced human being might seem
easy to identify, but it can be tricky for a machine to tell apart” What’s an anomaly?
@sebamontini Exponentially Weighted Moving Average
@sebamontini Holt-Winters
@sebamontini What’s next?
@sebamontini ✓ Currently monitoring live production workloads ✓ 100% decoupled
from the monitoring platform What’s next? ❏ Default behavior based alerting ❏ Consensus based anomaly detection
@sebamontini Thanks!