AWS Community Day BA 2019
Sebastian Montini
June 18, 2019
Transcript
@sebamontini Project Giraffe: our struggle/journey to support anomaly detection in over 1MM metrics per second
Sebastian Montini (@sebamontini), Sr. Manager @ Medallia, AWS Buenos Aires Community Leader
Why?
Observability
A system is observable if you can determine the behavior of the system based on its outputs.
Monitoring is the action of observing and checking the behavior of a system and its components over time.
What?
Instrumentation, Collection, Storage, Alerting, Visualization
The Four Golden Signals
Latency, Traffic, Errors, Saturation
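As a rough illustration of the four golden signals, the sketch below summarizes one observation window of request records into latency, traffic, errors, and saturation. The `Request` type, the `golden_signals` function, and its parameters are hypothetical names chosen for this example; the slides do not show how Giraffe computes these values.

```python
import math
from dataclasses import dataclass


@dataclass
class Request:
    duration_ms: float  # time taken to serve the request
    status: int         # HTTP status code of the response


def golden_signals(requests, window_s, in_flight, capacity):
    """Summarize one observation window into the four golden signals.

    All parameter names are illustrative assumptions:
    window_s  -- length of the window in seconds
    in_flight -- requests being served concurrently right now
    capacity  -- maximum concurrent requests the service can handle
    """
    durations = sorted(r.duration_ms for r in requests)
    n = len(durations)
    # Nearest-rank 99th percentile; None when the window is empty.
    p99 = durations[max(0, math.ceil(0.99 * n) - 1)] if n else None
    errors = sum(1 for r in requests if r.status >= 500)
    return {
        "latency_p99_ms": p99,                      # Latency
        "traffic_rps": n / window_s,                # Traffic
        "error_rate": errors / n if n else 0.0,     # Errors
        "saturation": in_flight / capacity,         # Saturation
    }


window = [Request(10, 200), Request(20, 200), Request(400, 500), Request(30, 200)]
signals = golden_signals(window, window_s=2, in_flight=8, capacity=10)
```

In a Prometheus-based platform these values would normally come from queries over counters and histograms rather than raw request lists; the in-memory version only makes the definitions concrete.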
How?
Giraffe
What is it?
“Giraffe is a scalable metrics platform, designed with reliability and scalability in mind. Giraffe is based on Prometheus and Thanos, running on top of EKS within Amazon Web Services.”
Thanos
So what?
What’s an anomaly?
“Something that for an experienced human being might seem easy to identify, but it can be tricky for a machine to tell apart”
Exponentially Weighted Moving Average
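The slide names the technique without detail, so here is a minimal sketch of how an exponentially weighted moving average can drive anomaly detection. It assumes the common recurrence s_t = alpha * x_t + (1 - alpha) * s_(t-1) and flags a point when it lies more than `k` exponentially weighted standard deviations from the running mean. The function names and the `alpha`, `k`, and `warmup` parameters are assumptions for this example, not Giraffe's actual configuration.

```python
import math


def ewma(values, alpha):
    """Return the exponentially weighted moving average of `values`."""
    smoothed = []
    for x in values:
        prev = smoothed[-1] if smoothed else x  # seed with the first sample
        smoothed.append(alpha * x + (1 - alpha) * prev)
    return smoothed


def ewma_anomalies(values, alpha=0.3, k=3.0, warmup=5):
    """Return indices whose value deviates more than k EW standard deviations.

    Each point is compared against the mean and variance computed from the
    points before it, so an outlier cannot mask itself.
    """
    if not values:
        return []
    mean, var = values[0], 0.0
    flagged = []
    for i, x in enumerate(values[1:], start=1):
        diff = x - mean
        std = math.sqrt(var)
        if i >= warmup and std > 0 and abs(diff) > k * std:
            flagged.append(i)
        # Incremental update of the exponentially weighted mean and variance.
        incr = alpha * diff
        mean += incr
        var = (1 - alpha) * (var + diff * incr)
    return flagged


series = [10, 11, 10, 9, 10, 11, 10, 9, 10, 50, 10]
spikes = ewma_anomalies(series)
```

A larger `alpha` reacts faster to level changes but also forgets the baseline sooner, so after a spike the widened variance suppresses alerts for a while. EWMA on its own has no notion of seasonality.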
Holt-Winters
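Holt-Winters (triple exponential smoothing) extends a moving average with trend and seasonal components, which suits metrics with daily or weekly cycles. The sketch below is a simplified additive variant with a naive initialization from the first two seasons. The function name, parameters, and initialization scheme are assumptions for illustration; the slides do not describe the implementation used in Giraffe.

```python
def holt_winters_additive(series, season_len, alpha, beta, gamma, horizon):
    """Additive Holt-Winters smoothing.

    Returns (fitted, forecast): one-step-ahead predictions for every point
    in `series`, and `horizon` predictions past its end.
    """
    m = season_len
    if len(series) < 2 * m:
        raise ValueError("need at least two full seasons to initialize")

    # Naive initialization: level from the first season, trend from the
    # average change between the first two seasons, seasonal offsets from
    # the first season's deviation from its mean.
    level = sum(series[:m]) / m
    trend = (sum(series[m:2 * m]) - sum(series[:m])) / (m * m)
    seasonal = [x - level for x in series[:m]]

    fitted = []
    for t, x in enumerate(series):
        s = seasonal[t % m]
        fitted.append(level + trend + s)  # prediction made before seeing x
        prev_level = level
        level = alpha * (x - s) + (1 - alpha) * (level + trend)
        trend = beta * (level - prev_level) + (1 - beta) * trend
        seasonal[t % m] = gamma * (x - level) + (1 - gamma) * s

    n = len(series)
    forecast = [
        level + (h + 1) * trend + seasonal[(n + h) % m] for h in range(horizon)
    ]
    return fitted, forecast


history = [1, 2, 3, 4] * 3  # perfectly periodic, no trend
fitted, forecast = holt_winters_additive(
    history, season_len=4, alpha=0.5, beta=0.5, gamma=0.5, horizon=4
)
```

For anomaly detection, one common approach is to compare each observed value against its entry in `fitted` and flag points whose residual exceeds a tolerance band derived from recent residuals.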
What’s next?
✓ Currently monitoring live production workloads
✓ 100% decoupled from the monitoring platform
❏ Default behavior-based alerting
❏ Consensus-based anomaly detection
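The slides list consensus-based anomaly detection as future work without describing a design. One possible reading, shown here purely as an assumption, is a quorum vote: several independent detectors each report the indices they consider anomalous, and a point is flagged only when enough of them agree. The `consensus_anomalies` function and its `quorum` parameter are hypothetical.

```python
from collections import Counter


def consensus_anomalies(detections, quorum):
    """Return indices flagged by at least `quorum` detectors.

    detections -- one iterable of flagged indices per detector
    quorum     -- minimum number of detectors that must agree
    """
    votes = Counter()
    for flagged in detections:
        votes.update(set(flagged))  # each detector votes once per index
    return sorted(i for i, count in votes.items() if count >= quorum)


# Three hypothetical detectors disagree on indices 3 and 12 but agree on 9.
agreed = consensus_anomalies([[3, 9], [9], [9, 12]], quorum=2)
```

Requiring agreement trades some sensitivity for fewer false alerts, which matters when each detector has different blind spots.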
Thanks!