Monitoring in Motion: Challenges of Monitoring Containers and Kubernetes
Ilan Rabinovitch
February 26, 2016
Transcript
Monitoring In Motion: Challenges in Monitoring Kubernetes & Containers
Cloud Native SF Meetup, Feb 25, 2016
Ilan Rabinovitch, Director, Community, Datadog
About Me
• Long time Datadog user.
• Prior to Datadog, built automation and monitoring tooling at Ooyala and Edmunds.com.
• SCALE and TXLF Co-Founder
Ilan Rabinovitch, Datadog
[email protected]
@irabinovitch
Agenda
• Monitoring 101 - Crash Course
• Challenges in Monitoring Dynamic Infrastructure
• Demo Time
• Questions?
Monitoring Everything
@honest_update on Twitter
Quick Overview of Datadog
• Monitoring for modern applications.
• Time series storage of metrics and events.
• Trending, alerting and anomaly detection.
• Hundreds of integrations out of the box.
Monitoring 101: Categorization More at: http://goo.gl/t1Rgcg
Monitoring 101: Focus on symptoms More at: http://goo.gl/t1Rgcg
Recurse until you find root cause. More at: http://goo.gl/t1Rgcg
Container Monitoring Challenges
https://www.datadoghq.com/docker-adoption/
Operational Complexity
• Average containers per host: N (N=4, 10/2015)
• N times as many “hosts” to manage
• Affects everything
Operational Complexity: Scale
100 instances, 400 containers
Operational Complexity: Scale
160 metrics per host, 640 metrics per host
Operational Complexity: Scale
100 instances, 64,000 metrics
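The scale figures on these slides can be reproduced with a few multiplications. The sketch below is one reading of the numbers, not something the slides state outright: it assumes each container behaves like a "host" that emits 160 metrics, and that the count scales linearly with N=4 containers per instance. The function and constant names are illustrative.

```python
# Figures taken from the slides; the linear scaling is an assumption.
INSTANCES = 100
CONTAINERS_PER_HOST = 4   # N=4 as of 10/2015
METRICS_PER_UNIT = 160    # "160 metrics per host" on the slide


def fleet_totals(instances, containers_per_host, metrics_per_unit):
    """Return (containers, metrics per instance, metrics across the fleet)."""
    containers = instances * containers_per_host
    metrics_per_instance = metrics_per_unit * containers_per_host
    fleet_metrics = metrics_per_instance * instances
    return containers, metrics_per_instance, fleet_metrics


totals = fleet_totals(INSTANCES, CONTAINERS_PER_HOST, METRICS_PER_UNIT)
print(totals)  # (400, 640, 64000)
```

Under that reading, the 400 containers, 640 metrics per host and 64,000 metrics on the slides all follow from the same two inputs.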
Host Centric vs Service Centric
Query Based Monitoring … … …
Query Based Monitoring
• Use tags, labels, etc. on your hosts and metrics.
• Pull in existing labels from your infrastructure (Region, Docker Images, K8S Tags, ...)
By using tags, auto-adapt!
Where is my application running?
What’s the total throughput of App X?
What’s its response time per tag (pod, version, DC)?
What’s the distribution of 5xx from Nginx per pod?
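The questions above can be answered by filtering and grouping on tags rather than on host names. The following is a minimal sketch of that idea in plain Python. The metric names, tag keys and sample values are hypothetical, and `query` is an illustrative helper, not the API of any monitoring product.

```python
from collections import defaultdict

# Hypothetical sample points: (metric name, value, tags).
POINTS = [
    ("app.requests", 120.0, {"app": "x", "pod": "web-1", "dc": "us-east"}),
    ("app.requests", 80.0, {"app": "x", "pod": "web-2", "dc": "us-east"}),
    ("app.requests", 50.0, {"app": "y", "pod": "api-1", "dc": "us-west"}),
    ("nginx.5xx", 3.0, {"app": "x", "pod": "web-1", "dc": "us-east"}),
    ("nginx.5xx", 1.0, {"app": "x", "pod": "web-2", "dc": "us-east"}),
]


def query(points, metric, where=None, group_by=None):
    """Sum `metric` over points whose tags match `where`, grouped by one tag."""
    where = where or {}
    totals = defaultdict(float)
    for name, value, tags in points:
        if name != metric:
            continue
        if any(tags.get(key) != wanted for key, wanted in where.items()):
            continue
        group = tags.get(group_by) if group_by else "*"
        totals[group] += value
    return dict(totals)


# Total throughput of App X, regardless of where it runs.
print(query(POINTS, "app.requests", where={"app": "x"}))
# Distribution of 5xx from Nginx per pod.
print(query(POINTS, "nginx.5xx", group_by="pod"))
```

Because the query names tags instead of hosts, a pod that is rescheduled onto another machine keeps matching as long as it carries the same tags.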
Auto Discovery
[Diagram: a Monitoring Agent container runs alongside Application Containers (A) and Off-The-Shelf Application containers (O: Redis, PostgreSQL, …). The agent draws on the containers list and metadata from the Docker API, additional metadata (pod names, RC, …) from the Kubelet API, integration configurations from a Config Backend, and host level metrics.]
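The discovery flow in this diagram can be sketched roughly as follows. This is an illustration under stated assumptions, not the agent's actual implementation: the container list is hard-coded where a real agent would call the Docker and Kubelet APIs, `CONFIG_TEMPLATES` stands in for the config backend, and every name and address below is hypothetical.

```python
# Hypothetical config backend: image name -> integration settings.
CONFIG_TEMPLATES = {
    "redis": {"check": "redis", "port": 6379},
    "postgres": {"check": "postgres", "port": 5432},
}

# Hypothetical container list with metadata (stand-in for the API responses).
CONTAINERS = [
    {"image": "redis:3.0", "ip": "10.0.0.5", "labels": {"pod": "cache-1"}},
    {"image": "registry.local/myapp:1.2", "ip": "10.0.0.6", "labels": {"pod": "web-1"}},
    {"image": "postgres:9.4", "ip": "10.0.0.7", "labels": {"pod": "db-1", "rc": "db"}},
]


def image_name(image):
    """Strip the registry path and tag: 'registry.local/redis:3.0' -> 'redis'."""
    return image.rsplit("/", 1)[-1].split(":", 1)[0]


def discover(containers, templates):
    """Build one check configuration per container with a known integration."""
    configs = []
    for container in containers:
        template = templates.get(image_name(container["image"]))
        if template is None:
            continue  # no matching integration for this image
        labels = container.get("labels", {})
        configs.append({
            "check": template["check"],
            "host": container["ip"],
            "port": template["port"],
            "tags": [f"{key}:{value}" for key, value in sorted(labels.items())],
        })
    return configs


for config in discover(CONTAINERS, CONFIG_TEMPLATES):
    print(config)
```

Re-running `discover` whenever the container list changes is what lets the checks follow containers as they start, stop or move, with pod and RC names carried along as tags.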
Some Pictures
• Dashboards and Metrics
• Alerts
• Sharing
Demo time