Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Monitoring and Debugging Containers
Search
Sponsored
·
SiteGround - Reliable hosting with speed, security, and support you can count on.
→
JBD
December 04, 2018
Programming
1.2k
2
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Monitoring and Debugging Containers
JBD
December 04, 2018
More Decks by JBD
See All by JBD
eBPF in Microservices Observability at eBPF Day
rakyll
1
2.2k
eBPF in Microservices Observability
rakyll
1
1.8k
OpenTelemetry at AWS
rakyll
1
1.9k
Debugging Code Generation in Go
rakyll
5
1.7k
Are you ready for production?
rakyll
8
3k
Servers are doomed to fail
rakyll
3
1.6k
Serverless Containers
rakyll
1
290
Critical Path Analysis
rakyll
0
700
CPDD
rakyll
0
4.3k
Other Decks in Programming
See All in Programming
net-httpのHTTP/2対応について
naruse
0
450
肥大化するレガシーコードに立ち向かうためのインターフェース分離と依存の逆転 / JJUG CCC 2026 Spring
hirokunimaeta
0
500
脅威をエンジニアリングの糧にして――現場編 / Turning Threats into Engineering Fuel — Field Edition
nrslib
0
250
ADKを使って簡単にAIエージェントを作ってみよう
k1mu21
0
230
Java × distroless で 軽量なコンテナイメージを / Java on Distroless
contour_gara
0
500
LLM Plugin for Node-REDの利用方法と開発について
404background
0
160
正しくソフトウェアを作る、前提を疑うための認知の視点 / doubt-premise
minodriven
17
6.1k
気づいたらRubyで100作品 ー クリエイティブコーディングが生活の一部になるまで / 100 Ruby Sketches Later: How Creative Coding Became Part of My Life
chobishiba
3
550
JJUG CCC 2026 Spring: JSpecify で実現する Kotlin フレンドリーな Java API 設計
ternbusty
1
140
並列実装の現場、2ヶ月間実務でAIを使い倒したAIもPCも私も限界が近い
ming_ayami
0
110
権限チェックの一貫性を型で守る TypeScript による多層防御
mnch
4
1.1k
Spec Driven Development | AI Summit Lisbon
danielsogl
PRO
0
160
Featured
See All Featured
Designing for Performance
lara
611
70k
The innovator’s Mindset - Leading Through an Era of Exponential Change - McGill University 2025
jdejongh
PRO
1
190
Measuring & Analyzing Core Web Vitals
bluesmoon
9
860
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
31
2.8k
Testing 201, or: Great Expectations
jmmastey
46
8.2k
Visual Storytelling: How to be a Superhuman Communicator
reverentgeek
2
550
Navigating Algorithm Shifts & AI Overviews - #SMXNext
aleyda
1
1.3k
How to Grow Your eCommerce with AI & Automation
katarinadahlin
PRO
1
200
No one is an island. Learnings from fostering a developers community.
thoeni
21
3.7k
New Earth Scene 8
popppiees
3
2.3k
Digital Projects Gone Horribly Wrong (And the UX Pros Who Still Save the Day) - Dean Schuster
uxyall
0
1.6k
Paper Plane (Part 1)
katiecoart
PRO
0
8.5k
Transcript
@rakyll monitoring and debugging containerized systems Jaana B. Dogan, Google
[email protected]
@rakyll me overly frustrated engineer 15+ years in networking systems
making systems more reliable
@rakyll the new old monitoring? (maybe)
@rakyll systems are growing... and you are not in control
@rakyll bare metal kernel network stack cloud stack libraries frameworks
your code
@rakyll
@rakyll complexity is inevitable
@rakyll container
@rakyll container
@rakyll container container
@rakyll container container
@rakyll container container message queue
@rakyll container container storage/database
@rakyll container container load balancer location=us-west location=europe-central
@rakyll host host container container load balancer
@rakyll container container container container container orchestrated hot mess
@rakyll areas of issues: - lack of locality - networking
- scheduling - dependencies
@rakyll bare metal kernel network stack cloud stack libraries frameworks
your code
@rakyll “my job is done here”
@rakyll after going to production... 1. monitor 2. alert 3.
troubleshoot 4. fix
@rakyll
@rakyll load balancer
@rakyll load balancer critical path
@rakyll discovering critical paths making them reliable then fast making
them debuggable
@rakyll
@rakyll Latency Numbers Every Programmer Should Know by Jeff Dean
@rakyll
@rakyll ping pong pongservice:6996 project: ping the pong server.
@rakyll opencensus.io
@rakyll not my team!
@rakyll where is the source code?
@rakyll who to page?
@rakyll who to page?
@rakyll give me the logs, runtime events, profiles...
@rakyll
@rakyll
@rakyll
@rakyll http://server:9999/tracez
@rakyll challenges...
@rakyll no wire standards
@rakyll
@rakyll traceparent: <version>-<traceid>-<spanid>-<opts> Example: traceparent: 00-0af7651916cd43dd8448eb211c80319c-b7ad6b7169203331-01
@rakyll no export standards
@rakyll areas of issues: - locality - networking - scheduling
- dependencies
@rakyll fin
[email protected]