Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Monitoring and Debugging Containers
Search
Sponsored
·
Ship Features Fearlessly
Turn features on and off without deploys. Used by thousands of Ruby developers.
→
JBD
December 04, 2018
Programming
2
1.1k
Monitoring and Debugging Containers
JBD
December 04, 2018
Tweet
Share
More Decks by JBD
See All by JBD
eBPF in Microservices Observability at eBPF Day
rakyll
1
2.2k
eBPF in Microservices Observability
rakyll
1
1.7k
OpenTelemetry at AWS
rakyll
1
1.9k
Debugging Code Generation in Go
rakyll
5
1.6k
Are you ready for production?
rakyll
8
2.9k
Servers are doomed to fail
rakyll
3
1.6k
Serverless Containers
rakyll
1
270
Critical Path Analysis
rakyll
0
680
CPDD
rakyll
0
4.2k
Other Decks in Programming
See All in Programming
CSC307 Lecture 02
javiergs
PRO
1
770
AI & Enginnering
codelynx
0
110
Basic Architectures
denyspoltorak
0
660
フルサイクルエンジニアリングをAI Agentで全自動化したい 〜構想と現在地〜
kamina_zzz
0
400
OCaml 5でモダンな並列プログラミングを Enjoyしよう!
haochenx
0
140
Lambda のコードストレージ容量に気をつけましょう
tattwan718
0
110
Unicodeどうしてる? PHPから見たUnicode対応と他言語での対応についてのお伺い
youkidearitai
PRO
1
1.1k
組織で育むオブザーバビリティ
ryota_hnk
0
170
dchart: charts from deck markup
ajstarks
3
990
AI Agent の開発と運用を支える Durable Execution #AgentsInProd
izumin5210
7
2.3k
責任感のあるCloudWatchアラームを設計しよう
akihisaikeda
3
160
HTTPプロトコル正しく理解していますか? 〜かわいい猫と共に学ぼう。ฅ^•ω•^ฅ ニャ〜
hekuchan
2
680
Featured
See All Featured
Between Models and Reality
mayunak
1
180
JAMstack: Web Apps at Ludicrous Speed - All Things Open 2022
reverentgeek
1
320
AI Search: Implications for SEO and How to Move Forward - #ShenzhenSEOConference
aleyda
1
1.1k
Groundhog Day: Seeking Process in Gaming for Health
codingconduct
0
91
How to Grow Your eCommerce with AI & Automation
katarinadahlin
PRO
0
100
Navigating the moral maze — ethical principles for Al-driven product design
skipperchong
2
240
Responsive Adventures: Dirty Tricks From The Dark Corners of Front-End
smashingmag
254
22k
Design in an AI World
tapps
0
140
A better future with KSS
kneath
240
18k
The Straight Up "How To Draw Better" Workshop
denniskardys
239
140k
brightonSEO & MeasureFest 2025 - Christian Goodrich - Winning strategies for Black Friday CRO & PPC
cargoodrich
3
97
Making Projects Easy
brettharned
120
6.6k
Transcript
@rakyll monitoring and debugging containerized systems Jaana B. Dogan, Google
[email protected]
@rakyll me overly frustrated engineer 15+ years in networking systems
making systems more reliable
@rakyll the new old monitoring? (maybe)
@rakyll systems are growing... and you are not in control
@rakyll bare metal kernel network stack cloud stack libraries frameworks
your code
@rakyll
@rakyll complexity is inevitable
@rakyll container
@rakyll container
@rakyll container container
@rakyll container container
@rakyll container container message queue
@rakyll container container storage/database
@rakyll container container load balancer location=us-west location=europe-central
@rakyll host host container container load balancer
@rakyll container container container container container orchestrated hot mess
@rakyll areas of issues: - lack of locality - networking
- scheduling - dependencies
@rakyll bare metal kernel network stack cloud stack libraries frameworks
your code
@rakyll “my job is done here”
@rakyll after going to production... 1. monitor 2. alert 3.
troubleshoot 4. fix
@rakyll
@rakyll load balancer
@rakyll load balancer critical path
@rakyll discovering critical paths making them reliable then fast making
them debuggable
@rakyll
@rakyll Latency Numbers Every Programmer Should Know by Jeff Dean
@rakyll
@rakyll ping pong pongservice:6996 project: ping the pong server.
@rakyll opencensus.io
@rakyll not my team!
@rakyll where is the source code?
@rakyll who to page?
@rakyll who to page?
@rakyll give me the logs, runtime events, profiles...
@rakyll
@rakyll
@rakyll
@rakyll http://server:9999/tracez
@rakyll challenges...
@rakyll no wire standards
@rakyll
@rakyll traceparent: <version>-<traceid>-<spanid>-<opts> Example: traceparent: 00-0af7651916cd43dd8448eb211c80319c-b7ad6b7169203331-01
@rakyll no export standards
@rakyll areas of issues: - locality - networking - scheduling
- dependencies
@rakyll fin
[email protected]