Cloudy with a Chance of Insights:
Demystifying Observability in the Cloud
Posedio
January 18, 2024
Transcript
2 ?
“CLOUDY, WITH A CHANCE OF INSIGHTS”
Demystifying Observability in the Cloud
PHILIPP SEIFERT-KEHRER
4 July 2024
4 WHAT ABOUT ME?
Philipp Seifert-Kehrer
• Developer for 7+ years
• @ Posedio since 2022
• Blood type AB-
• Love cats, crocheting & crocheting cats
• Monoliths: hehe
• Shared libs: not hehe
ROADMAP
1. Debugging like it’s 1999
2. Observability Xs and Os
3. Tracing 101
4. Introducing OpenTelemetry
5. Where benefits?
6 SO, WHAT TO EXPECT FROM THIS TALK?
(Re)Gain confidence to deploy…
1. DEBUGGING LIKE IT’S 1999
8 CLASSICAL DEBUGGING
• PhD in domain knowledge
• Sherlock gene
9 LONGEST EMPLOYEE = BEST DEBUGGER?
I haven’t moved away from here in 15 years
10 READING THE TEA LEAVES
11 MONITORING = BYOC
Bring Your Own Context
12 BUT, BUT, BUT RUNBOOKS
• No documentation > wrong documentation
13 HELLO CLOUD
14 BUT WHY MONITORING NOT CUTTIN’ IT?
• Built upon assumptions
• Only useful for known unknowns
• Purely reactive
• Linear increase of cost
• Uptime is so 2001
2. OBSERVABILITY XS AND OS
16 WHAT ARE WE TALKIN’ ABOUT?
“How well internal states of a system can be inferred from knowledge of its external outputs”
17 PLATO WAS RIGHT
18 BUT FOR SOFTWARE?
• Understand inner workings of application
• Understand any system state
• Understand only by observing/external tools
• Understand without need to ship new code
19 GANG OF 3
• Centralized Logging
• Metrics & Dashboards
• Distributed Tracing
20 LET THEM WITHOUT SIN CAST THE FIRST STONE
                       Every transaction   No issues with cardinality   Proportional growth of cost
Centralized Logging    ✅                  ✅                           🙅
Metrics & Dashboards   ✅                  🙅                           ✅
Distributed Tracing    🙅                  ✅                           ✅
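The cardinality column on slide 20 can be made concrete with a little arithmetic. A metrics backend typically stores one time series per unique combination of label values, so the series count is the product of the number of values per label. The sketch below is illustrative only: the label names and value counts are made up, not taken from the talk.

```python
def series_count(label_values):
    """Upper bound on time series: product of distinct values per label."""
    count = 1
    for values in label_values.values():
        count *= len(values)
    return count


# Hypothetical labels on a single request-counter metric.
labels = {
    "method": ["GET", "POST"],
    "status": ["200", "404", "500"],
    "region": ["eu", "us"],
}
low_cardinality = series_count(labels)

# Adding one high-cardinality label (a user ID) multiplies everything.
labels["user_id"] = [f"user-{i}" for i in range(10_000)]
high_cardinality = series_count(labels)

print(low_cardinality, high_cardinality)
```

Logs and traces carry such identifiers per event instead of per series, which is why the table treats cardinality as a weakness of metrics only.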
21 GANG OF 3
• Centralized Logging
• Metrics & Dashboards
• Distributed Tracing
22 GIMME SOME EXPLORABILITY
3. TRACING
24 SERVICE GRAPH
25 TRACING EXAMPLE
26 ONE TRACE TO RULE ‘EM ALL
27 ANATOMY OF A SPAN
28 EVENTS
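Slides 27 and 28 survive here only as titles, so the following is a minimal sketch of what a span and its events usually contain, not a reproduction of the slides. The field set (trace ID, span ID, parent span ID, name, timestamps, attributes, events) follows the general shape of the OpenTelemetry trace data model; the class and function names are hypothetical.

```python
import secrets
import time
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class SpanEvent:
    """A timestamped point-in-time annotation inside a span."""
    name: str
    timestamp_ns: int
    attributes: dict = field(default_factory=dict)


@dataclass
class Span:
    """One unit of work; spans sharing a trace_id form one trace."""
    name: str
    trace_id: str                                   # 16 bytes, hex encoded
    span_id: str = field(default_factory=lambda: secrets.token_hex(8))
    parent_span_id: Optional[str] = None            # None marks the root span
    start_ns: int = field(default_factory=time.time_ns)
    end_ns: Optional[int] = None
    attributes: dict = field(default_factory=dict)
    events: list = field(default_factory=list)

    def child(self, name):
        # A child keeps the trace ID and points back at its parent.
        return Span(name=name, trace_id=self.trace_id, parent_span_id=self.span_id)

    def add_event(self, name, **attributes):
        self.events.append(SpanEvent(name, time.time_ns(), attributes))

    def end(self):
        self.end_ns = time.time_ns()


def new_trace(name):
    """Start a root span with a fresh random trace ID."""
    return Span(name=name, trace_id=secrets.token_hex(16))


root = new_trace("GET /checkout")
db = root.child("SELECT orders")
db.add_event("cache.miss", key="order:42")
db.end()
root.end()
```

The parent pointer is what lets a backend rebuild the tree of calls from a flat stream of spans.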
29 IT’S ALL ABOUT THE CONTEXT
• Context propagation
• Baggage
• Marshall
Context matters (… even after the 14th blunt)
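Context propagation and baggage from slide 29 can be sketched with plain HTTP headers. The example assumes the W3C Trace Context `traceparent` header (version `00`, trace ID, parent span ID, flags) and the W3C `baggage` header; the slide does not say which propagator the speaker used. The `inject` and `extract` helpers are hypothetical names, and the parsing is deliberately simplified (only version `00`, no `tracestate`, no baggage properties).

```python
import re
from urllib.parse import quote, unquote

# version "00" - 32 hex trace ID - 16 hex parent span ID - 2 hex flags
_TRACEPARENT = re.compile(r"^00-([0-9a-f]{32})-([0-9a-f]{16})-([0-9a-f]{2})$")


def inject(headers, trace_id, span_id, sampled=True, baggage=None):
    """Write the current trace context into outgoing request headers."""
    flags = "01" if sampled else "00"
    headers["traceparent"] = f"00-{trace_id}-{span_id}-{flags}"
    if baggage:
        headers["baggage"] = ",".join(
            f"{quote(key)}={quote(value)}" for key, value in baggage.items()
        )
    return headers


def extract(headers):
    """Read trace context from incoming headers; None if absent or malformed."""
    match = _TRACEPARENT.match(headers.get("traceparent", ""))
    if match is None:
        return None
    trace_id, parent_span_id, flags = match.groups()
    baggage = {}
    for item in headers.get("baggage", "").split(","):
        if "=" in item:
            key, value = item.split("=", 1)
            baggage[unquote(key.strip())] = unquote(value.strip())
    return {
        "trace_id": trace_id,
        "parent_span_id": parent_span_id,
        "sampled": int(flags, 16) & 1 == 1,
        "baggage": baggage,
    }


outgoing = inject(
    {},
    trace_id="4bf92f3577b34da6a3ce929d0e0e4736",
    span_id="00f067aa0ba902b7",
    baggage={"tenant": "acme"},
)
incoming = extract(outgoing)
```

The receiving service starts its own span with `incoming["trace_id"]` and `incoming["parent_span_id"]`, which is how spans from different processes end up in one trace.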
4. OPENTELEMETRY
31 OTEL OFFERING … that you can’t refuse
• Open specification
• Instrumentation APIs & SDKs
• Semantic conventions
• Agent to collect telemetry
• Optimized protocol OTLP
32 COLLECTOR
33 SEMANTIC CONVENTIONS
34 INSTRUMENTATION
• Black vs White box
• Application vs System
• Distributed vs Monolith
35 DO’S AND DON’TS
Do
• Feature flags
• Be aware of PII
• Verbosity level
• Start off with automatic instrumentation
Don’t
• Ignore potential resource overhead
• Non-standard format
• Too high default resolution
5. WHERE BENEFITS
37 WHY ALL OF THIS?
• Less interpretation = less discussion
• Foster proactive debugging
• Reliability
• Good for application, organization & employee (mental) health
• No more manual, mental joins
• Learning opportunity instead of learning prerequisite
• Link anomalies to distinct version
38 A NEW NON-FUNCTIONAL REQUIREMENT
• Maintainability
• Availability
• Scalability
• Observability in the cloud
39 BUT IT’S SO MUCH OVERHEAD, RIGHT?
40 SAMPLING
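The sampling slide has no surviving text, so this is one common approach rather than the speaker's: head-based probabilistic sampling keyed on the trace ID. Because the decision is a pure function of the trace ID, every service that sees the same trace makes the same choice without coordination. The function name and the use of the low 64 bits are assumptions made for this sketch.

```python
def should_sample(trace_id: str, ratio: float) -> bool:
    """Keep roughly `ratio` of all traces, deterministically per trace ID.

    `trace_id` is a 32-character hex string; its low 64 bits are treated
    as a uniformly distributed number and compared against a threshold.
    """
    if not 0.0 <= ratio <= 1.0:
        raise ValueError("ratio must be between 0.0 and 1.0")
    threshold = int(ratio * (1 << 64))
    return int(trace_id[-16:], 16) < threshold


trace_id = "4bf92f3577b34da6a3ce929d0e0e4736"
keep = should_sample(trace_id, ratio=0.1)
```

Head-based sampling decides before the outcome of a request is known; keeping every failed request would need a tail-based decision made after the spans are collected.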
41 WHAT IF I JUST KEEP LOGGING EVERYTHING?
42 WRAPPING IT UP?
• Maintainability
• Availability
• Scalability
• Sustainability
• Better understanding of architecture
• More accessible debugging
• Not all can be fixed… and also should not!
• Benefits without too much effort possible
• No panacea
43 Failure is inevitable … so when the shit goes down, you better be ready
THANK YOU!
POSEDIO GMBH
+43 660 9154419
[email protected]
Weyringergasse 1-3, 1040 Wien
Millenium Park 4, 6980 Lustenau
www.posedio.com
Dilly dilly!