Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Cloudy with a Chance of Insights:
Search
Posedio
PRO
January 18, 2024
Programming
0
17
Cloudy with a Chance of Insights:
Demystifying Observability in the Cloud
Posedio
PRO
January 18, 2024
Tweet
Share
More Decks by Posedio
See All by Posedio
The Future of Data Sharing
posedio
PRO
0
39
Is your spring boot application in Kubernetes secure?
posedio
PRO
0
45
Zero trust architecture with Keycloak
posedio
PRO
0
87
Site Reliability Engineering: Getting C-Level Support
posedio
PRO
0
12
Controlling Data in Gaia-X
posedio
PRO
0
13
The REWE Data Platform
posedio
PRO
0
28
Key Insights from Using Kafka in Large-Scale Projects
posedio
PRO
0
58
We tried to sell a cake for 1 Billion Euro... then build a data mesh
posedio
PRO
0
13
Handling round-the-clock availability
posedio
PRO
0
18
Other Decks in Programming
See All in Programming
Делим тесты между QA и разработчиком
mariyasaygina
0
470
Compose Multiplatform과 Ktor로 플랫폼의 경계를 넘어보자
kwakeuijin
0
210
CSC509 Lecture 03
javiergs
PRO
0
130
Go製CLIツールGatling Commanderによる負荷試験実施の自動化
okmtz
3
650
Infrastructure as Code でセキュリティを楽にしよう!
konokenj
5
1.4k
ROS 2のZenoh対応とZenohのROS 2対応
takasehideki
2
250
個人開発で使ってるやつを紹介する回
yohfee
1
640
C#および.NETに対する誤解をひも解く
ymd65536
0
210
GitHub Copilot Workspace で我々のアプリ開発がどう変わるのか?
shuyakinjo
0
840
[KR] Server Driven Compose With Firebase
skydoves
1
120
Frontend Magic mit CSS Houdini
joergneumann
0
420
CDKを活用した 大規模コンテナ移行 プロジェクトの紹介
yoyoyopg
0
230
Featured
See All Featured
[RailsConf 2023] Rails as a piece of cake
palkan
49
4.7k
Creatively Recalculating Your Daily Design Routine
revolveconf
217
12k
ReactJS: Keep Simple. Everything can be a component!
pedronauck
663
120k
What's new in Ruby 2.0
geeforr
341
31k
Building Better People: How to give real-time feedback that sticks.
wjessup
360
19k
How to Think Like a Performance Engineer
csswizardry
16
1k
A Tale of Four Properties
chriscoyier
155
22k
Building a Scalable Design System with Sketch
lauravandoore
459
32k
Reflections from 52 weeks, 52 projects
jeffersonlam
346
20k
Ruby is Unlike a Banana
tanoku
96
11k
Mobile First: as difficult as doing things right
swwweet
222
8.8k
The World Runs on Bad Software
bkeepers
PRO
65
11k
Transcript
None
2 ?
“CLOUDY, WITH A CHANCE OF INSIGHTS” Demystifying Observability in the
Cloud PHILIPP SEIFERT-KEHRER 4 July 2024
4 WHAT ABOUT ME? • Developer for 7+ years •
@ Posedio since 2022 • Blood type AB- • Love cats, crocheting & crocheting cats • Monoliths: hehe • Shared libs: not hehe Philipp Seifert-Kehrer
ROADMAP 1. Debugging like it’s 1999 2. Observability Xs and
Os 3. Tracing 101 4.Introducing OpenTelemetry 5. Where benefits?
6 SO, WHAT TO EXPECT FROM THIS TALK? (Re)Gain confidence
to deploy…
DEBUGGING LIKE IT’S 1999 1
8 CLASSICAL DEBUGGING PHD in domain knowledge Sherlock gene
9 LONGEST EMPLOYEE = BEST DEBUGGER? I haven’t moved away
from here in 15 years
10 READING THE TEA LEAVES
11 MONITORING = BYOC Bring Your Own Context
12 BUT, BUT, BUT RUNBOOKS • No documentation > wrong
documentation
13 HELLO CLOUD
14 BUT WHY MONITORING NOT CUTTIN’ IT? Built upon assumptions
Only useful for known unknowns Purely reactive Linear increase of cost Uptime is so 2001
2 OBSERVABILITY XS AND OS
16 WHAT ARE WE TALKIN’ ABOUT? “How well internal states
of a system can be inferred from knowledge of its external outputs”
17 PLATO WAS RIGHT
18 BUT FOR SOFTWARE? Understand inner workings of application Understand
any system state Understand only by observing/external tools Understand without need to ship new code
19 GANG OF 3 Centralized Logging Metrics & Dashboards Distributed
Tracing
20 LET THEM WITHOUT SIN CAST THE FIRST STONE Every
transaction No issues with cardinality Proportional growth of cost Centralized Logging ✅ ✅ 🙅 Metrics & Dashboards ✅ 🙅 ✅ Distribruted Tracing 🙅 ✅ ✅
21 GANG OF 3 GANG OF 3 Centralized Logging Metrics
& Dashboards Distributed Tracing
22 GIMME SOME EXPLORABILITY
TRACING 3
24 SERVICE GRAPH
25 TRACING EXAMPLE
26 ONE TRACE TO RULE ‘EM ALL
27 ANATOMY OF A SPAN
28 EVENTS
29 IT’S ALL ABOUT THE CONTEXT Context propagation Baggage Marshall
Context matters (… even after the 14th blunt)
OPENTELEMETRY 4
31 OTEL OFFERING … that you can’t refuse Open specification
Instrumentation APIs & SDKs Semantic conventions Agent to collect telemetry Optimized protocol OTLP
32 COLLECTOR
33 SEMANTIC CONVENTIONS
34 INSTRUMENTATION Black vs White box Application vs System Distributed
vs Monolith
35 DO’S AND DON’TS Do Don’t Feature flags Be aware
of PII Verbocity level Start off with automatic instrumentation Ignore potential resource overhead Non-standard format Too high default resolution
WHERE BENEFITS 5
37 WHY ALL OF THIS? Less interpretation = less discussion
Foster proactive debugging Reliability Good for application, organization & employee (mental) health No more manual, mental joins Learning opportunity instead of learning prerequisite Link anomalies to distinct version
38 A NEW NON-FUNCTIONAL REQUIREMENT Maintainability Availability Scalability Observability in
the cloud
39 BUT IT’S SO MUCH OVERHEAD, RIGHT?
40 SAMPLING
41 WHAT IF I JUST KEEP LOGGING EVERYTHING?
42 WRAPPING IT UP? Maintainability Availability Scalability Sustainability Better understanding
of architecture More accessible debugging Not all can be fixed… and also should not! Benefits without too much effort possible No panacea
43 Failure is inevitable … so when the shit goes
down, you better be ready
THANK YOU! POSEDIO GMBH +43 660 9154419
[email protected]
Weyringergasse 1-3,
1040 Wien, Millenium Park 4, 6980 Lustenau www.posedio.com Dilly dilly!