Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
OpenTelemetry at AWS
Search
JBD
May 18, 2021
Programming
1
1.8k
OpenTelemetry at AWS
Register and watch this talk now!
https://o11yfest.org/attend
.
JBD
May 18, 2021
Tweet
Share
More Decks by JBD
See All by JBD
eBPF in Microservices Observability at eBPF Day
rakyll
1
2.1k
eBPF in Microservices Observability
rakyll
1
1.7k
Debugging Code Generation in Go
rakyll
5
1.5k
Are you ready for production?
rakyll
8
2.7k
Servers are doomed to fail
rakyll
3
1.5k
Serverless Containers
rakyll
1
240
Critical Path Analysis
rakyll
0
530
Monitoring and Debugging Containers
rakyll
2
1.1k
CPDD
rakyll
0
4.1k
Other Decks in Programming
See All in Programming
Recoilを剥がしている話
kirik
5
6.6k
DevFest Tokyo 2025 - Flutter のアプリアーキテクチャ現在地点
wasabeef
5
900
Go の GC の不得意な部分を克服したい
taiyow
2
770
Full stack testing :: basic to basic
up1
1
930
Scalaから始めるOpenFeature入門 / Scalaわいわい勉強会 #4
arthur1
1
300
RWC 2024 DICOM & ISO/IEC 2022
m_seki
0
210
MCP with Cloudflare Workers
yusukebe
2
220
rails stats で紐解く ANDPAD のイマを支える技術たち
andpad
1
290
rails statsで大解剖 🔍 “B/43流” のRailsの育て方を歴史とともに振り返ります
shoheimitani
2
930
ソフトウェアの振る舞いに着目し 複雑な要件の開発に立ち向かう
rickyban
0
890
KMP와 kotlinx.rpc로 서버와 클라이언트 동기화
kwakeuijin
0
140
見えないメモリを観測する: PHP 8.4 `pg_result_memory_size()` とSQL結果のメモリ管理
kentaroutakeda
0
140
Featured
See All Featured
Measuring & Analyzing Core Web Vitals
bluesmoon
4
170
Fontdeck: Realign not Redesign
paulrobertlloyd
82
5.3k
Distributed Sagas: A Protocol for Coordinating Microservices
caitiem20
330
21k
Building Applications with DynamoDB
mza
91
6.1k
GitHub's CSS Performance
jonrohan
1030
460k
Git: the NoSQL Database
bkeepers
PRO
427
64k
Rebuilding a faster, lazier Slack
samanthasiow
79
8.7k
The Cult of Friendly URLs
andyhume
78
6.1k
Testing 201, or: Great Expectations
jmmastey
40
7.1k
Typedesign – Prime Four
hannesfritz
40
2.4k
The Art of Delivering Value - GDevCon NA Keynote
reverentgeek
8
1.2k
Optimizing for Happiness
mojombo
376
70k
Transcript
@rakyll OpenTelemetry at AWS Jaana Dogan Principal Engineer, AWS
[email protected]
@rakyll Who? Jaana Dogan, AWS Explicit focus on instrumentation
@rakyll Five AWS stories... Too many agents Too many formats
Too little correlation Too many ways to propagate Too many products to support
@rakyll Too many agents 4-5 agents Friction in installation Operational
burden Friction in configuration delivery Performance penalty
@rakyll Too many formats EMF CloudWatch Prometheus statsd Vendor formats
... X-Ray Zipkin Jaeger Vendor formats ...
@rakyll Too little correlation Tool fatigue Disjoint views Missing metadata
Friction in troubleshooting
@rakyll Too many ways to propagate Lack of end-to-end traces
Missing label propagation No W3C TraceContext or B3 support No runtime propagation standards
@rakyll Too many products to support CloudWatch X-Ray Prometheus Elasticsearch/OpenSearch
New Relic, Datadog, Splunk, Honeycomb, Lightstep and more.
@rakyll What do we use? Specification Context Propagation Semantic Conventions
Data Model Protocol (OTLP) Collector Client Libraries
@rakyll What’s next? collector Managed on EC2, ECS, EKS, Lambda,
etc.
@rakyll What’s next? collector Managed on EC2, ECS, EKS, Lambda,
etc.
@rakyll What’s next? collector Managed on EC2, ECS, EKS, Lambda,
etc. OTLP Prometheus statsd X-Ray Jaeger Zipkin
@rakyll What’s next? collector Managed on EC2, ECS, EKS, Lambda,
etc. OTLP Prometheus statsd X-Ray Jaeger Zipkin CloudWatch Prometheus X-Ray Elastic/OpenSearch Jaeger Zipkin Vendors Raw storage
@rakyll What’s next? collector Managed on EC2, ECS, EKS, Lambda,
etc. OTLP Prometheus statsd X-Ray Jaeger Zipkin CloudWatch Prometheus X-Ray Jaeger Zipkin Vendors Raw storage enrich, transform, ...
@rakyll Container Insights now collected by OpenTelemetry.
@rakyll What do we use? Specification Context Propagation Semantic Conventions
Data Model Protocol (OTLP) Collector Client Libraries
@rakyll What works well? Flexible Composable Lightweight enough Holistic Legacy
protocol friendly Community
@rakyll What challenges us? Stability Custom builds Compatibility (Prometheus &
CloudWatch) Boilerplate in client libraries
@rakyll What are we working on next?
@rakyll Prometheus
@rakyll Prometheus Drop-in replacement for Prometheus Data model changes Remote
write compliance Discovery + scrape config compliance Kubernetes operator
@rakyll Components Container Insights receivers and processors CloudWatch histogram compatibility
CloudWatch Logs exporter S3 exporter
@rakyll Propagation Adopting 128-bit trace IDs in X-Ray Context propagation
in SQL
@rakyll Platforms EC2 ECS EKS Lambda (and control plane components...)
@rakyll Lambda support
@rakyll Others... eBPF Profiles Real time user monitoring Network diagnostics
Database performance
@rakyll One more thing...
@rakyll Exporting to vendors? Vended data streams CloudWatch Metric Streams
support OTLP CW Metrics S3 (in JSON or OTLP) Kinesis (in JSON or OTLP)
@rakyll It’s not a fork. It’s a snapshot for security,
performance, support.
@rakyll Thank you Jaana Dogan Principal Engineer, AWS
[email protected]