Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
OpenTelemetry at AWS
Search
JBD
May 18, 2021
Programming
1
1.8k
OpenTelemetry at AWS
Register and watch this talk now!
https://o11yfest.org/attend
.
JBD
May 18, 2021
Tweet
Share
More Decks by JBD
See All by JBD
eBPF in Microservices Observability at eBPF Day
rakyll
1
2.1k
eBPF in Microservices Observability
rakyll
1
1.7k
Debugging Code Generation in Go
rakyll
5
1.5k
Are you ready for production?
rakyll
8
2.8k
Servers are doomed to fail
rakyll
3
1.5k
Serverless Containers
rakyll
1
240
Critical Path Analysis
rakyll
0
570
Monitoring and Debugging Containers
rakyll
2
1.1k
CPDD
rakyll
0
4.1k
Other Decks in Programming
See All in Programming
ABEMA iOS 大規模プロジェクトにおける段階的な技術刷新 / ABEMA iOS Technology Upgrade
akkyie
1
220
iOSでQRコード生成奮闘記
ktcryomm
2
110
Domain-Driven Design (Tutorial)
hschwentner
13
22k
Honoとフロントエンドの 型安全性について
yodaka
7
1.5k
React 19アップデートのために必要なこと
uhyo
8
1.5k
Serverless Rust: Your Low-Risk Entry Point to Rust in Production (and the benefits are huge)
lmammino
1
160
ソフトウェアエンジニアの成長
masuda220
PRO
12
2.1k
Rubyで始める関数型ドメインモデリング
shogo_tksk
0
140
データの整合性を保つ非同期処理アーキテクチャパターン / Async Architecture Patterns
mokuo
55
19k
LINE messaging APIを使ってGoogleカレンダーと連携した予約ツールを作ってみた
takumakoike
0
120
JAWS Days 2025のインフラ
komakichi
1
120
15分で学ぶDuckDBの可愛い使い方 DuckDBの最近の更新
notrogue
3
750
Featured
See All Featured
Save Time (by Creating Custom Rails Generators)
garrettdimon
PRO
29
1k
The Art of Programming - Codeland 2020
erikaheidi
53
13k
Unsuck your backbone
ammeep
669
57k
Statistics for Hackers
jakevdp
797
220k
Making Projects Easy
brettharned
116
6k
What’s in a name? Adding method to the madness
productmarketing
PRO
22
3.3k
Mobile First: as difficult as doing things right
swwweet
223
9.5k
Making the Leap to Tech Lead
cromwellryan
133
9.1k
Keith and Marios Guide to Fast Websites
keithpitt
411
22k
Product Roadmaps are Hard
iamctodd
PRO
51
11k
Typedesign – Prime Four
hannesfritz
41
2.5k
A Philosophy of Restraint
colly
203
16k
Transcript
@rakyll OpenTelemetry at AWS Jaana Dogan Principal Engineer, AWS jbd@amazon.com
@rakyll Who? Jaana Dogan, AWS Explicit focus on instrumentation
@rakyll Five AWS stories... Too many agents Too many formats
Too little correlation Too many ways to propagate Too many products to support
@rakyll Too many agents 4-5 agents Friction in installation Operational
burden Friction in configuration delivery Performance penalty
@rakyll Too many formats EMF CloudWatch Prometheus statsd Vendor formats
... X-Ray Zipkin Jaeger Vendor formats ...
@rakyll Too little correlation Tool fatigue Disjoint views Missing metadata
Friction in troubleshooting
@rakyll Too many ways to propagate Lack of end-to-end traces
Missing label propagation No W3C TraceContext or B3 support No runtime propagation standards
@rakyll Too many products to support CloudWatch X-Ray Prometheus Elasticsearch/OpenSearch
New Relic, Datadog, Splunk, Honeycomb, Lightstep and more.
@rakyll What do we use? Specification Context Propagation Semantic Conventions
Data Model Protocol (OTLP) Collector Client Libraries
@rakyll What’s next? collector Managed on EC2, ECS, EKS, Lambda,
etc.
@rakyll What’s next? collector Managed on EC2, ECS, EKS, Lambda,
etc.
@rakyll What’s next? collector Managed on EC2, ECS, EKS, Lambda,
etc. OTLP Prometheus statsd X-Ray Jaeger Zipkin
@rakyll What’s next? collector Managed on EC2, ECS, EKS, Lambda,
etc. OTLP Prometheus statsd X-Ray Jaeger Zipkin CloudWatch Prometheus X-Ray Elastic/OpenSearch Jaeger Zipkin Vendors Raw storage
@rakyll What’s next? collector Managed on EC2, ECS, EKS, Lambda,
etc. OTLP Prometheus statsd X-Ray Jaeger Zipkin CloudWatch Prometheus X-Ray Jaeger Zipkin Vendors Raw storage enrich, transform, ...
@rakyll Container Insights now collected by OpenTelemetry.
@rakyll What do we use? Specification Context Propagation Semantic Conventions
Data Model Protocol (OTLP) Collector Client Libraries
@rakyll What works well? Flexible Composable Lightweight enough Holistic Legacy
protocol friendly Community
@rakyll What challenges us? Stability Custom builds Compatibility (Prometheus &
CloudWatch) Boilerplate in client libraries
@rakyll What are we working on next?
@rakyll Prometheus
@rakyll Prometheus Drop-in replacement for Prometheus Data model changes Remote
write compliance Discovery + scrape config compliance Kubernetes operator
@rakyll Components Container Insights receivers and processors CloudWatch histogram compatibility
CloudWatch Logs exporter S3 exporter
@rakyll Propagation Adopting 128-bit trace IDs in X-Ray Context propagation
in SQL
@rakyll Platforms EC2 ECS EKS Lambda (and control plane components...)
@rakyll Lambda support
@rakyll Others... eBPF Profiles Real time user monitoring Network diagnostics
Database performance
@rakyll One more thing...
@rakyll Exporting to vendors? Vended data streams CloudWatch Metric Streams
support OTLP CW Metrics S3 (in JSON or OTLP) Kinesis (in JSON or OTLP)
@rakyll It’s not a fork. It’s a snapshot for security,
performance, support.
@rakyll Thank you Jaana Dogan Principal Engineer, AWS jbd@amazon.com