O11yfest 2021: Exporting Data with OpenTelemetry by Steve Flanders

Exporting Data with O11yfest 2021

Cloud Native Telemetry foreach(language) foreach(language) collectors, sidecars, etc w3c trace-context,
wire formats for trace data, metrics, logs, etc Telemetry “verticals” Telemetry “layers” Tracing Metrics Logs, etc Instrumentation APIs Canonical implementations Data infrastructure Interop formats

@smflanders flands https:/ /sflanders.net Steve Flanders Director of Engineering, Splunk
OpenTelemetry Website Maintainer OpenTelemetry Collector Triager CNCF SIG-Observability Member Previously: • Head of Product, Omnition • Senior Manager for Logs, VMware

Components & Architecture

OpenTelemetry Components API, SDK, Data Single library per language for
all signals Receive, process, and export data Specification Instrumentation Libraries Collector

Application Instrumentation Libraries Otel Inst. Library API SDK Processors Exporters
RUM Serverless

OpenTelemetry Reference Architecture

But why?

Application Instrumentation

Host Application traces metrics logs SaaS Back-end On-prem Back-end •
Signals may be push or pull; back-end determines whether scraping/tailing is possible • If instrumentation direct to back-end then every application, and language-speciﬁc, needs to: ◦ Authenticate ◦ Batch ◦ Compress ◦ CRUD metadata ◦ Encryption ◦ Proxy ◦ Queue ◦ Retry • In addition, host telemetry is desirable and correlation can be challenging metrics logs

Host Application traces metrics logs SaaS Back-end On-prem Back-end •
Signals may be push or pull; back-end determines whether scraping/tailing is possible • If instrumentation to collector then every application, and language-speciﬁc, needs to (everything else optional): ◦ Batch ◦ Queue ◦ Retry • In addition, the collector can capture host telemetry and correlate with application telemetry as well as convert between formats Collector metrics + logs traces metrics + logs

Summary: Exporting Application Instrumentation • Application instrumentation direct to back-end
◦ Instrumentation responsible for full exporting lifecycle; requires even more resources ◦ Instrumentation export features are language-speciﬁc; mileage may vary ◦ Adding resource information may prove challenging or impossible ◦ Summary: great for proof-of-concepts, be careful in production environments

Summary: Exporting Application Instrumentation • Application instrumentation direct to back-end
◦ Instrumentation responsible for full exporting lifecycle; requires even more resources ◦ Instrumentation export features are language-speciﬁc; mileage may vary ◦ Adding resource information may prove challenging or impossible ◦ Summary: great for proof-of-concepts, be careful in production environments • Application instrumentation to local Collector (agent) to back-end ◦ Instrumentation responsible for minimal exporting lifecycle ◦ Central location for most conﬁguration changes: Collector ◦ Resource enrichment straightforward; able to translate between formats ◦ Summary: generally recommended

Summary: Exporting Application Instrumentation • Application instrumentation to local Collector
(agent) to remote Collector (gateway) to back-end ◦ Often used for advanced use-cases such as tail-based sampling or metric aggregations ◦ May help security by limiting egress and centralizing token management ◦ Can oﬀer more queuing and retry ◦ Summary: not required, but may provide value depending on requirements

Summary: Exporting Application Instrumentation • Application instrumentation to local Collector
(agent) to remote Collector (gateway) to back-end ◦ Often used for advanced use-cases such as tail-based sampling or metric aggregations ◦ May help security by limiting egress and centralizing token management ◦ Can oﬀer more queuing and retry ◦ Summary: not required, but may provide value depending on requirements • Application instrumentation to remote Collector (gateway) to back-end ◦ Pretty much same as application instrumentation direct to back-end ◦ Can oﬀer client-side capabilities ◦ Summary: may be helpful for proof-of-concepts, be careful in production environments

What about serverless? NOTE: Serverless platforms oﬀer a variety of
feature sets, only common ones covered here.

VPC Function traces metrics logs SaaS Back-end On-prem Back-end •
With serverless, signals are typically pushed or handled by cloud provider telemetry. • If instrumentation direct to back-end then every application, and language-speciﬁc, needs to: ◦ Authenticate ◦ Batch ◦ Compress ◦ CRUD metadata ◦ Encryption ◦ Proxy ◦ Queue ◦ Retry • Common challenges with instrumenting serverless include cold boot time, resource consumption, and export latency

VPC Function SaaS Back-end On-prem Back-end • With serverless, signals
are typically pushed or handled by cloud provider telemetry. • If serverless instrumentation to collector then every application, and language-speciﬁc, needs to (everything else optional): ◦ Batch • In addition, export latency can be reduced using a Collector Collector traces metrics logs traces, metrics, logs

Summary: Exporting Serverless Instrumentation • Severless instrumentation direct to back-end
◦ Instrumentation responsible for full exporting lifecycle; requires even more resources ◦ Instrumentation export features are language-speciﬁc; mileage may vary ◦ Instrumentation and export latency may prove problematic ◦ Summary: generally recommended

Summary: Exporting Serverless Instrumentation • Severless instrumentation direct to back-end
◦ Instrumentation responsible for full exporting lifecycle; requires even more resources ◦ Instrumentation export features are language-speciﬁc; mileage may vary ◦ Instrumentation and export latency may prove problematic ◦ Summary: generally recommended • Serverless instrumentation to remote Collector to back-end ◦ Instrumentation responsible for minimal exporting lifecycle ◦ Central location for most conﬁguration changes: Collector ◦ Summary: optional, but generally recommended

How about RUM?

Browser traces metrics logs SaaS Back-end • With RUM, signals
must be pushed. • Browser instrumentation needs to: ◦ Authenticate (publicly!) ◦ Batch ◦ Compress ◦ Encrypt ◦ Queue ◦ Retry Hosted traces metrics logs • RUM data and APM data can be sent separately (stitching via context propagation)

Browser SaaS Back-end • With RUM, signals must be pushed.
• Browser instrumentation needs to: ◦ Authenticate (publicly!) ◦ Batch ◦ Compress ◦ Encrypt ◦ Queue ◦ Retry • Also possible to centralize data collection Hosted traces metrics logs traces, metrics, logs On-prem Back-end

Summary: Exporting RUM Instrumentation • RUM instrumentation direct to back-end
◦ Instrumentation responsible for full exporting lifecycle; authentication often diﬀerent! ◦ Summary: generally recommended

Summary: Exporting RUM Instrumentation • RUM instrumentation direct to back-end
◦ Instrumentation responsible for full exporting lifecycle; authentication often diﬀerent! ◦ Summary: generally recommended • RUM instrumentation to remote Collector to back-end ◦ Instrumentation responsible for full exporting lifecycle; authentication often diﬀerent! ◦ Collector should not be exposed to the Internet today! ◦ Summary: possible, but not recommended at this time

What else do I need to know?

OTel Me More! • Batching really matters so be sure
to enable it ◦ https:/ /doordash.engineering/2021/04/07/optimizing-opentelemetrys-span-processor/

to enable it ◦ https:/ /doordash.engineering/2021/04/07/optimizing-opentelemetrys-span-processor/ • Compression really matters so be sure to enable it (~6x typical)

DEMO! https:/ /gist.github.com/ﬂands/e2653e054a1ae7347a1fc68778ccbea4

to enable it ◦ https:/ /doordash.engineering/2021/04/07/optimizing-opentelemetrys-span-processor/ • Compression really matters so be sure to enable it (~6x typical) • Queuing is pay to play (more resources = more data) ◦ Instrumentation libraries: typically very small ◦ Collector as agent: moderate ◦ Collector as gateway: sky’s the limit

to enable it ◦ https:/ /doordash.engineering/2021/04/07/optimizing-opentelemetrys-span-processor/ • Compression really matters so be sure to enable it (~6x typical) • Queuing is pay to play (more resources = more data) ◦ Instrumentation libraries: typically very small ◦ Collector as agent: moderate ◦ Collector as gateway: sky’s the limit • Collector supports a wide range of use-cases ◦ CRUD metadata is very powerful ◦ Oﬀers a lot of exporters

Collector Exporters... • alibabacloudlogserviceexporter • awsemfexporter • awskinesisexporter • awsprometheusremotewriteexporter
• awsxrayexporter • azuremonitorexporter • carbonexporter • datadogexporter • dynatraceexporter • elasticexporter • elasticsearchexporter • f5cloudexporter • ﬁleexporter • googlecloudexporter • honeycombexporter • humioexporter • jaegerexporter • jaegerthrifthttpexporter • kafkaexporter • loadbalancingexporter • loggingexporter • logzioexporter • lokiexporter • newrelicexporter • opencensusexporter • otlpexporter • otlphttpexporter • prometheusexporter • prometheusremotewriteexporter • sapmexporter • sentryexporter • signalfxexporter • splunkhecexporter • stackdriverexporter • sumologicexporter • uptraceexporter • zipkinexporter

Collector Exporters… Open-Source • alibabacloudlogserviceexporter • awsemfexporter • awskinesisexporter •
awsprometheusremotewriteexporter • awsxrayexporter • azuremonitorexporter • carbonexporter • datadogexporter • dynatraceexporter • elasticexporter • elasticsearchexporter • f5cloudexporter • ﬁleexporter • googlecloudexporter • honeycombexporter • humioexporter • jaegerexporter • jaegerthrifthttpexporter • kafkaexporter • loadbalancingexporter • loggingexporter • logzioexporter • lokiexporter • newrelicexporter • opencensusexporter • otlpexporter • otlphttpexporter • prometheusexporter • prometheusremotewriteexporter • sapmexporter • sentryexporter • signalfxexporter • splunkhecexporter • stackdriverexporter • sumologicexporter • uptraceexporter • zipkinexporter

Collector Exporters… Commercial • alibabacloudlogserviceexporter • awsemfexporter • awskinesisexporter •

Collector Exporters… Traces • alibabacloudlogserviceexporter • awsemfexporter • awskinesisexporter •

Collector Exporters… Metrics • alibabacloudlogserviceexporter • awsemfexporter • awskinesisexporter •

Collector Exporters… Logs • alibabacloudlogserviceexporter • awsemfexporter • awskinesisexporter •

OTel Me More! • Choose the pieces you need ◦
Already have instrumentation? Jaeger? Prometheus? Use it! ◦ Only want to use OpenTelemetry for tracing? That’s ﬁne! ◦ Interested in semantic conventions? OTel has you covered! ◦ Looking for a uniﬁed and vendor-agnostic collector? Check! ◦ Need serverless or RUM support? No problem!

OTel Me More! • Choose the pieces you need ◦
Already have instrumentation? Jaeger? Prometheus? Use it! ◦ Only want to use OpenTelemetry for tracing? That’s ﬁne! ◦ Interested in semantic conventions? OTel has you covered! ◦ Looking for a uniﬁed and vendor-agnostic collector? Check! ◦ Need serverless or RUM support? No problem! • Reference architecture is OTLP all the way! ◦ Instrumentation library to Collector as agent? OTLP ◦ Collector as agent to Collector as gateway (optional)? OTLP ◦ Sending to back-end? Ideally OTLP (limited support today https:/ /opentelemetry.io/vendors/, but expect more soon!)

OpenTelemetry Reference Architecture Serverless RUM

OTel Me More! • Open-source projects are starting to export
OpenTelemetry! ◦ Ambassador: https:/ /app.getambassador.io/initializer/ ◦ Apache Camel: https:/ /camel.apache.org/components/3.7.x/others/opentelemetry.html ◦ Clickhouse: https:/ /clickhouse.tech/docs/en/operations/opentelemetry/ ◦ Dapr: https:/ /docs.dapr.io/operations/monitoring/tracing/open-telemetry-collector/ ◦ Envoy: https:/ /blog.envoyproxy.io/envoy-support-for-opentelemetry-access-logging-e4b08160d32c ◦ Jaeger: https:/ /medium.com/jaegertracing/jaeger-embraces-opentelemetry-collector-90a545cbc24 ◦ Jenkins: https:/ /plugins.jenkins.io/opentelemetry/ ◦ Open Distro for Elasticsearch: https:/ /opendistro.github.io/for-elasticsearch/blog/releases/2020/12/announcing-trace-analytics/ ◦ Spring Sleuth: https:/ /github.com/spring-cloud-incubator/spring-cloud-sleuth-otel

Join us!

45 Next Steps • Join a SIG: https:/ /github.com/open-telemetry/community#special-interest-groups •
Join the conversation: ◦ Each SIG leverages GitHub discussions ◦ CNCF Slack: https:/ /cloud-native.slack.com • Submit a PR (consider good-ﬁrst-issue and help-wanted labels)

Thank You!

O11yfest 2021: Exporting Data with OpenTelemetr...

O11yfest 2021: Exporting Data with OpenTelemetry by Steve Flanders

Other Decks in Technology

Featured

Transcript