Observing Python applications with OpenTelemetry

Observing Python applications with OpenTelemetry Riccardo Magliocchetti Incontro DevOps Italia,
20250314

Riccardo Magliocchetti Maintainer OpenTelemetry Python Senior Software Engineer Elastic Observability

Agenda Observing Python applications with OpenTelemetry: • OpenTelemetry Kubernetes Operator

• OpenTelemetry Python with opentelemetry-instrument

• OpenTelemetry Python with opentelemetry-instrument • OpenTelemetry Python programmatic auto-instrumentation

• OpenTelemetry Python with opentelemetry-instrument • OpenTelemetry Python programmatic auto-instrumentation • GenAI observability with OpenTelemetry

Observability and OpenTelemetry

Observability Observability is the ability to understand the internal state
of a system by examining its outputs.

OpenTelemetry An open source observability framework providing specifications and implementations
in order to create and manage telemetry data: • Traces, requests paths

OpenTelemetry An observability framework providing specifications and implementations in order
to create and manage telemetry data: • Traces, requests paths • Metrics, measurements

to create and manage telemetry data: • Traces, requests paths • Metrics, measurements • Logs, time stamped text

to create and manage telemetry data: • Traces, requests paths • Metrics, measurements • Logs, time stamped text • Profiles, profiling

Auto-instrumentation with Opentelemetry Operator for Kubernetes

opentelemetry-operator A Kubernetes operator that manages: • An OpenTelemetry collector
instance

opentelemetry-operator A Kubernetes operator that manages: • An OpenTelemetry collector
instance • A system to inject auto-instrumentation

opentelemetry-operator The managed OpenTelemetry collector instance: • Simplifies management of
secrets

opentelemetry-operator The managed OpenTelemetry collector instance: • Simplifies management of
secrets • Useful for processing, opentelemetry.io/docs/collector/transforming-telemetry/

opentelemetry-operator Recent improvements: • Less configuration variables

opentelemetry-operator Recent improvements: • Less configuration variables • Support for
musl libc based container images

opentelemetry-operator apiVersion: opentelemetry.io/v1alpha1 kind: OpenTelemetryCollector metadata: name: demo spec: config:
| receivers: otlp: protocols: grpc: endpoint: 0.0.0.0:4317 http: endpoint: 0.0.0.0:4318 processors: exporters: debug: service: pipelines: traces: receivers: [otlp] processors: [] exporters: [debug] metrics: receivers: [otlp] processors: [] exporters: [debug] logs: receivers: [otlp] processors: [] exporters: [debug]

opentelemetry-operator apiVersion: opentelemetry.io/v1alpha1 kind: Instrumentation metadata: name: otel-python-instrumentation spec: exporter:
endpoint: http://demo-collector:4318 propagators: - tracecontext - baggage sampler: type: parentbased_traceidratio argument: "1"

opentelemetry-operator apiVersion: apps/v1 kind: Deployment metadata: name: your-app-deployment spec: replicas:
1 selector: matchLabels: app: python-otel-app template: metadata: annotations: instrumentation.opentelemetry.io/inject-python: "otel-python-instrumentation" instrumentation.opentelemetry.io/otel-python-platform: "musl" # default: glibc labels: app: python-otel-app spec: containers: …

opentelemetry-operator: the bad One OpenTelemetry Operator Python auto-instrumentation image: •
Mind the binary wheels

opentelemetry-operator: the bad One OpenTelemetry Operator Python auto-instrumentation image: •
Mind the binary wheels • Python version support of dependencies

opentelemetry-operator apiVersion: opentelemetry.io/v1alpha1 kind: Instrumentation metadata: name: otel-python-instrumentation spec: python:
image: yourimage:1.0 exporter: endpoint: http://demo-collector:4318 propagators: - tracecontext - baggage sampler: type: parentbased_traceidratio argument: "1"

Auto-instrumentation with opentelemetry-instrument

opentelemetry-instrument Wrap your command with opentelemetry-instrument

opentelemetry-instrument FROM python:3.12-slim WORKDIR /app COPY . /app RUN pip
install flask elastic-opentelemetry # Install instrumentations for the installed packages, custom version of opentelemetry-bootstrap RUN edot-bootstrap -a install # default flask run port EXPOSE 5000 # Set some resource attributes to make our service recognizable ENV OTEL_RESOURCE_ATTRIBUTES="service.name=FlaskService,service.version=1.0,deployment.environment=development" CMD ["opentelemetry-instrument", "flask", "run"]

opentelemetry-instrument export OTEL_EXPORTER_OTLP_ENDPOINT=https://my-deployment.apm.us-west1.gcp.cloud.es.io export OTEL_EXPORTER_OTLP_HEADERS="Authorization=Bearer P....l" docker run \ -e
OTEL_EXPORTER_OTLP_ENDPOINT="$OTEL_EXPORTER_OTLP_ENDPOINT" \ -e OTEL_EXPORTER_OTLP_HEADERS="$OTEL_EXPORTER_OTLP_HEADERS" \ -p 5000:5000 -it --rm edot-flask:latest

opentelemetry-instrument For internals see Anatomy of a Python OpenTelemetry Instrumentation

Programmatic auto-instrumentation

Programmatic auto-instrumentation from opentelemetry.instrumentation import auto_instrumentation auto_instrumentation.initialize()

Auto-instrumentation recap

Auto-instrumentation recap Solution Application lines changed Requirements Notes OpenTelemetry Operator
0 Kubernetes May need to bring your own image opentelemetry-instrument 0 Wrap application entry point Programmatic auto-instrumentation 2 Change application code Configuration via environment variables

GenAI observability with OpenTelemetry

GenAI • Writing semantic conventions

GenAI • Writing semantic conventions • Writing instrumentations

GenAI: semantic conventions • Span attributes for chat completions •
Span attributes for embeddings

Span attributes for embeddings • Events for chat: ◦ Various messages roles ◦ Tool calls ◦ Responses

Span attributes for embeddings • Events for chat: ◦ Various messages roles ◦ Tool calls ◦ Responses • Metrics: ◦ Duration of operation ◦ Number of input/output tokens

GenAI Python instrumentations • OpenAI: streaming, sync, async chat completions
◦ Elastic one traces embeddings calls

◦ Elastic one traces embeddings calls • AWS Bedrock: ◦ Converse, ConverseStream ◦ InvokeModel, InvokeModelWithStreamResponse ▪ Amazon Titan and Nova ▪ Anthropic Claude

◦ Elastic one traces embeddings calls • AWS Bedrock: ◦ Converse, ConverseStream ◦ InvokeModel, InvokeModelWithStreamResponse ▪ Amazon Titan and Nova ▪ Anthropic Claude • Google VertexAI

◦ Elastic traces embeddings calls • AWS Bedrock: ◦ Converse, ConverseStream ◦ InvokeModel, InvokeModelWithStreamResponse ▪ Amazon Titan and Nova ▪ Anthropic Claude • Google VertexAI • Google GenAI

GenAI: whatʼs next? • Complete all the instrumentations

GenAI: whatʼs next? • Complete all the instrumentations • OpenLLMetry
instrumentation donation proposal

GenAI: whatʼs next? • Complete all the instrumentations • OpenLLMetry
instrumentation donation proposal • Agents semantic conventions

Conclusions

Conclusions • OpenTelemetry Operator for Kubernetes • opentelemetry-instrument to wrap
your application entry point • Programmatic auto-instrumentation if you can add two lines of code and set some environment variables • Thereʼs working support for GenAI instrumentation

Contacts and references • https://opentelemetry.io • opentelemetry-operator • opentelemetry-python and
opentelemetry-python-contrib • #otel-python on CNCF slack • @rmistaken / @[email protected] • speakerdeck.com/xrmx

Thank you!

Observing Python applications with OpenTelemetry

Observing Python applications with OpenTelemetry

More Decks by Riccardo Magliocchetti

Other Decks in Programming

Featured

Transcript