Unified Observability of Distributed Systems

Unified Observability of Distributed Systems Aditya Mukerjee Systems Engineer at
Stripe @chimeracoder

@chimeracoder

Why are we here? @chimeracoder

It’s 3:07 AM @chimeracoder

Dashboard Count: 1 @chimeracoder

@chimeracoder

What tools can we use? Metrics/dashboards? Logs? Request traces? No
context! Hard to aggregate! Require planning! @chimeracoder

Monitoring information is only as good as developers’ ability to
predict the future @chimeracoder

@chimeracoder

@chimeracoder Application

What’s the difference? •If you squint, it’s hard to tell
them apart •A log is a metric with “longer” information •A trace is a metric that allows “inner joins” @chimeracoder

What if we could have all three, all the time?
@chimeracoder

Standard Sensor Format @chimeracoder

@chimeracoder

@chimeracoder Application

Integrated Views @chimeracoder

@chimeracoder

Flexibility and Data Migrations @chimeracoder

@chimeracoder Application B A C

“Because we’re in control of our pipeline, we could add
a new data backend or migrate vendors without having to touch our application code at all.” “Having ownership over our pipeline gave us trust in our data. It made us confident that we we hadn’t overlooked any parts of the migration process.” @chimeracoder

Tradeoffs: Stacking the Deck @chimeracoder

Distributed Collection @chimeracoder host1 host2 host3 Dashboard Tool

Aggregation @chimeracoder host1 host2 host3 Global Aggregator Dashboard Tool

Distributed Aggregation @chimeracoder host1 host2 host3 Dashboard Tool

Stacking the Deck Histogram: t-digests @chimeracoder

Trying out Veneur •Free and open source! http://github.com/stripe/veneur •Six-week release
cycle • Drop-in support for statsd, Graphite, Datadog, SignalFx, Prometheus, and more •Native Kubernetes support •Public images on Docker Hub @chimeracoder

@chimeracoder

Veneur in 2017 • High availability • Host-local metrics •
Global aggregate metrics • Sketching data structures • … and more! Veneur in 2018 • Automatic cardinality detection • Expanded cross-dashboard integration • Unified client instrumentation • … help us decide the rest! @chimeracoder

Let’s build the world we want to see @chimeracoder

Thank you! https://github.com/stripe/veneur #veneur on Freenode Aditya Mukerjee @chimeracoder @chimeracoder

References @chimeracoder

Unified Observability of Distributed Systems

Unified Observability of Distributed Systems

More Decks by Aditya Mukerjee

Other Decks in Technology

Featured

Transcript