Tracing Production Services at Stripe

Tracing Production Services at Stripe Aditya Mukerjee Systems Engineer at
Stripe @chimeracoder

Tracing is about more than HTTP requests @chimeracoder

https://veneur.org

@chimeracoder

It’s 3:07 AM @chimeracoder

Dashboard Count: 1 @chimeracoder

@chimeracoder

If you need to look at logs, there’s a gap
in your observability tools @chimeracoder

Metrics/dashboards? Logs? Request traces? No context! Hard to aggregate! Require
planning! @chimeracoder

Monitoring information is only as good as developers’ ability to
predict the future @chimeracoder

@chimeracoder

@chimeracoder Application

What’s the difference? •If you squint, it’s hard to tell
them apart •A log is a metric with “longer” information •A trace is a metric that allows “inner joins” @chimeracoder

What if we could have all three, all the time?
@chimeracoder

Standard Sensor Format @chimeracoder

@chimeracoder

@chimeracoder Application

Tradeoffs: Stacking the Deck @chimeracoder

Distributed Collection @chimeracoder host1 host2 host3 Dashboard Tool

Aggregation @chimeracoder host1 host2 host3 Global Aggregator Dashboard Tool

Distributed Aggregation @chimeracoder host1 host2 host3 Dashboard Tool

Stacking the Deck Histogram: t-digests @chimeracoder

Let’s build the world we want to see @chimeracoder

It’s 3:07 AM @chimeracoder

@chimeracoder

Veneur in 2017 •High availability •Host-local metrics •Global aggregate metrics
•Probabilistic data structures •… and more! Veneur in 2018 •Automatic cardinality detection •Cross-dashboard integration •Unified client instrumentation •… help us decide the rest! @chimeracoder

Thank you! https://veneur.org #veneur on Freenode Aditya Mukerjee @chimeracoder

Tracing Production Services at Stripe

Tracing Production Services at Stripe

More Decks by Aditya Mukerjee

Other Decks in Technology

Featured

Transcript