Unifying Your Observability Pipeline

“If a microservice falls down in the middle of a server farm, does my pager make a sound?”

If your service is monitored automatically, the answer is “yes!” But now that you’ve been paged and roused from your slumber… what happens next? Do you stumble to your computer, bleary-eyed, hunting for the elusive problem by cross-referencing dashboards and server logs across eleven different browser tabs? Or do you have better tools for integrating your monitoring data?

Unless you’re an engineer at a massive company like Google, the answer is probably “no”: most companies don’t have the resources to build all their monitoring tools in-house. And when you assemble your monitoring stack from third-party and open-source tools, there will always be gaps between them.

Fortunately, there’s a way teams can get the best of both worlds: high-resolution visibility into your systems, without having to write your entire monitoring stack yourselves. At Stripe, we built a custom, open-source distributed tracing and monitoring pipeline that lets us inspect each step of an HTTP request and diagnose the root causes of errors, while still leveraging the open-source and third-party monitoring platforms you’re already used to. And with a monitoring pipeline that unifies metrics, logs, and traces, you can live the observability dream: the right data, in the right form, right when you need it.
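To make that concrete, here is a minimal sketch of what feeding such a pipeline looks like from application code: a statsd-style counter sent over UDP to a local agent. The address and port are assumptions (match them to your veneur.yaml), and in practice you would use a client library rather than raw UDP.

```go
package main

import (
	"fmt"
	"net"
)

func main() {
	// A local Veneur instance listening for statsd traffic; the
	// address and port are assumptions. Check your veneur.yaml.
	conn, err := net.Dial("udp", "127.0.0.1:8126")
	if err != nil {
		panic(err)
	}
	defer conn.Close()

	// statsd wire format: <name>:<value>|<type>|#<tags>
	// (tags are the DogStatsD extension, which Veneur understands)
	fmt.Fprintf(conn, "api.requests:1|c|#endpoint:charge,status:200")
}
```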

Aditya Mukerjee

July 13, 2018

Transcript

  1. Unifying Your Observability Pipeline Aditya Mukerjee Systems Engineer at Stripe @chimeracoder

  2. @chimeracoder

  3. Why are we here? @chimeracoder

  4. It’s 3:07 AM @chimeracoder

  5. Dashboard Count: 1 @chimeracoder

  6. Dashboard Count: 2 @chimeracoder

  7. Dashboard Count: 3 @chimeracoder

  8. @chimeracoder

  9. Dashboard Count: 4 @chimeracoder

  10. @chimeracoder

  11. Dashboard Count: 5 @chimeracoder

  12. What tools can we use? Metrics/dashboards? No context! Logs? Hard to aggregate! Request traces? Require planning! @chimeracoder

  13. Monitoring information is only as good as developers’ ability to predict the future @chimeracoder
  14. @chimeracoder

  15. @chimeracoder

  16. @chimeracoder

  17. @chimeracoder Application

  18. What’s the difference? •If you squint, it’s hard to tell them apart •A log is a metric with “longer” information •A trace is a metric that allows “inner joins” @chimeracoder

  19. What if we could have all three, all the time? @chimeracoder
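One way to read slide 18: metrics, logs, and traces are three projections of the same underlying event. A hypothetical sketch (the Event type and its fields are illustrative, not Veneur’s actual API):

```go
package main

import (
	"fmt"
	"time"
)

// Event is a hypothetical unified record; the fields are illustrative.
type Event struct {
	Name     string  // a metric is the name, value, and tags
	Value    float64
	Tags     map[string]string
	Message  string  // a log adds "longer" free-form information
	TraceID  int64   // a trace adds IDs you can "inner join" on
	ParentID int64
	Time     time.Time
}

func main() {
	e := Event{
		Name:     "api.request.duration_ms",
		Value:    42,
		Tags:     map[string]string{"endpoint": "charge"},
		Message:  "POST /v1/charges returned 200 in 42ms",
		TraceID:  12345,
		ParentID: 12340,
		Time:     time.Now(),
	}

	// The same event, viewed three ways:
	fmt.Printf("metric: %s=%v %v\n", e.Name, e.Value, e.Tags)
	fmt.Printf("log:    %s %s\n", e.Time.Format(time.RFC3339), e.Message)
	fmt.Printf("trace:  span %d (parent %d)\n", e.TraceID, e.ParentID)
}
```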
  20. Standard Sensor Format @chimeracoder
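The actual Standard Sensor Format is a protobuf schema that ships with Veneur (github.com/stripe/veneur/ssf). Roughly, and with approximate field names, an SSF span carries metrics, log-style messages, and trace identity in a single record:

```go
package main

// SSFSample and SSFSpan sketched as Go structs for illustration; the
// real definitions are protobufs in github.com/stripe/veneur/ssf, and
// these field names are approximate.
type SSFSample struct {
	Name    string
	Value   float64
	Message string // log-style payload rides along with the sample
	Tags    map[string]string
}

type SSFSpan struct {
	Id             int64
	TraceId        int64
	ParentId       int64
	StartTimestamp int64 // nanoseconds since the epoch
	EndTimestamp   int64
	Service        string
	Tags           map[string]string
	Metrics        []*SSFSample // metrics, logs, and the span travel together
}
```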

  21. @chimeracoder

  22. @chimeracoder

  23. @chimeracoder Application

  24. Integrated Views @chimeracoder

  25. @chimeracoder

  26. @chimeracoder

  27. @chimeracoder

  28. Tradeoffs: Stacking the Deck @chimeracoder

  29. Distributed Collection @chimeracoder host1 host2 host3 Dashboard Tool

  30. Aggregation @chimeracoder host1 host2 host3 Global Aggregator Dashboard Tool

  31. Distributed Aggregation @chimeracoder host1 host2 host3 Dashboard Tool

  32. Stacking the Deck Histogram: t-digests @chimeracoder
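Why t-digests? Percentiles don’t compose: you can’t average per-host p99s and get the global p99. A toy demonstration using raw samples (a real t-digest is a compact, mergeable approximation of exactly this merge step):

```go
package main

import (
	"fmt"
	"sort"
)

// p99 returns the (nearest-rank) 99th percentile of a sample set.
func p99(samples []float64) float64 {
	sorted := append([]float64(nil), samples...)
	sort.Float64s(sorted)
	return sorted[int(0.99*float64(len(sorted)-1))]
}

func main() {
	// host1 serves fast requests; host2 serves the slow ones.
	host1 := []float64{1, 2, 3, 4, 5, 6, 7, 8, 9, 10}
	host2 := []float64{100, 200, 300, 400, 500, 600, 700, 800, 900, 1000}

	// Naive aggregation: average the per-host percentiles.
	naive := (p99(host1) + p99(host2)) / 2

	// Correct aggregation: merge the samples, then take the percentile.
	merged := append(append([]float64(nil), host1...), host2...)

	fmt.Printf("average of per-host p99s: %v\n", naive)       // 454.5: wrong
	fmt.Printf("p99 of merged samples:    %v\n", p99(merged)) // 900
}
```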

  33. @chimeracoder

  34. @chimeracoder

  35. @chimeracoder

  36. Trying out Veneur •Free and open source! http://github.com/stripe/veneur •Six-week release cycle •Drop-in support for statsd, Graphite, Datadog, SignalFx, Prometheus, and more •Native Kubernetes support •Public images on Docker Hub @chimeracoder
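A sketch of the “drop-in” claim: an existing DogStatsD client can simply be pointed at the local Veneur. The import path reflects Datadog’s open-source Go client, and the address is an assumption; adjust both to your setup.

```go
package main

import (
	"log"

	"github.com/DataDog/datadog-go/statsd"
)

func main() {
	// Point the existing Datadog statsd client at Veneur instead of
	// a Datadog agent; the address is an assumption, match veneur.yaml.
	client, err := statsd.New("127.0.0.1:8126")
	if err != nil {
		log.Fatal(err)
	}
	defer client.Close()

	// Counters, gauges, timers, etc. flow through Veneur unchanged.
	if err := client.Incr("deploys.count", []string{"service:api"}, 1); err != nil {
		log.Fatal(err)
	}
}
```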
  37. Let’s build the world we want to see @chimeracoder

  38. Thank you! https://github.com/stripe/veneur #veneur on Freenode Aditya Mukerjee @chimeracoder

  39. References @chimeracoder