Tracing and Profiling Java (and Native) Applications in Production

Talk given at OSCON 2014: http://www.oscon.com/oscon2014/public/schedule/detail/34094

The ability to understand the behavior of a software system well enough to answer questions about its health, while important, has always been a challenge for software developers. System tools and language debuggers and profilers tend to be myopic in scope and cumbersome to understand, set up, and use, all the more so when applied to a distributed system. In particular, requiring recompilation of software with additional instrumentation, imposing a non-trivial performance overhead, and requiring an elaborate setup all render such tools unfit for use in production.

This talk describes a new, low-overhead, full-stack tool (based on the Linux perf profiler and infrastructure built into the Hotspot JVM) that we’ve built at Twitter to help solve this problem of dynamically profiling and tracing the behavior of the kernel and applications (including managed runtimes like the JVM) in production.

Kaushik Srenevasan

July 22, 2014
Transcript

  1. Tracing & profiling services in production. Kaushik Srenevasan (kaushik@twitter.com, @ksrenev). Monday, July 28, 14
  2. Who am I? • Current (at Twitter): VM and Diagnostics: Ruby (Kiji), Hotspot JVM, Scala • Past (at Microsoft): authored the 64-bit optimizing compiler in the Chakra JavaScript runtime; Common Language Runtime (CLR) performance
  3. Twitter.com from ten thousand feet • Service Oriented Architecture • Platform: CentOS Linux, OpenJDK JVM • Languages: Java/Scala, C/C++, Ruby (Kiji) and Python
  4. Data store

  5. JVM @ Twitter • Customized OpenJDK distribution • Dedicated team to support and maintain releases • Regular internal release cycle • Ship JDK 7(u) (now) and 8 (future) • Bundle useful tools / JVMTI agents • Twitter University talk: Twitter scale computing with the OpenJDK
  6. JVM @ Twitter • Why do we exist? • Low latency garbage collection on dedicated hardware and Mesos • Scala-specific optimizations • Tools • Contrail • The Twitter Diagnostics Runtime
  7. Observability vs Diagnostics

  8. Diagnostics

  9. Diagnostics in production • Global • Performant • Dynamic
  10. State of the art • Global, dynamic, arbitrary context kernel and user mode instrumentation • An extremely low overhead, scalable mechanism for aggregating event data • The ability to execute arbitrary user actions when events occur
  11. Guiding principles • Twitter owns the entire stack • Integrate well with standard platform tools • Do not reinvent the wheel!
  12. perf • Linux profiler • Ships in the kernel tree • Abstraction over the CPU’s performance counters
  13. Why perf? • Simple • No setup required • Lightweight • Powerful
  14. Why perf? [overhead comparison chart: Benchmark (baseline), Sampling (perf), Sampling (perf, Yourkit)]
  15. Why perf? [overhead comparison chart: Benchmark (baseline), Bytecode instrumentation (Heapster), Tracing (Yourkit, JVM SystemTap), Sampling (perf), Sampling (perf, Yourkit)]
  16. Why perf? • Powerful • Mixed mode stacks • CPU, performance counters (cache, branch, etc.), scheduler latencies ... • Spawn, attach and “top” modes
  17. perf for Managed Code • Traditional managed code (Java) profilers • ThreadMXBean.getThreadInfo • JVMTI: GetAllStackTraces • Undocumented AsyncGetCallTrace • Our approach: make Java look like native code
  18.

  19. Demo I: perf and tooling

  20. Tracing • Scope • System wide • Process specific • Application specific? • Export richer, context specific data • Unified event bus
  21. Tracing in Linux • Function tracing • Tracepoint support • kprobes • uprobes • Covers NFS, RPC, filesystem, devices, network, power, kernel, virtualization etc.
  22. Uprobes • Extension of the kprobes infrastructure to support user mode tracepoints • Support for predicates • No support for arbitrary user actions (unlike DTrace) • No support for managed code
  23. Tracing in native code • Use the SystemTap probe format • Large number of pre-existing probes • Source level compatibility with DTrace probes • Add support in perf to understand SystemTap probe definitions
  24. Tracing in managed code • VM level tracing • Existing support for DTrace probes • Very heavyweight (not sampled) • Java level tracing
  25. Demo II: Tracing

  26.

  27. Open sourcing ... • Understand user interest • Upstream vs publish on GitHub • Please get in touch
  28. Questions?