
Measure, don’t guess - Benchmarking stories from the trenches

Mario Fusco
April 16, 2024

How many times have you implemented a clever performance improvement, and maybe put it in production, because it seemed the Right Thing™ to do, without even measuring the actual consequences of your change? And even if you are measuring, are you using the right tools and interpreting the results correctly? In this deep-dive session we will use examples taken from real-world situations to demonstrate how to develop meaningful benchmarks, avoiding the most common, and often subtle, pitfalls, and how to correctly interpret their results and take action to improve them. In particular, we will illustrate how to use JMH for these purposes, explaining why it is the only reliable tool for benchmarking Java applications, and showing what can go horribly wrong if you decide to measure the actual performance of a Java program without it. At the end of this session you will be able to create your own JMH-based benchmarks and, more importantly, to use their results effectively to improve the overall performance of your software.


Transcript

  1. So You Want to Write a (Micro)Benchmark
     1. Read a reputable paper on JVMs and micro-benchmarking.
     2. Always include a warmup phase which runs your test kernel all the way through, enough to trigger all initializations and compilations before timing phase(s) (see the sketch after this list).
     3. Always run with -XX:+PrintCompilation, -verbose:gc, etc., so you can verify that the compiler and other parts of the JVM are not doing unexpected work during your timing phase.
        a. Print messages at the beginning and end of timing and warmup phases, so you can verify that there is no output during the timing phase.
     4. Be aware of the difference between -client and -server, and OSR and regular compilations. Also be aware of the effects of -XX:+TieredCompilation, which mixes client and server modes together.
     5. Be aware of initialization effects. Do not print for the first time during your timing phase, since printing loads and initializes classes. Do not load new classes outside of the warmup/reporting phase, unless you are testing class loading.
     6. Be aware of deoptimization and recompilation effects.
     7. Use appropriate tools to read the compiler's mind, and expect to be surprised by the code it produces. Inspect the code yourself before forming theories about what makes something faster or slower.
     8. Reduce noise in your measurements. Run your benchmark on a quiet machine, and run it several times, discarding outliers.
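
     To make points 2 and 3 concrete, here is a minimal hand-rolled sketch, assuming a trivial, made-up testKernel and arbitrary iteration counts (JMH automates all of this, and much more, for you):

       // Run with: java -XX:+PrintCompilation -verbose:gc NaiveBenchmark
       public class NaiveBenchmark {

           // Hypothetical test kernel, for illustration only
           static long testKernel() {
               long sum = 0;
               for (int i = 0; i < 1_000_000; i++) {
                   sum += i;
               }
               return sum;
           }

           public static void main(String[] args) {
               long sink = 0;

               System.out.println("Warmup phase start");
               for (int i = 0; i < 10_000; i++) {   // enough full runs to trigger all initializations and compilations
                   sink += testKernel();
               }
               System.out.println("Warmup phase end, timing phase start");

               // No compilation or GC output should appear between these two markers
               long start = System.nanoTime();
               for (int i = 0; i < 10_000; i++) {
                   sink += testKernel();
               }
               long elapsed = System.nanoTime() - start;

               System.out.println("Timing phase end: " + (elapsed / 10_000) + " ns/op (sink=" + sink + ")");
           }
       }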
  2. In Java-ish...
     The optimized version executes the load of the field just once for each test and (incredibly) gets the same results too!
     * The actual optimization depends on the JVM version
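
     A sketch of the kind of transformation meant here, with hypothetical code (whether and exactly how the JIT applies it depends on the JVM version): a non-volatile field read inside a loop may be hoisted, so the field is loaded only once.

       class FieldHoisting {
           boolean done;   // note: NOT volatile

           // What the source code says: re-read the field on every iteration
           long whatYouWrote() {
               long count = 0;
               while (!done) {
                   count++;
               }
               return count;
           }

           // What the JIT may effectively run: the field is loaded just once,
           // so a concurrent write to `done` is never observed by this loop
           long whatMayActuallyRun() {
               long count = 0;
               boolean localDone = done;   // single load, hoisted out of the loop
               while (!localDone) {
                   count++;
               }
               return count;
           }
       }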
  3. USE JMH USE JMH USE JMH
     “A badly written benchmark can lead you to wrong conclusions that will make you focus on useless optimizations, confusing yourself and wasting others’ time” - An anonymous performance engineer -
     * Effects of a poorly written benchmark
  4. A bad benchmark (and its meaningless results) also misleads others
     How many times has a badly written blog post pushed developers to adopt bad practices? 😢
  5. JMH TLDR
     “JMH is a Java harness for building, running, and analysing nano/micro/milli/macro benchmarks written in Java and other languages targeting the JVM.” - OpenJDK Code Tools -
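
     A minimal JMH benchmark sketch (class and method names are made up; the JMH Samples linked in the references are the canonical starting point):

       import org.openjdk.jmh.annotations.Benchmark;
       import org.openjdk.jmh.runner.Runner;
       import org.openjdk.jmh.runner.options.Options;
       import org.openjdk.jmh.runner.options.OptionsBuilder;

       public class MyFirstBenchmark {

           @Benchmark
           public double measureLog() {
               // JMH repeatedly invokes this method and reports the score;
               // returning the value prevents dead-code elimination
               return Math.log(42.0);
           }

           public static void main(String[] args) throws Exception {
               Options opt = new OptionsBuilder()
                       .include(MyFirstBenchmark.class.getSimpleName())
                       .forks(1)
                       .warmupIterations(5)
                       .measurementIterations(5)
                       .build();
               new Runner(opt).run();
           }
       }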
  6. Test Harness
     “Is a collection of software and test data configured to test a program unit by running it under varying conditions and monitoring its behavior and outputs. ... The typical objectives of a test harness are to:
     • Automate the testing process.
     • Execute test suites of test cases.
     • Generate associated test reports.” - Wikipedia: Test Harness -
  7. Under the hood
     • Method under benchmark
     • nanoTime() is a costly operation, called only once
     • isDone is a volatile variable set by a timer
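
     A plain-Java sketch of what that generated measurement loop looks like (simplified; the code JMH actually generates is more elaborate):

       class MeasurementLoop {
           static volatile boolean isDone;   // flipped by a timer thread when the measurement interval expires

           static void measure(Runnable methodUnderBenchmark) {
               long operations = 0;
               long start = System.nanoTime();   // costly call, made only once, before the loop
               do {
                   methodUnderBenchmark.run();   // the method under benchmark
                   operations++;
               } while (!isDone);                // cheap volatile read on every iteration
               long stop = System.nanoTime();    // and only once again, after the loop
               System.out.println(operations + " ops in " + (stop - start) + " ns");
           }
       }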
  8. Purpose is everything
     “Benchmark numbers don’t matter on their own. It’s important what models you derive from those numbers.”
  9. Making sense of data: Active vs. passive benchmarking
     • Passive Benchmarking
       ◦ Benchmarks are commonly executed and then ignored until they have completed. That is passive benchmarking, where the main objective is the collection of benchmark data. Data is not Information.
     • Active Benchmarking
       ◦ With active benchmarking, you analyze performance while the benchmark is still running (not just after it's done), using other tools. You can confirm that the benchmark tests what you intend it to, and that you understand what that is. Data becomes Information. This can also identify the true limiters of the system under test, or of the benchmark itself.
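
     JMH supports active benchmarking by letting you attach profilers to a run while it executes; a sketch using the built-in GC profiler (the included name refers to the hypothetical MyFirstBenchmark sketch above; on the command line the equivalent is -prof gc):

       import org.openjdk.jmh.profile.GCProfiler;
       import org.openjdk.jmh.runner.Runner;
       import org.openjdk.jmh.runner.options.Options;
       import org.openjdk.jmh.runner.options.OptionsBuilder;

       public class ActiveRun {
           public static void main(String[] args) throws Exception {
               Options opt = new OptionsBuilder()
                       .include("MyFirstBenchmark")      // hypothetical benchmark class from the earlier sketch
                       .addProfiler(GCProfiler.class)    // reports allocation rate and GC activity alongside the scores
                       .build();
               new Runner(opt).run();
           }
       }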
  10. To recap
     Benchmarks are experiments intended to reproduce, in a controlled environment, exactly the same behaviour that you would otherwise experience in the wild.
  11. To recap (yes, I should have told you before 😛)
     Software Engineer
     • Mostly doesn’t care about underlying hardware and data specifics
     • Works based on abstract principles, actual formal science
     • Cares about writing beautiful, readable, composable, reusable … code
     Software Performance Engineer
     • Explores complex interactions between hardware, software, and data
     • Works based on empirical evidence, more similar to natural science
     • Sacrifices all good software principles to squeeze the last microsecond
  12. References
     • Code examples - https://github.com/mariofusco/jmh-playground
     • So You Want to Write a Micro-Benchmark - https://wiki.openjdk.org/display/HotSpot/MicroBenchmarks
     • Active Benchmarking - https://www.brendangregg.com/activebenchmarking.html
     • JMH - https://github.com/openjdk/jmh
     • JMH Samples - https://github.com/openjdk/jmh/tree/master/jmh-samples/src/main/java/org/openjdk/jmh/samples
     • VM Options Explorer - https://chriswhocodes.com/
     • HotSpot disassembly plugin - https://chriswhocodes.com/hsdis/
     • Environment OS Tuning - https://github.com/ionutbalosin/jvm-performance-benchmarks?tab=readme-ov-file#os-tuning
     • JMH Visualizer - https://jmh.morethan.io/
     • Mastering the mechanics of Java method invocation - https://blogs.oracle.com/javamagazine/post/mastering-the-mechanics-of-java-method-invocation
     • What’s Wrong With My Benchmark Results? Studying Bad Practices in JMH Benchmarks