Real world HTTP benchmarking, Lessons learned

Julien Viet
November 07, 2019

The TechEmpower Framework Benchmark is a public comparison of more than 200 web frameworks in different languages. The competition is fierce, and everyone wants to be ranked at the top!

Eclipse Vert.x is a popular reactive stack for the JVM, designed for highly scalable applications, and it has taken part in this competition for several years.

Performance benchmarks are often used to compare HTTP servers and web frameworks, and people often rely on them to choose between implementations. We will look at what these benchmarks mean and what they actually measure.

The presentation explains the secret sauce powering Vert.x performance that has a direct impact on this benchmark, from the Java just-in-time compiler to networking optimisations.

Transcript

  1. Real world HTTP benchmarking
    Lessons learned
    Julien Viet

  2. Julien Viet
    Open source developer for 16+ years
    @vertx_project lead
    Principal software engineer at
    Marseille JUG Leader
    Web: https://www.julienviet.com/
    GitHub: http://github.com/vietj
    Twitter: @julienviet
    Mixcloud: https://www.mixcloud.com/cooperdbi/

  3. ✓ 464 frameworks / 26 languages
    ✓ 5 tests
    ✓ Strict requirements
    ✓ Physical server or cloud
    ✓ Continuous benchmarking
    Framework Benchmark
    https://dzone.com/articles/five-facts-you-might-not-know-about-techempower-fr

  4. 10K ★ on GitHub
    Powered by
    Web: https://vertx.io
    Twitter: @vertx_project
    A toolkit for building reactive
    applications in Java

  5. 5 free ebook
    codes to win NOW!!!
    Tweet and mention
    @vertx_project
    Get 50% off
    with tsvertx

  6. Round #8 (2013)

  7. Round #14 (2017)

  8. ✓ Benchmarking is not a
    simulation

  9. ✓ Benchmarking is not a
    simulation
    ✓ Measure, don't guess

  10. ✓ Benchmarking is not a
    simulation
    ✓ Measure, don't guess
    ✓ Use a baseline

    View full-size slide

  11. ✓ Benchmarking is not a
    simulation
    ✓ Measure, don't guess
    ✓ Use a baseline
    ✓ Define expectations

  12. Tools of the trade
    async-profiler
    perf/dtrace
    JVM logs
    Flame Graphs
    jitwatch

  13. Plaintext
    benchmark

  14. Plaintext benchmark
    ✓ Synchronous static Hello World response
    ✓ HTTP pipelining: 16
    ✓ best of (256,...,16384) connections
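
    For reference, a minimal sketch of a Vert.x plaintext endpoint of the kind this test exercises; it is illustrative only (port and response text assumed), not the actual TFB submission:

    import io.vertx.core.Vertx;

    public class PlaintextServer {
      public static void main(String[] args) {
        Vertx vertx = Vertx.vertx();
        vertx.createHttpServer()
          .requestHandler(req -> req.response()
            .putHeader("Content-Type", "text/plain")
            .end("Hello, World!"))
          .listen(8080);
      }
    }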

  15. Batch flushes appropriately
    ○ to amortise costs
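
    A hedged Netty-style sketch of this idea: queue each pipelined response with write() and flush once per read batch in channelReadComplete(), instead of calling writeAndFlush() per response. The handler and the handleRequest() helper are illustrative, not Vert.x internals:

    import io.netty.channel.ChannelHandlerContext;
    import io.netty.channel.ChannelInboundHandlerAdapter;

    public class BatchingHandler extends ChannelInboundHandlerAdapter {

      @Override
      public void channelRead(ChannelHandlerContext ctx, Object msg) {
        Object response = handleRequest(msg);
        // Queue the response in the outbound buffer, no flush (and no syscall) yet
        ctx.write(response);
      }

      @Override
      public void channelReadComplete(ChannelHandlerContext ctx) {
        // Flush once for everything queued during this read batch
        ctx.flush();
      }

      private Object handleRequest(Object msg) {
        // Placeholder for request decoding and response building
        return msg;
      }
    }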

  16. keep-alive vs. pipelining
    [Diagram: with keep-alive, the client sends a GET and waits for its OK before sending the next one; with pipelining, several GETs are sent back to back and the OKs come back afterwards]

  17. Default pipelining throughput (requests/second)
    ✓ Pipelining 1: 59,782
    ✓ Pipelining 2: 74,195
    ✓ Pipelining 4: 79,037
    ✓ Pipelining 8: 82,122

  18. immediate flush vs. batched flushes
    [Diagram: with an immediate flush, every OK is written and flushed on its own; with batched flushes, the OKs for a batch of pipelined GETs are written together and flushed once]

  19. Optimised pipelining throughput (requests/second, default → batched flushes)
    ✓ Pipelining 1: 59,782 → 57,123
    ✓ Pipelining 2: 74,195 → 97,329
    ✓ Pipelining 4: 79,037 → 217,110
    ✓ Pipelining 8: 82,122 → 293,381

  20. Keep your methods small
    ○ to ease method inlining

  21. 3% penalty?
    ad591ec985bcc9f99be173c2ce1c18e350a662f2

  22. Just In Time compilation
    ✓ Translate Java bytecode to native code
    ✓ Optimise only for the hot path
    ✓ Kinds of optimisations: method inlining, loop hoisting,
    dead code elimination, etc...

  23. [Diagram: process error]

  24. [Diagram: process request, process error]

  25. [Diagram: process request, process body, process error]

  26. handleError(request)

  27. handleContent((HttpContent) msg)
    handleError(request)

  28. reduce method size
    to favour inlining
    handleContent((HttpContent) msg)
    handleError(request)

  29. inlining manually
    handleContent((HttpContent) msg)
    handleError(request)
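
    A hedged sketch of the pattern behind slides 26-29: keep the hot path short and push the rare error branch into its own method, so the hot method stays under the JIT's inlining thresholds. The names are illustrative, not actual Vert.x code:

    import io.netty.handler.codec.http.HttpContent;

    class MessageHandler {

      // Hot path kept small so the JIT can inline it at the call site
      void handleMessage(Object msg) {
        if (msg instanceof HttpContent) {
          handleContent((HttpContent) msg);
        } else {
          // Rare branch lives in its own method, so its bytecode
          // no longer counts against the size of the hot method
          handleError(msg);
        }
      }

      private void handleContent(HttpContent content) {
        // ... process the body ...
      }

      private void handleError(Object msg) {
        // ... slow path: build and report the error ...
      }
    }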

  30. Avoid unnecessary allocation
    ○ to reduce GC pressure

  31. class VertxHandler extends ChannelDuplexHandler {
      ...
      // Process Netty's messages
      void channelRead(ChannelHandlerContext ctx, Object msg) {
        context.executeFromIO(() -> {
          conn.startRead();
          handleMessage(conn, msg);
        });
      }
    }
    interface Context {
      ...
      void executeFromIO(Runnable handler);
      ...
    }

  32. class VertxHandler extends ChannelDuplexHandler {
      ...
      // Process Netty's messages
      void channelRead(ChannelHandlerContext ctx, Object msg) {
        context.executeFromIO(() -> {
          conn.startRead();
          handleMessage(conn, msg);
        });
      }
    }
    interface Context {
      ...
      void executeFromIO(Runnable handler);
      ...
    }
    The capturing lambda is instantiated for each call

  33. class VertxHandler extends ChannelDuplexHandler {
      ...
      // Process Netty's messages
      void channelRead(ChannelHandlerContext ctx, Object msg) {
        context.executeFromIO(() -> {
          conn.startRead();
          handleMessage(conn, msg);
        });
      }
    }
    interface Context {
      ...
      void executeFromIO(Runnable handler);
      <T> void executeFromIO(T msg, Consumer<T> handler);
    }

  34. class VertxHandler extends ChannelDuplexHandler {
      ...
      // Process Netty's messages
      void channelRead(ChannelHandlerContext ctx, Object msg) {
        context.executeFromIO(msg, message -> {
          conn.startRead();
          handleMessage(conn, message);
        });
      }
    }
    interface Context {
      ...
      void executeFromIO(Runnable handler);
      <T> void executeFromIO(T msg, Consumer<T> handler);
    }
    The non-capturing lambda is instantiated once

  35. class VertxHandler extends ChannelDuplexHandler {
      ...
      // Process Netty's messages
      void channelRead(ChannelHandlerContext ctx, Object msg) {
        context.executeFromIO(msg, handler);
      }
      private final Consumer<Object> handler = message -> {
        conn.startRead();
        handleMessage(conn, message);
      };
    }
    The lambda can become a field

  36. ✓ Minimize flushing

  37. ✓ Minimize flushing
    ✓ Optimise for the Just In Time
    compiler

  38. ✓ Minimize flushing
    ✓ Optimise for the Just In Time
    compiler
    ✓ Keep GC cool

  39. Database
    benchmarks

  40. Database benchmarks
    ✓ 4 benchmarks: db, queries, fortunes and updates
    ✓ MySQL, PostgreSQL or MongoDB
    ✓ 256 connections

  41. At round #14
    ✓ JDBC + HikariCP gives the best performance in Java
    ✓ Vert.x uses JDBC with a worker pool
    ✓ Blocking is actually not an issue (???)

  42. Handling the problem
    ✓ Focus on PostgreSQL
    ✓ Bad results were actually due to mistakes
    - A missing MongoDB $id index resulted in poor performance
    - Not using a transaction in UPDATES caused abysmal results

  43. The reactive PostgreSQL client
    ✓ Goals
    - Simple, clean and straightforward API
    - Non blocking
    - Performance
    - Lightweight
    ✓ Non goals
    - A driver
    - An abstraction

  44. Round trips to PostgreSQL
    [Diagram: each query waits for its result before the next query is sent]

  45. Pipelining to increase concurrency
    [Diagram: several queries are sent back to back on the same connection and the results stream back in order]

  46. [Chart: running 5000 queries with a 100µs ping; total time in seconds (0 to 0.4) vs. pipelining level (1, 2, 4, 8, 16), comparing JDBC and the reactive client]

  47. [Chart: running 5000 queries with a 1ms ping; total time in seconds (0 to 14) vs. pipelining level (1, 2, 4, 8, 16), comparing JDBC and the reactive client]

  48. CI - Java - unofficial

  49. The reactive SQL client
    ✓ Part of the Vert.x stack since 3.8 as the SQL Client
    ✓ Supports more databases
    - PostgreSQL
    - MySQL
    - SQL Server soon
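
    A minimal sketch of querying PostgreSQL with the reactive client, assuming the Vert.x 4.x API shape (PgPool, PgConnectOptions.setPipeliningLimit, preparedQuery(...).execute(...)); the database name, credentials and World table follow the TFB schema and are assumptions, and the exact API may differ from the 3.8-era client shown in the deck:

    import io.vertx.core.Vertx;
    import io.vertx.pgclient.PgConnectOptions;
    import io.vertx.pgclient.PgPool;
    import io.vertx.sqlclient.PoolOptions;
    import io.vertx.sqlclient.Tuple;

    public class PgPipeliningExample {
      public static void main(String[] args) {
        Vertx vertx = Vertx.vertx();

        PgConnectOptions connectOptions = new PgConnectOptions()
            .setHost("localhost")
            .setDatabase("hello_world")     // assumed TFB-style database name
            .setUser("benchmarkdbuser")     // assumed credentials
            .setPassword("benchmarkdbpass")
            .setPipeliningLimit(16);        // queries that may be in flight on one connection

        PgPool client = PgPool.pool(vertx, connectOptions, new PoolOptions().setMaxSize(4));

        client.preparedQuery("SELECT id, randomnumber FROM world WHERE id = $1")
            .execute(Tuple.of(42), ar -> {
              if (ar.succeeded()) {
                ar.result().forEach(row -> System.out.println(row.getInteger("randomnumber")));
              } else {
                ar.cause().printStackTrace();
              }
              client.close();
              vertx.close();
            });
      }
    }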

  50. Let there be pipelining

  51. What did we learn?
    ✓ TFB does not favour non-blocking designs
    ✓ The JVM is a great place for performance
    ✓ There are trade-offs between usability and performance
    ✓ RDBMS protocol design is a bottleneck
    ✓ Protocol concurrency matters

  52. TechEmpower Framework Benchmarks
    https://www.techempower.com/benchmarks/
    Reactive PostgreSQL Client
    https://github.com/eclipse-vertx/vertx-sql-client
    Async Profiler
    https://github.com/jvm-profiling-tools/async-profiler
    Flame Graphs
    https://github.com/brendangregg/FlameGraph
    Jitwatch
    https://github.com/AdoptOpenJDK/jitwatch
