When Performance Matters – Effective Performance Testing from the Ground Up

Hassy Veldstra - @hveldstra - W-JAX, Nov 2018, Munich

ARTILLERY
• Open source load & functional testing toolkit

WHAT THIS TALK IS NOT ABOUT
• Writing performant code
• Profiling code to make it faster
• Benchmarking code

WHAT THIS TALK IS ABOUT
• Understanding where performance testing fits into the delivery process
• The people involved
• Understanding how an effective performance testing strategy may be implemented

BACKGROUND


MOST IMPORTANTLY
Performance is a key requirement

ECOMMERCE

EVENTS/TICKETING

• Chat • Games • Web APIs • Payments APIs

HOW DO WE MEET OUR PERFORMANCE GOALS?

QUESTIONS
• Where does performance testing fit into the development, testing, and delivery process?
• How do we run performance tests in our CI/CD pipeline?
• How do we define performance goals & SLOs* for our services? Can we have those checked automatically in CI?

* SLOS
Service Level Objectives, e.g.:
• Support up to 2000 TPS
• 99% of all requests should be served in under 200ms
• No more than 0.1% of requests should return a 5xx response
For a detailed discussion, see the Google SRE book: https://landing.google.com/sre/sre-book/chapters/service-level-objectives/
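The latency and error-rate objectives above can be checked mechanically once a test run has produced per-request measurements. The following is a minimal Python sketch of that idea, not how Artillery or any other tool implements it. The helper names (`percentile`, `check_slos`) and the sample data are made up for illustration, and the throughput objective is not covered.

```python
import math


def percentile(values, pct):
    """Nearest-rank percentile: the smallest sample with at least pct% of samples at or below it."""
    ordered = sorted(values)
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[max(rank, 1) - 1]


def check_slos(latencies_ms, status_codes, p99_limit_ms=200, max_5xx_rate=0.001):
    """Return a dict of SLO name -> pass/fail for one test run (hypothetical helper)."""
    server_errors = sum(1 for code in status_codes if 500 <= code <= 599)
    return {
        # "99% of all requests should be served in under 200ms"
        "p99_latency": percentile(latencies_ms, 99) < p99_limit_ms,
        # "No more than 0.1% of requests should return a 5xx response"
        "error_rate": server_errors / len(status_codes) <= max_5xx_rate,
    }


# Made-up sample run: 1000 requests, 5 of them slow, one 5xx response.
latencies = [50] * 995 + [450] * 5
codes = [200] * 999 + [503]
results = check_slos(latencies, codes)
```

A CI job could fail the build whenever any value in `results` is false.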

QUESTIONS (CONTINUED)
• What types of performance tests are there? Which ones do we need for our services?
• How do we organize our test suites? What are the best practices around structuring those tests?
• How do we pick which services to test? Do we need to test all of our microservices? What about testing different environments and configurations easily?
• How do we encourage collaboration on performance testing and involve everyone: developers, testers, SREs and product managers? (Performance is everyone’s responsibility!)
• What’s the best way to get started with all of this?

MAYBE SOME ANSWERS
• Understand the place of performance testing in the delivery process
• How to go from zero to a performance testing suite that:
  • Is extensible & maintainable
  • Integrates into your CI/CD pipelines
  • Helps verify SLOs automatically
  • Works for developers, testers and SREs
  • Integrates with monitoring & reporting tools

DEFINITIONS
• Testing can suffer from a lack of precision in terminology
• What is “testing”? Several different dimensions to consider

TYPE OF THING UNDER TEST

WHEN A TEST TAKES PLACE https://medium.com/@copyconstruct/testing-microservices-the-sane-way-9bb31d158c16

https://medium.com/@copyconstruct/testing-in-production-the-safe-way-18ca102d0ef1

WHAT THE TEST IS TRYING TO PREVENT

AN ASIDE http://www.skytopia.com/project/fractal/mandelbulb.html

WHAT IS A TEST?
An activity, automated or manual, which:
1. Increases your confidence in some Thing
2. Confirms that your understanding of a Thing or its behavior is still correct
3. Increases your understanding of a Thing or its properties

INCREASE CONFIDENCE
• A/B tests
• Canarying
• Traffic replay
• Load test to add traffic above base level

CONFIRM UNDERSTANDING
• Unit tests: known good/bad input, known good/bad output
• Contract-based / property-based testing
• Load test on a known configuration that verifies some metrics afterwards (max response time <500ms)

INCREASE UNDERSTANDING
• Sprint 100m and take a heart rate reading
• Try opening 10k concurrent connections and see what happens
• Exploratory testing of all kinds
• Chaos testing

PERFORMANCE TESTING?
Any test that checks some performance-related property of a Thing. Often used interchangeably with “load testing”.

LOAD TESTING

GEOFILTERING
• Give an IP address + country code
• Returns yes/no

AUTH TOKEN SERVICE
• Give username/password
• Returns a signed JWT

OR A COMBINED API
• Authenticate and get a token
• Use the token to get geofiltering info

ACCEPTANCE/FUNCTIONAL TESTING
• Verify that a service (or another unit!) conforms to its contract
• Verify that the results it produces make sense

AUTH SERVICE
• Produces JWTs and not just empty 2xx responses
• The token contains expected fields
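As an illustration of the "token contains expected fields" check, here is a minimal Python sketch that decodes the payload segment of a JWT and reports which claims are missing. It makes several assumptions: the claim names (`sub`, `exp`, `iat`) are only examples of what a service contract might require, the sample token is built locally with a fake signature, and the signature is deliberately not verified, which a real acceptance test may also want to do.

```python
import base64
import json


def decode_jwt_payload(token):
    """Decode the payload (second) segment of a JWT without verifying the signature."""
    _header_b64, payload_b64, _signature_b64 = token.split(".")
    # JWT segments are base64url-encoded with the padding stripped; restore it.
    padded = payload_b64 + "=" * (-len(payload_b64) % 4)
    return json.loads(base64.urlsafe_b64decode(padded))


def missing_claims(token, expected=("sub", "exp", "iat")):
    """Return the expected claim names that are absent from the token payload."""
    payload = decode_jwt_payload(token)
    return [name for name in expected if name not in payload]


def b64url(data):
    """Encode a dict as an unpadded base64url JSON segment (used only to build the sample)."""
    return base64.urlsafe_b64encode(json.dumps(data).encode()).rstrip(b"=").decode()


# Locally built sample token; the payload has no "exp" claim on purpose.
sample_token = ".".join([
    b64url({"alg": "HS256", "typ": "JWT"}),
    b64url({"sub": "user-1", "iat": 1541030400}),
    "fake-signature",
])
missing = missing_claims(sample_token)
```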

• It’s possible to re-use the same test code for both acceptance tests and load tests if your tooling supports it
• (Artillery does!)

SMOKE TESTS
• Main characteristic is their speed
• A test that determines whether other tests should even run
• Is anything obviously broken? If we plug this thing in and turn it on, is there any smoke?
• Run a quick happy-path test case

WHERE DOES PERFORMANCE TESTING FIT INTO THE DELIVERY PROCESS?

• Early “write code” stage: performance tests help profile code or dependencies
• The rest typically run against a deployed service or API

WHAT TYPES OF PERFORMANCE TESTS ARE THERE?
• Load test in CI/CD to continuously verify SLOs of a service or composite API (pre-prod)
• Load testing to help capacity planning (pre-prod, manual)

CAPACITY PLANNING (PRE-PROD)
• Identify the amount of resources an instance of a service needs
• Identify the number of instances needed to meet a performance target
• Identify whether current capacity is sufficient or some limits will need to be raised
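The "number of instances needed" question comes down to simple arithmetic once a load test has shown how much traffic a single instance can sustain. The Python sketch below shows that calculation under loud assumptions: throughput scales linearly with instance count, the per-instance figure of 350 TPS is invented, and the 25% headroom is an arbitrary safety margin. Real services rarely scale perfectly linearly, so treat the result as a starting point for further testing.

```python
import math


def instances_needed(target_tps, tps_per_instance, headroom=0.25):
    """Instances required to serve target_tps with spare capacity, assuming linear scaling."""
    if tps_per_instance <= 0:
        raise ValueError("tps_per_instance must be positive")
    return math.ceil(target_tps * (1 + headroom) / tps_per_instance)


# Target taken from the example SLO of 2000 TPS; 350 TPS per instance is a
# made-up measurement standing in for the result of a load test.
needed = instances_needed(target_tps=2000, tps_per_instance=350)
```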

WHAT TYPES OF PERFORMANCE TESTS ARE THERE? (CONTINUED)
• Load test a service to better understand its scaling properties and tune configs (pre-prod, manual)
• Stress test a service to find its limits & understand how it degrades (pre-prod, manual)
• Add extra traffic as a safety margin; a form of chaos testing (in prod, manual or automatic)

HOW DO WE RUN THESE IN CI/CD?
• Create a parameterized CI job that can run a load test against a $service in an $environment with a $load_profile and (optionally) verify $slos.
• This can act as a stage in other pipelines
• … or be used manually by a dev/tester for ad-hoc testing via a web UI or a CLI (e.g. on Jenkins or AWS CodeBuild)
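One way to picture such a parameterized job is as a small function that turns the job parameters into an `artillery run` invocation. The Python sketch below only builds the argument list; it does not run anything. It assumes the `services/<service>/config.yaml` layout and the `--config`, `--overrides` and `-e` flags shown later in this talk, and the function name and parameters are hypothetical.

```python
import json
from pathlib import PurePosixPath


def build_artillery_command(service, environment, scenario, overrides=None, root="."):
    """Build the argv for one load test run (hypothetical CI helper)."""
    service_dir = PurePosixPath(root) / "services" / service
    command = ["artillery", "run", "--config", str(service_dir / "config.yaml")]
    if overrides is not None:
        # Load profile and SLO checks are passed as a JSON string.
        command += ["--overrides", json.dumps(overrides)]
    command += ["-e", environment, str(service_dir / scenario)]
    return command


command = build_artillery_command(
    service="auth-service",
    environment="dev",
    scenario="login.yaml",
    overrides={"config": {"ensure": {"p99": 200}}},
)
```

A CI job could pass the resulting list to `subprocess.run(command, check=True)` and rely on the exit code to mark the stage as passed or failed.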

HOW DO WE RUN THESE IN CI/CD?
• The tests don’t run on the CI server itself
• Best to be able to run them on your own (cloud) infrastructure
  • Flexibility when it comes to VPCs or regions
  • Critical for testing internal microservices
  • More cost-effective too


HOW DO WE RUN THESE IN CI/CD?
• What about test frequency?
  • No one-size-fits-all approach
  • Possible to test every change for microservices on the critical path, but probably excessive for most services
  • Run on a schedule (e.g. nightly)
  • Run before a final promotion of a change to prod

DEFINING SLOS
• Setting SLOs should be part of your team’s “creating a new microservice” checklist
• Documented alongside API specs and design docs (typically a Confluence/wiki page that follows a template)
• Involve other teams that may rely on the service!
• Better to have some SLOs (& revise) than none at all

WHICH SERVICES SHOULD WE TEST?
• What could be tested? Anything with an API spec.
  • Individual services
  • Composite APIs (each is a “unit” with its own properties & behavior)
• Does a microservice have SLOs? Then it should have a performance test to verify those automatically.

HOW DO WE ENCOURAGE COLLABORATION?
• No magical solutions
• Involve all functions in discussions about performance
• Remove barriers:
  • Use tools that everyone has access to
  • Use tools that everyone can work with
• Monorepos help
• Good tooling helps
  • Available to everyone, ideally open source
  • Easy to install and get started with
  • Uses a language that everyone is familiar with
• Make load testing reports & findings available to everyone (e.g. on Confluence/your wiki or KB)

HOW DO WE GET STARTED?

ARTILLERY 101
• Available on npm: npm install -g artillery
• The artillery CLI is used to run tests and create HTML reports
• Tests are written in YAML and can be extended with JavaScript
• Supports HTTP, Socket.io, WebSocket, Kinesis and HLS out of the box
• Third-party plugins for SQS, Lambda, SQL etc.
• Supports plugins. Out of the box: StatsD/Datadog/Librato integration
• Third-party plugins for other monitoring systems
• Designed to allow for complex, multi-step virtual user behavior to be scripted
• Support for randomizing requests, and for capturing data from responses and re-using it in other requests
• Supports assertions on metrics such as HTTP latency, i.e. automated checking of SLOs
• Runs well in Docker; easy to run on ECS or Kubernetes or in a CI/CD pipeline
• Can generate self-contained HTML reports with charts and graphs
• Features for creating modular test suites

CONFIG

    config:
      target: "" # we don't set a target by default
      environments:
        dev:
          target: "https://auth-service-dev.acme-corp.internal"
          defaults:
            headers:
              x-api-key: "0xcoffee"
        local:
          target: "http://localhost:8080"
      processor: "./functions.js"
      plugins:
        datadog: {}
      payload:
        - path: "./username-password.csv"
          fields:
            - username
            - password

ONE OR MORE SCENARIOS

    scenarios:
      - name: Authenticate with valid credentials
        flow:
          - post:
              url: "/auth"
              json:
                username: "{{ username }}"
                password: "{{ password }}"
              expect:
                - statusCode: 200
                - contentType: json

    $ artillery run \
        --config config.yaml \
        -e dev \
        scenario.yaml

ORGANIZING OUR TEST SUITE
• Using a monorepo
  • Easier to get started with, extend & maintain
  • Easier to share across teams
  • Helps code reuse
  • Easier to work with in CI/CD pipelines

ORGANIZING OUR TEST SUITE
https://github.com/shoreditch-ops/acme-corp-api-tests

    acme-corp-api-tests/
      services/
        auth-service/
          scenarios/
          config.yaml
          functions.js
          overrides.slo-response-time.json
      package.json
      common-config.yaml

WHY THAT STRUCTURE?
• Extensible:
  • Can add a new service or API easily, or add a new scenario to an existing one
  • Allows for service-specific config such as environment URLs or data from external CSVs
  • Allows for service-specific custom code, e.g. to generate random data in a certain format
• Can encode service-specific load phases and SLOs

    {
      "config": {
        "phases": [
          { "duration": 120, "arrivalRate": 10, "rampTo": 20, "name": "Warm up the service" },
          { "duration": 240, "arrivalRate": 20, "rampTo": 100, "name": "Ramp to high load" },
          { "duration": 600, "arrivalRate": 100, "name": "Sustained high load" }
        ],
        "ensure": {
          "maxErrorRate": 0.1,
          "p99": 200
        }
      }
    }
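To get a feel for how much load the phases above describe, the Python sketch below estimates the number of virtual users each phase creates. It assumes `duration` is in seconds, `arrivalRate` is new virtual users per second, and a ramp rises linearly from `arrivalRate` to `rampTo`. The tool may ramp in discrete steps, so these are approximations rather than exact counts, and the `estimated_arrivals` helper is made up for this illustration.

```python
def estimated_arrivals(phase):
    """Approximate virtual users created in one phase, treating a ramp as linear."""
    start = phase["arrivalRate"]
    end = phase.get("rampTo", start)  # no rampTo means a constant arrival rate
    return phase["duration"] * (start + end) / 2


phases = [
    {"duration": 120, "arrivalRate": 10, "rampTo": 20, "name": "Warm up the service"},
    {"duration": 240, "arrivalRate": 20, "rampTo": 100, "name": "Ramp to high load"},
    {"duration": 600, "arrivalRate": 100, "name": "Sustained high load"},
]

per_phase = {phase["name"]: estimated_arrivals(phase) for phase in phases}
total_users = sum(per_phase.values())
total_seconds = sum(phase["duration"] for phase in phases)
```

Under these assumptions the profile runs for 16 minutes and creates roughly 76,200 virtual users, most of them in the sustained phase.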

RUNNING A TEST

    artillery run \
        --config ./services/auth-service/config.yaml \
        --overrides "$(cat ./services/auth-service/overrides.slos.json)" \
        -e dev \
        ./services/auth-service/login.yaml

• Easy to parameterize in CI/CD: service name, load/SLO override, environment and optionally a scenario
• Generic / reusable CI job!
• Reusable by other jobs / pipeline stages
• Or via the UI for ad hoc testing, e.g. in Jenkins or AWS CodeBuild


WHERE TO START?
• Pick one service
• Write tests using the template & set up a CI job to run them
• Show & tell to the rest of the team
• A good candidate service:
  • Small API surface
  • Has experienced performance issues, or
  • On the critical path for other components, or
  • Has high performance requirements
  • For example: an authentication service

SO… WE’VE LOOKED AT
• What performance testing is, and different types of performance tests
• Where performance testing fits into the development, testing, and delivery process
• Running performance tests in CI/CD pipelines
• Setting and verifying SLOs
• The mechanics of setting up a test suite with Artillery

THANKS & OVER TO YOU NOW! slides:
