Performance Testing From The Ground Up

Eﬀective Performance Testing From The Ground Up When performance matters
Hassy Veldstra [email protected] - @hveldstra DevOpsCon, Dec 2018, Munich

Hello @hveldstra

Open source backend/API testing toolkit Load testing & functional testing
@hveldstra

What this talk is not about @hveldstra

What this talk is not about • Writing performant code
(algorithms, optimization techniques) @hveldstra

(algorithms, optimization techniques) • Proﬁling code to make it run faster, use less memory etc @hveldstra

(algorithms, optimization techniques) • Proﬁling code to make it run faster, use less memory etc • Benchmarking code @hveldstra

What this talk is about @hveldstra

What this talk is about • Understanding where performance testing
ﬁts into delivery process @hveldstra

ﬁts into delivery process • The people/roles involved @hveldstra

ﬁts into delivery process • The people/roles involved • Understanding the various pieces that make up a performance testing strategy @hveldstra

ﬁts into delivery process • The people/roles involved • Understanding the various pieces that make up a performance testing strategy • Understanding of what an eﬀective performance testing approach may look like @hveldstra

Background @hveldstra

@hveldstra

Most importantly Performance is an key requirement @hveldstra

Ecommerce @hveldstra

IOT @hveldstra

Events / ticketing @hveldstra

• Chat • Games • Web APIs • Payments APIs
@hveldstra

How do we meet our performance goals? @hveldstra

Questions, questions @hveldstra

Questions, questions Where does performance testing ﬁt into the development,
testing, and delivery process? @hveldstra

Questions, questions How do we run performance tests in our
CI/CD pipeline? @hveldstra

Questions, questions How do we deﬁne performance goals & SLOs*
for our services? Can we have those checked automatically in CI? @hveldstra

SLOs Service Level Objectives, e.g.: • Support up to 2000
TPS • 99% of all requests should be served in under 200ms • No more than 0.1% of requests should return a 5×x response https://landing.google.com/sre/sre-book/chapters/service-level-objectives/ For a detailed discussion, see Google SRE book: @hveldstra

Questions Questions What types of performance tests are there? Which
ones do we need for our services? @hveldstra

Questions Questions How do we organize our test suites? What
are the best practices around structuring those tests? @hveldstra

Questions Questions How do we pick which services to test?
Do we need to test all of our microservices? What about testing diﬀerent environments and conﬁgurations easily? @hveldstra

Questions Questions How do we encourage collaboration on performance testing
and involve everyone: developers, testers, SREs and product managers? (performance is everyone’s responsibility!) @hveldstra

Questions Questions What’s the best way to get started with
all of this? @hveldstra

Maybe some answers @hveldstra

Maybe some answers Understanding the place of performance testing in
our delivery process @hveldstra

Maybe some answers How to go from zero to a
performance testing suite that’s: @hveldstra

performance testing suite that’s: • Extensible & maintainable @hveldstra

performance testing suite that’s: • Extensible & maintainable • Integrates into your CI/CD pipelines @hveldstra

performance testing suite that’s: • Extensible & maintainable • Integrates into your CI/CD pipelines • Helps verify SLOs automatically @hveldstra

performance testing suite that’s: • Extensible & maintainable • Integrates into your CI/CD pipelines • Helps verify SLOs automatically • Works for developers, testers and SREs @hveldstra

performance testing suite that’s: • Extensible & maintainable • Integrates into your CI/CD pipelines • Helps verify SLOs automatically • Works for developers, testers and SREs • Integrates with monitoring & reporting tools @hveldstra

Maybe some answers @hveldstra • Based on projects & teams
I’ve seen and questions typically raised • YMVV

Deﬁnitions @hveldstra

Deﬁnitions • Testing can suﬀer from lack of precision in
terminology @hveldstra

Definitions • Testing can suffer from lack of precision in
terminology • What is “testing”? Several different dimensions to consider @hveldstra

Type of thing under test @hveldstra

When a test takes place https://medium.com/@copyconstruct/testing-microservices-the-sane- way-9bb31d158c16 @hveldstra

https://medium.com/@copyconstruct/testing-in-production-the-safe-way-18ca102d0ef1 @hveldstra

What the test is trying to prevent @hveldstra

What is a test? @hveldstra

What is a test? An activity, automated or manual, which:
@hveldstra

1. Increases your conﬁdence in some Thing @hveldstra

1. Increases your conﬁdence in some Thing 2. Conﬁrms that your understanding of a Thing or its behavior is still correct @hveldstra

1. Increases your conﬁdence in some Thing 2. Conﬁrms that your understanding of a Thing or its behavior is still correct 3. Increases your understanding of a Thing or its properties @hveldstra

Examples @hveldstra

Increase confidence • A/B tests • Canarying • Traffic replay
• Load test to add traffic above base level @hveldstra

Confirm understanding • Unit tests - known good/bad input, known
good/bad output • Contract-based / property-based testing • Load test on a known configuration that verifies some metrics afterwards (max response time <500ms) @hveldstra

Increase understanding • Sprint 100m and take a heart rate
reading • Try opening 10k concurrent connections and see what happens • Exploratory testing of all kinds • Chaos testing @hveldstra

Performance testing? any test that tests some performance-related property of
a Thing. Often used interchangeably with “load testing”. @hveldstra

Load testing @hveldstra

@hveldstra

Geoﬁltering • Give an IP address + country code •
Returns yes/no @hveldstra

Auth Token Service • Give username/password • Returns signed JWT
token @hveldstra

Or a combined API • Authenticate and get a token
• Use the token to get geoﬁltering info @hveldstra

What types of performance tests are there? @hveldstra

What types of performance tests are there? • Load test
in CI/CD to continuously verify SLOs (service or composite API) — pre-prod @hveldstra

in CI/CD to continuously verify SLOs (service or composite API) — pre-prod • Load testing to help capacity planning — pre-prod, manual @hveldstra

Capacity Planning • Identify amount of resources an instance of
a service needs @hveldstra

a service needs • Identify the number of instances needed to meet a performance target @hveldstra

a service needs • Identify the number of instances needed to meet a performance target • Identify whether current capacity is suﬃcient or some limits will need to be raised pre-prod @hveldstra

in CI/CD to continuously verify SLOs (service or composite API) — pre-prod • Load testing to help capacity planning — pre-prod, manual • Load test a service to better understand its scaling properties and tune conﬁgs — pre-prod, manual @hveldstra

in CI/CD to continuously verify SLOs (service or composite API) — pre-prod • Load testing to help capacity planning — pre-prod, manual • Load test a service to better understand its scaling properties and tune conﬁgs — pre-prod, manual • Stress test a service to ﬁnd its limits & understand how it degrades — pre-prod, manual @hveldstra

in CI/CD to continuously verify SLOs (service or composite API) — pre-prod • Load testing to help capacity planning — pre-prod, manual • Load test a service to better understand its scaling properties and tune configs — pre-prod, manual • Stress test a service to find its limits & understand how it degrades — pre-prod, manual • In prod, manual or automatic — add extra traffic as a safety margin; a form of chaos testing @hveldstra

What types of performance tests are there? • Soak test
- a load test that runs for a longer period of time (1-2 hours)

What types of performance tests are there? • Soak test
- a load test that runs for a longer period of time (1-2 hours) • Spike test - a load test that ramps up load very quickly

Acceptance/functional Testing @hveldstra

Acceptance/functional Testing • Verify that a service (or another unit!)
conforms to its contract @hveldstra

Acceptance/functional Testing • Verify that a service (or another unit!)
conforms to its contract • Verify that the results it produces make sense @hveldstra

Auth service • Produces JWTs and not just empty 2×x
responses • The token contains expected ﬁelds @hveldstra

• It’s possible to re-use the same test code for
both acceptance tests and load tests if your tooling supports it @hveldstra

• It’s possible to re-use the same test code for
both acceptance tests and load tests if your tooling supports it • (Artillery does!) @hveldstra

Smoke tests @hveldstra

Smoke tests • Main characteristic is their speed @hveldstra

Smoke tests • Main characteristic is their speed • A
test that determines whether other tests should even run @hveldstra

test that determines whether other tests should even run • Is anything obviously broken? If we plug this thing in and turn it on, is there any smoke? @hveldstra

test that determines whether other tests should even run • Is anything obviously broken? If we plug this thing in and turn it on, is there any smoke? • Run a quick happy-path test case @hveldstra

Where does performance testing ﬁt into the delivery process? @hveldstra

@hveldstra

Where does Performance testing ﬁt into the delivery process? @hveldstra

Early “write code” stage: help proﬁle code or dependencies The
rest typically is on a deployed service or API @hveldstra

How do we run these in CI/CD? @hveldstra

How do we run these in CI/CD? • Create a
parameterized CI job that can run a load test against a $service in an $environment with a $load_profile and (optionally) verify $slos. @hveldstra

parameterized CI job that can run a load test against a $service in an $environment with a $load_profile and (optionally) verify $slos. • This can act as a stage in other pipelines @hveldstra

parameterized CI job that can run a load test against a $service in an $environment with a $load_profile and (optionally) verify $slos. • This can act as a stage in other pipelines • … or be used manually by a dev/tester for ad-hoc testing via a web UI or a CLI (e.g. on Jenkins or AWS CodeBuild) @hveldstra

How do we run these in CI/CD? • The tests
don’t run on the CI server itself @hveldstra

don’t run on the CI server itself • Best to be able to run them on your own (cloud) infrastructure @hveldstra

don’t run on the CI server itself • Best to be able to run them on your own (cloud) infrastructure • Flexibility when it comes to VPCs or regions @hveldstra

don’t run on the CI server itself • Best to be able to run them on your own (cloud) infrastructure • Flexibility when it comes to VPCs or regions • Critical for testing internal microservices @hveldstra

don’t run on the CI server itself • Best to be able to run them on your own (cloud) infrastructure • Flexibility when it comes to VPCs or regions • Critical for testing internal microservices • More cost-eﬀective too @hveldstra

@hveldstra

How do we run these in CI/CD? ❌ @hveldstra

How do we run these in CI/CD? ✅ ❌ @hveldstra

How do we run these in CI/CD? • What about
test frequency? @hveldstra

test frequency? • No one-size-ﬁts-all approach @hveldstra

test frequency? • No one-size-ﬁts-all approach • Possible to test every change for microservices on the critical path, but probably excessive for most services @hveldstra

test frequency? • No one-size-ﬁts-all approach • Possible to test every change for microservices on the critical path, but probably excessive for most services • Run on a schedule (e.g. nightly) @hveldstra

test frequency? • No one-size-ﬁts-all approach • Possible to test every change for microservices on the critical path, but probably excessive for most services • Run on a schedule (e.g. nightly) • Run before a ﬁnal promotion of a change to prod @hveldstra

Deﬁning SLOs @hveldstra

Deﬁning SLOs • Setting SLOs should be part of your
team’s “creating a new microservice” checklist @hveldstra

team’s “creating a new microservice” checklist • Use & adapt: https://github.com/SkeltonThatcher/run- book-template @hveldstra

@hveldstra

team’s “creating a new microservice” checklist • Documented alongside API specs, design docs (typically a Conﬂuence/wiki page that follows a template) @hveldstra

team’s “creating a new microservice” checklist • Documented alongside API specs, design docs (typically a Conﬂuence/wiki page that follows a template) • Involve other teams that may rely on the service! @hveldstra

team’s “creating a new microservice” checklist • Documented alongside API specs, design docs (typically a Conﬂuence/wiki page that follows a template) • Involve other teams that may rely on the service! • Better to have some SLOs (& revise) than none at all @hveldstra

Which services should we test? @hveldstra

Which services should we test? • What could be tested?
@hveldstra

Anything with an API spec. @hveldstra

Anything with an API spec. • Individual services @hveldstra

Anything with an API spec. • Individual services • Composite APIs (that’s a “unit” with its own properties & behavior) @hveldstra

Anything with an API spec. • Individual services • Composite APIs (that’s a “unit” with its own properties & behavior) • Does a microservice have SLOs? Then it should have a performance test to verify those automatically. @hveldstra

How do we encourage collaboration? @hveldstra

How do we encourage collaboration? • No magical solutions @hveldstra

How do we encourage collaboration? • No magical solutions •
Involve all functions in discussions about performance @hveldstra

Involve all functions in discussions about performance • Remove barriers: @hveldstra

Involve all functions in discussions about performance • Remove barriers: • Use tools that everyone has access to • Use tools that everyone can work with @hveldstra

How do we encourage collaboration? • Monorepos help @hveldstra

How do we encourage collaboration? • Monorepos help • Good
tooling helps @hveldstra

tooling helps • Available to everyone, ideally open source @hveldstra

tooling helps • Available to everyone, ideally open source • Easy to install and get started with @hveldstra

tooling helps • Available to everyone, ideally open source • Easy to install and get started with • Uses a language that everyone is familiar with @hveldstra

tooling helps • Available to everyone, ideally open source • Easy to install and get started with • Uses a language that everyone is familiar with • Make load testing reports & ﬁndings available to everyone (e.g. on Conﬂuence/your wiki or KB) @hveldstra

How do we get started? @hveldstra

Artillery 101 @hveldstra

Artillery 101 • Available on npm: npm install -g artillery
@hveldstra

• The artillery CLI is used to run tests and create HTML reports @hveldstra

• The artillery CLI is used to run tests and create HTML reports • Tests are written in YAML and can be extended with Javascript @hveldstra

Artillery 101 • Supports HTTP, Socketio, WebSocket, Kinesis, HLS out
of the box. @hveldstra

of the box. • Third-party plugins for SQS, Lambda, SQL etc @hveldstra

of the box. • Third-party plugins for SQS, Lambda, SQL etc • Supports plugins. Out of the box: Statsd/Datadog/ Librato integration. @hveldstra

of the box. • Third-party plugins for SQS, Lambda, SQL etc • Supports plugins. Out of the box: Statsd/Datadog/ Librato integration. • Third-party plugins for other monitoring systems @hveldstra

Artillery 101 • Designed to allow for complex, multi-step virtual
user behavior to be scripted @hveldstra

user behavior to be scripted • Support for randomizing requests and capturing data from responses and re-using it in other requests @hveldstra

user behavior to be scripted • Support for randomizing requests and capturing data from responses and re-using it in other requests • Supports assertions on metrics such as HTTP latency — ie automated checking of SLOs @hveldstra

Artillery 101 • Runs well in Docker, easy to run
on ECS or Kubernetes or in a CI/CD pipeline @hveldstra

on ECS or Kubernetes or in a CI/CD pipeline • Can generate self-contained HTML reports with charts and graphs @hveldstra

on ECS or Kubernetes or in a CI/CD pipeline • Can generate self-contained HTML reports with charts and graphs • Features for creating modular test suites @hveldstra

Conﬁg config: target: "" # we don't set a target
by default environments: dev: target: "https://auth-service-dev.acme-corp.internal" defaults: headers: x-api-key: "0xcoffee" local: target: "http://localhost:8080" processor: “./functions.js" plugins: datadog: {} payload: - path: "./username-password.csv" fields: - username - password @hveldstra

One or more scenarios scenarios: - name: Authenticate with valid
credentials flow: - post: url: "/auth" json: username: "{{ username }}" password: "{{ password }}" expect: - statusCode: 200 - contentType: json @hveldstra

$ artillery run \ --config config.yaml \ -e dev \
scenario.yaml @hveldstra

Organizing our test suite @hveldstra

Organizing our test suite • Using a monorepo @hveldstra

Organizing our test suite • Using a monorepo • Easier
to get started with, extend & maintain @hveldstra

to get started with, extend & maintain • Easier to share across teams @hveldstra

to get started with, extend & maintain • Easier to share across teams • Helps code reuse @hveldstra

to get started with, extend & maintain • Easier to share across teams • Helps code reuse • Easier to work with in CI/CD pipelines @hveldstra

Organizing our test suite https://github.com/shoreditch-ops/acme-corp-api-tests @hveldstra

Organizing our test suite acme-corp-api-tests/ - services/ - auth-service/ -
scenarios/ - config.yaml - functions.js - overrides.slo-response-time.json - package.json - common-config.yaml @hveldstra

Why that structure? @hveldstra

Why that structure? • Extensible: @hveldstra

Why that structure? • Extensible: • can add a new
service or API easily, or add a new scenario to an existing one @hveldstra

service or API easily, or add a new scenario to an existing one • allows for service-speciﬁc conﬁg such as environment URLs or data from external CSVs @hveldstra

service or API easily, or add a new scenario to an existing one • allows for service-specific config such as environment URLs or data from external CSVs • allows for service-specific custom code, e.g. to generate random data in a certain format @hveldstra

service or API easily, or add a new scenario to an existing one • allows for service-specific config such as environment URLs or data from external CSVs • allows for service-specific custom code, e.g. to generate random data in a certain format • Can encode service-specific load phases and SLOs @hveldstra

{ "config": { "phases": [ { "duration": 120, "arrivalRate": 10,
"rampTo": 20, "name": "Warm up the service" }, { "duration": 240, "arrivalRate": 20, "rampTo": 100, "name": "Ramp to high load" }, { "duration": 600, "arrivalRate": 100, "name": "Sustained high load" } ], "ensure": { "maxErrorRate": 0.1, "p99": 200 } } } @hveldstra

Running a test artillery run \ —config ./services/auth-service/config.yaml \ --overrides
"$(cat ./services/auth-service/ overrides.slos.json)” \ --e dev ./services/auth-service/login.yaml @hveldstra

Running a test artillery run \ —config ./services/auth-service/config.yaml \ --overrides
"$(cat ./services/auth-service/ overrides.slos.json)” \ --e dev ./services/auth-service/login.yaml Service name, load/SLO override, environment and optionally a scenario → generic CI job @hveldstra

Running a test • Reusable by other jobs / pipeline
stages • Or via the UI for ad hoc testing - e.g. in Jenkins or AWS CodeBuild @hveldstra

Organizing our test suite acme-corp-api-tests/ - services/ - auth-service/ -
scenarios/ - config.yaml - functions.js - overrides.slo-response-time.json - package.json - common-config.yaml @hveldstra

Where to start? @hveldstra

Where to start? • Pick one service @hveldstra

Where to start? • Pick one service • Write tests
using the template & set up a CI job to run them @hveldstra

Where to start? • Pick one service • Write tests
using the template & set up a CI job to run them • Show & tell to the rest of the team @hveldstra

Where to start? • A good candidate service: @hveldstra

Where to start? • A good candidate service: • Small
API surface @hveldstra

API surface • Has experienced performance issues, or @hveldstra

API surface • Has experienced performance issues, or • On the critical path for other components, or @hveldstra

API surface • Has experienced performance issues, or • On the critical path for other components, or • Has high performance requirements @hveldstra

API surface • Has experienced performance issues, or • On the critical path for other components, or • Has high performance requirements • For example: an authentication service @hveldstra

So… we’ve looked at @hveldstra

So… we’ve looked at • What performance testing is, and
diﬀerent types of performance tests @hveldstra

diﬀerent types of performance tests • Where performance testing ﬁts into the development, testing, and delivery process @hveldstra

diﬀerent types of performance tests • Where performance testing ﬁts into the development, testing, and delivery process • Running performance tests in CI/CD pipelines @hveldstra

diﬀerent types of performance tests • Where performance testing ﬁts into the development, testing, and delivery process • Running performance tests in CI/CD pipelines • Setting and verifying SLOs @hveldstra

diﬀerent types of performance tests • Where performance testing ﬁts into the development, testing, and delivery process • Running performance tests in CI/CD pipelines • Setting and verifying SLOs • The mechanics of setting up a test suite with Artillery @hveldstra

email: [email protected] twitter: @hveldstra slides: https://speakerdeck.com/hassy/when-performance-matters- eﬀective-performance-testing-from-the-ground-up Thanks!

Performance Testing From The Ground Up

Performance Testing From The Ground Up

More Decks by hassy veldstra

Other Decks in Programming

Featured

Transcript