The Testing – Monitoring Continuum (DevOps Sydney, 2015)

The Testing↔Monitoring Continuum

Question Time

Ops Who here uses Nagios? Monit+MMonit? ... Serverspec?

Dev Who writes unit tests? Integration tests, eg. using browser-driving
full-stack tools like Selenium, Capybara, etc?

What is Monitoring?

You're developing a product; an app or something. Let's say
we have a bunch of machines running that app.

Load Balancers

Load Balancers App Servers

Load Balancers App Servers Data Store

We're monitoring. What do we do? Well, first we should
probably make sure that the servers are actually up. Easy!

Well, what about more specific things. Is PostgreSQL running on
the database? Can we see its PID?

Is Postgres accepting connections?

Is it accepting connections with the right username + password
for the app? Maybe we stuff up a config rollout.

Okay, but does it have the PG extensions the app
needs, eg. for UUID generation?

Is the app's database named correctly?

Can the app see the tables it needs in the
database?

Can it write to those tables? Maybe we screwed up
the permissions.

THIS IS GETTING A BIT MUCH.

Do we have to do this for every service or
node that we're running? Where do we stop?

Run the App. Well, maybe the best way of doing
this is running the app itself. We could write a bash+curl script that, like, tests just logging in.

Run the App's Tests. But is that testing everything the
app needs to use? Maybe it'll break on the next click. Why not go the whole hog? Our app has an integration test suite (or should have). We spent a lot of money on it!

Story Time Let's say we have a multi-tenant, hosted, Software-as-a-Service
app that users buy instances/accounts for. VM Hosting, Chat, whatever.

Local Dev. Env We'd have unit tests that you run
on your local box.

Local Dev. Env But also those big browser-driven tests as
well. The test runner is still local, against a local copy of your app.

Local Dev. Env Production Staging We have staging and production
environments too.

Local Dev. Env Production Staging Why don't we: * Spin
up a new account on staging. * Run the integration tests against that new account. * Throw away the account afterwards.

Local Dev. Env Production Staging It could be a custom
app kicking off these test runs, but it could easily be Jenkins.

Local Dev. Env Production Staging Do the same for production!
Have these tests run over and over again. Chew up some of your production capacity, but have greater surety that your app works when placed into the staging and production environments you've configured and rolled out.

Local Dev. Env Production Staging We're testing the  app+infrastructure interface.
We're testing that the, say, file upload feature on your chat app actually works with the infrastructure it's relying on.

Local Dev. Env Production Staging It's not super-easy or perfect,
and testing interactions with external systems (particularly payment ones) is hard, and might just involve turning off parts of your test and instrumenting detection of errors instead.

Local Dev. Env Production Staging And finally, to be clear,
this isn't replacing your environment tests (eg. available disk/RAM/CPU) or error-rate instrumentation; this is to alleviate the need for a ton of individual fine-grained service checks that would be better tested by an app being hit by your existing test suite.

Testing Monitoring Back to the title. Instead of Testing and
Monitoring as separate, discrete things, I'd argue that…

Testing Testing  +  Monitoring … Testing is a part of
Good Monitoring.

Fin.   Rob Howard  @damncabbage https://speakerdeck.com/damncabbage/ Thanks! One final thing…

I work at OrionVM and we're hiring; we're building cloud
hosting (physical) infrastructure, and we're after an Ops person (networking+routing, physical server wrangling, configuration management) and a Ruby/JS dev (UI) to help out.

Fin.   Rob Howard  @damncabbage https://speakerdeck.com/damncabbage/

The Testing – Monitoring Continuum (DevOps Sydn...

The Testing – Monitoring Continuum (DevOps Sydney, 2015)

More Decks by Rob Howard

Other Decks in Technology

Featured

Transcript