Upgrade to Pro — share decks privately, control downloads, hide ads and more …

The Testing – Monitoring Continuum (DevOps Sydn...

Rob Howard
January 15, 2015

The Testing – Monitoring Continuum (DevOps Sydney, 2015)

"Testing" and "Monitoring" are often seen as completely separate things. Let's try mixing it up.

(Lightning talk.)

Rob Howard

January 15, 2015
Tweet

More Decks by Rob Howard

Other Decks in Technology

Transcript

  1. Dev Who writes unit tests? Integration tests, eg. using browser-driving

    full-stack tools like Selenium, Capybara, etc?
  2. You're developing a product; an app or something. Let's say

    we have a bunch of machines running that app.
  3. We're monitoring. What do we do? Well, first we should

    probably make sure that the servers are actually up. Easy!
  4. Is it accepting connections with the right username + password

    for the app? Maybe we stuff up a config rollout.
  5. Okay, but does it have the PG extensions the app

    needs, eg. for UUID generation?
  6. Do we have to do this for every service or

    node that we're running? Where do we stop?
  7. Run the App. Well, maybe the best way of doing

    this is running the app itself. We could write a bash+curl script that, like, tests just logging in.
  8. Run the App's Tests. But is that testing everything the

    app needs to use? Maybe it'll break on the next click. Why not go the whole hog? Our app has an integration test suite (or should have). We spent a lot of money on it!
  9. Story Time Let's say we have a multi-tenant, hosted, Software-as-a-Service

    app that users buy instances/accounts for. VM Hosting, Chat, whatever.
  10. Local Dev. Env But also those big browser-driven tests as

    well. The test runner is still local, against a local copy of your app.
  11. Local Dev. Env Production Staging Why don't we: * Spin

    up a new account on staging. * Run the integration tests against that new account. * Throw away the account afterwards.
  12. Local Dev. Env Production Staging It could be a custom

    app kicking off these test runs, but it could easily be Jenkins.
  13. Local Dev. Env Production Staging Do the same for production!

    Have these tests run over and over again. Chew up some of your production capacity, but have greater surety that your app works when placed into the staging and production environments you've configured and rolled out.
  14. Local Dev. Env Production Staging We're testing the
 app+infrastructure interface.

    We're testing that the, say, file upload feature on your chat app actually works with the infrastructure it's relying on.
  15. Local Dev. Env Production Staging It's not super-easy or perfect,

    and testing interactions with external systems (particularly payment ones) is hard, and might just involve turning off parts of your test and instrumenting detection of errors instead.
  16. Local Dev. Env Production Staging And finally, to be clear,

    this isn't replacing your environment tests (eg. available disk/RAM/CPU) or error-rate instrumentation; this is to alleviate the need for a ton of individual fine-grained service checks that would be better tested by an app being hit by your existing test suite.
  17. Testing Monitoring Back to the title. Instead of Testing and

    Monitoring as separate, discrete things, I'd argue that…
  18. I work at OrionVM and we're hiring; we're building cloud

    hosting (physical) infrastructure, and we're after an Ops person (networking+routing, physical server wrangling, configuration management) and a Ruby/JS dev (UI) to help out.