Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Building a Minimum Viable Incident Response Plan

Building a Minimum Viable Incident Response Plan

A minimum viable product (MVP) allows you to obtain rapid feedback, and implement continuous and iterative improvements. When you put an MVP into production, however, you need to take the next step and make sure that you can respond effectively when problems inevitably arise. You need a minimum viable response plan. One that allows you to keep your MVP operational without burdening your team.

In this talk, we'll discuss the full lifecycle of an incident, what it means to be robust versus resilient when building a response plan. And we'll help you determine who is on-call when, and for which kinds of problems. We'll also talk about notifications, escalations, and how to enable learning from each problem your service encounters. This will give you the basis of a minimum viable response plan, so you and your team have a baseline to start and continuously improve up.

Jason Hand

April 09, 2019
Tweet

More Decks by Jason Hand

Other Decks in Technology

Transcript

  1.  Computers do what they're told. If they're told to

    do the wrong thing, they're going to do it and they're going to do it really, really well. - Lawrence Pingree - Gartner 12/71
  2. COMP LI CA TE D Sense-Analyze-Respond (Good Practice) SI MP

    LE Sense-Categorize- Respond (Best Practice) CH A OTI C Act-Sense-Respond (Novel) COMP LE X Probe-Sense-Respond (Emergent) Disorder 19/71
  3.  Incidents aren't deviations from some idyllic norm: they are

    the norm. - Rob England - itskeptic Part of the job 25/71
  4. Detection First phase of an incident Tooling has identified a

    problem Notification has been triggered 37/71
  5. Analysis Retrospective discussion of the timline Opportunity for actionable improvement

    Deeper learning on "How the system actually works" 40/71
  6. Create On-Call  - Roles & Rotations (Primary, Secondary, IC)

     - Contextual, Actionable Alerting 63/71
  7. Create On-Call  - Roles & Rotations (Primary, Secondary, IC)

     - Contextual, Actionable Alerting  - Escalation Paths 63/71