Slide 1

Slide 1 text

Human Factors & PostMortems Daniel Schauenberg [email protected] @mrtazz

Slide 2

Slide 2 text

No content

Slide 3

Slide 3 text

We deploy quite a lot

Slide 4

Slide 4 text

MTTR

Slide 5

Slide 5 text

MTTR > MTBF

Slide 6

Slide 6 text

No content

Slide 7

Slide 7 text

No content

Slide 8

Slide 8 text

No content

Slide 9

Slide 9 text

No content

Slide 10

Slide 10 text

No content

Slide 11

Slide 11 text

No content

Slide 12

Slide 12 text

realtalk: things break

Slide 13

Slide 13 text

New View

Slide 14

Slide 14 text

Complex Socio- Technical Systems

Slide 15

Slide 15 text

Erkenntnis und Irrtum fließen aus denselben psychischen Quellen; nur der Erfolg vermag beide zu scheiden. — Ernst Mach, Erkenntnis und Irrtum (p. 116)

Slide 16

Slide 16 text

Things made sense at the time

Slide 17

Slide 17 text

People don't come to work to do a bad job

Slide 18

Slide 18 text

Nietzschean Anxiety

Slide 19

Slide 19 text

So I always get off the hook whatever I do?

Slide 20

Slide 20 text

There is a difference between explaining and excusing human performance. — Sidney Dekker, The Field Guide to Understanding Human Error (p. 196)

Slide 21

Slide 21 text

Blameless Postmortems

Slide 22

Slide 22 text

Open Meeting

Slide 23

Slide 23 text

Everybody is Invited

Slide 24

Slide 24 text

What happened?

Slide 25

Slide 25 text

Timeline

Slide 26

Slide 26 text

Describe the past Don't excuse it away

Slide 27

Slide 27 text

The Facilitator

Slide 28

Slide 28 text

Guide the Discussion

Slide 29

Slide 29 text

Look out for indicators of Old View thinking

Slide 30

Slide 30 text

Counterfactuals

Slide 31

Slide 31 text

- she should have - if he would have - if they just had - you failed to

Slide 32

Slide 32 text

Biases

Slide 33

Slide 33 text

Hindsight Bias Confirmation Bias Outcome Bias

Slide 34

Slide 34 text

there are many more

Slide 35

Slide 35 text

Who is in charge?

Slide 36

Slide 36 text

Etsy School

Slide 37

Slide 37 text

Taught Facilitator Course

Slide 38

Slide 38 text

3 x 90 minutes

Slide 39

Slide 39 text

Remediation Items

Slide 40

Slide 40 text

incorporate learning and takeaway from the meeting

Slide 41

Slide 41 text

No content

Slide 42

Slide 42 text

turn surprises into known factors

Slide 43

Slide 43 text

MORGUE

Slide 44

Slide 44 text

No content

Slide 45

Slide 45 text

No content

Slide 46

Slide 46 text

No content

Slide 47

Slide 47 text

No content

Slide 48

Slide 48 text

No content

Slide 49

Slide 49 text

No content

Slide 50

Slide 50 text

No content

Slide 51

Slide 51 text

https://github.com/ etsy/morgue

Slide 52

Slide 52 text

Near Miss

Slide 53

Slide 53 text

Pre Mortem

Slide 54

Slide 54 text

"Hey all, I just ran rm -rf $DIR/ and since the variable was empty I deleted my whole VM. This would have been bad in production. Don't do that."

Slide 55

Slide 55 text

Architecture Reviews

Slide 56

Slide 56 text

Operability Reviews

Slide 57

Slide 57 text

It is also worth pointing out that the bias towards investigating failures rather than success itself represents a trade-off. — Erik Hollnagel, The ETTO Principle: Efficiency- Thoroughness Trade-Off

Slide 58

Slide 58 text

Investigate Success

Slide 59

Slide 59 text

Why did it work?

Slide 60

Slide 60 text

Human Error is where you stopped looking

Slide 61

Slide 61 text

peakscale.com/ postmortems

Slide 62

Slide 62 text

codeascraft.com etsy.com/codeascraft/talks etsy.com/careers

Slide 63

Slide 63 text

Questions?

Slide 64

Slide 64 text

Human Factors & PostMortems Daniel Schauenberg [email protected] @mrtazz