Slide 1

Slide 1 text

Leveling Up Your Monitoring 12/12/12 Triangle DevOps - Ben Scofield - @bscofield

Slide 2

Slide 2 text

No content

Slide 3

Slide 3 text

0 Customer alerts

Slide 4

Slide 4 text

0

Slide 5

Slide 5 text

shouldn’t we do ... something?

Slide 6

Slide 6 text

LEVEL UP

Slide 7

Slide 7 text

1 Site monitoring

Slide 8

Slide 8 text

1

Slide 9

Slide 9 text

what about user flows?

Slide 10

Slide 10 text

+XP

Slide 11

Slide 11 text

what about all of our edge cases?

Slide 12

Slide 12 text

LEVEL UP

Slide 13

Slide 13 text

2 Exception alerts

Slide 14

Slide 14 text

2

Slide 15

Slide 15 text

... my mailbox is full.

Slide 16

Slide 16 text

+XP

Slide 17

Slide 17 text

what if the database blows up?

Slide 18

Slide 18 text

LEVEL UP

Slide 19

Slide 19 text

3 System alerts

Slide 20

Slide 20 text

3

Slide 21

Slide 21 text

how do these incidents affect the business?

Slide 22

Slide 22 text

LEVEL UP

Slide 23

Slide 23 text

4 Business metrics

Slide 24

Slide 24 text

4

Slide 25

Slide 25 text

let’s measure all the things!

Slide 26

Slide 26 text

+XP

Slide 27

Slide 27 text

are we measuring too many of the things?

Slide 28

Slide 28 text

+XP

Slide 29

Slide 29 text

do we need to watch all of the things all of the time?

Slide 30

Slide 30 text

LEVEL UP

Slide 31

Slide 31 text

5 Anomaly alerts

Slide 32

Slide 32 text

5

Slide 33

Slide 33 text

welcome to the company! you’re on call

Slide 34

Slide 34 text

LEVEL UP

Slide 35

Slide 35 text

6 Actionable alerts

Slide 36

Slide 36 text

couldn’t a monkey do this?

Slide 37

Slide 37 text

LEVEL UP

Slide 38

Slide 38 text

7 Automation

Slide 39

Slide 39 text

it doesn’t stop

Slide 40

Slide 40 text

Thanks! 12/12/12 Triangle DevOps - Ben Scofield - @bscofield