Slide 1

Slide 1 text

armakuni.com Site Reliability Engineering Self Isolated Edition Billie Thompson

Slide 2

Slide 2 text

2 How are you keeping it together?

Slide 3

Slide 3 text

armakuni.com Site Reliability Engineering Self Isolated Edition Billie Thompson

Slide 4

Slide 4 text

Introductions 4 ● Billie Thompson ● Developer ● Fav. Animal Capybara PurpleBooth ArmakuniHQ Getting to know you, Getting to know all about you

Slide 5

Slide 5 text

5

Slide 6

Slide 6 text

We’ve worked with these people 6

Slide 7

Slide 7 text

7 I invented a game

Slide 8

Slide 8 text

8

Slide 9

Slide 9 text

9 30 Seconds Fast Careful

Slide 10

Slide 10 text

10 I can go faster, but I’ll make more mistakes

Slide 11

Slide 11 text

11 I can go slower, but I’ll get fewer in

Slide 12

Slide 12 text

12 Middle pace is about the best

Slide 13

Slide 13 text

13 30 Seconds

Slide 14

Slide 14 text

14 1 Sprint

Slide 15

Slide 15 text

15 SRE is an implementation of DevOps

Slide 16

Slide 16 text

16 In a few months SRE will be nothing more than DevOps 2.0

Slide 17

Slide 17 text

17 I want to talk about the bits that will be lost

Slide 18

Slide 18 text

18 Great Powers

Slide 19

Slide 19 text

19 The metric you see is the metric you move

Slide 20

Slide 20 text

20 The North Star Metric is the single metric that best captures the core value that your product delivers to customers.

Slide 21

Slide 21 text

21 Focus on feature delivery

Slide 22

Slide 22 text

22 Focus on feature delivery I delivered 15 story points this week! My teams just don’t have the feature velocity

Slide 23

Slide 23 text

23 Focus on feature delivery Focus on stability I delivered 15 story points this week! My teams just don’t have the feature velocity

Slide 24

Slide 24 text

armakuni.com How do you combat the great power of feature delivery Self Isolated Edition Billie Thompson

Slide 25

Slide 25 text

25 What attributes does a countering force need to have ● Fast feedback cycle ● ●

Slide 26

Slide 26 text

26 What attributes does a countering force need to have ● Fast feedback cycle ● Be achievable ●

Slide 27

Slide 27 text

27 What attributes does a countering force need to have ● Fast feedback cycle ● Be achievable ● Have monetary impact

Slide 28

Slide 28 text

28 Focus on feature delivery Focus on stability I delivered 15 story points this week! My teams just don’t have the feature velocity and we don’t know The Error Budget

Slide 29

Slide 29 text

29 Errors are Inevitable

Slide 30

Slide 30 text

30 100% Uptime is Impossible

Slide 31

Slide 31 text

31 100% Uptime is Impossible and undesirable

Slide 32

Slide 32 text

32 Stability-o-meter Never down Always down

Slide 33

Slide 33 text

Our customers would leave us if we had performance in this region 33 Stability-o-meter Never down Always down

Slide 34

Slide 34 text

If we are here we are taking as much risk as our customers will tolerate 34 Stability-o-meter Never down Always down

Slide 35

Slide 35 text

We need a buffer between what we aim for and what our customers need 35 Stability-o-meter Never down Always down

Slide 36

Slide 36 text

This is our error budget we can spend it how we like 36 Stability-o-meter Never down Always down

Slide 37

Slide 37 text

Maybe we release a risky feature 37 Stability-o-meter Never down Always down

Slide 38

Slide 38 text

Maybe we do that DB migration 38 Stability-o-meter Never down Always down

Slide 39

Slide 39 text

Maybe something goes really wrong and we go over 39 Stability-o-meter Never down Always down

Slide 40

Slide 40 text

40 Stability-o-meter Never down Always down SLO Service Level Objective An internal team target

Slide 41

Slide 41 text

41 Stability-o-meter Never down Always down SLA Service Level Agreement An external promise to customers backed up with financial penalties

Slide 42

Slide 42 text

42 Stability-o-meter Never down Always down SLA An external promise to customers backed up with financial penalties The units of this chart are the SLI (Service Level Indicator)

Slide 43

Slide 43 text

https://cre.page.link/art-of-slos-slides Request / Response Availability Latency Quality Data Processing Coverage Correctness Freshness Throughput Storage Throughput Latency SLI Menu

Slide 44

Slide 44 text

44 A running organisation is a complex system in balance

Slide 45

Slide 45 text

45 What are the balancing forces in your organisation?

Slide 46

Slide 46 text

46 Are they explicit?

Slide 47

Slide 47 text

armakuni.com Thank you!

Slide 48

Slide 48 text

48 Questions?! Now or at the end More More More! I Have Questions?