armakuni.com
Site Reliability Engineering
Self Isolated Edition
Billie Thompson
Slide 2
Slide 2 text
2
How are you keeping
it together?
Slide 3
Slide 3 text
armakuni.com
Site Reliability Engineering
Self Isolated Edition
Billie Thompson
Slide 4
Slide 4 text
Introductions
4
● Billie Thompson
● Developer
● Fav. Animal Capybara
PurpleBooth
ArmakuniHQ
Getting to know you, Getting to know all about you
Slide 5
Slide 5 text
5
Slide 6
Slide 6 text
We’ve worked with these people
6
Slide 7
Slide 7 text
7
I invented a game
Slide 8
Slide 8 text
8
Slide 9
Slide 9 text
9
30
Seconds
Fast
Careful
Slide 10
Slide 10 text
10
I can go faster,
but I’ll make more mistakes
Slide 11
Slide 11 text
11
I can go slower,
but I’ll get fewer in
Slide 12
Slide 12 text
12
Middle pace is about the best
Slide 13
Slide 13 text
13
30
Seconds
Slide 14
Slide 14 text
14
1 Sprint
Slide 15
Slide 15 text
15
SRE is an
implementation
of DevOps
Slide 16
Slide 16 text
16
In a few months
SRE will be nothing
more than DevOps 2.0
Slide 17
Slide 17 text
17
I want to talk about
the bits that will be lost
Slide 18
Slide 18 text
18
Great Powers
Slide 19
Slide 19 text
19
The metric you see
is the metric you move
Slide 20
Slide 20 text
20
The North Star Metric is the single
metric that best captures the core
value that your product delivers to
customers.
Slide 21
Slide 21 text
21
Focus on feature delivery
Slide 22
Slide 22 text
22
Focus on feature delivery
I delivered 15 story
points this week!
My teams just don’t have
the feature velocity
Slide 23
Slide 23 text
23
Focus on feature delivery
Focus on stability
I delivered 15 story points
this week!
My teams just don’t have the
feature velocity
Slide 24
Slide 24 text
armakuni.com
How do you combat the great
power of feature delivery
Self Isolated Edition
Billie Thompson
Slide 25
Slide 25 text
25
What attributes does
a countering force
need to have
● Fast feedback cycle
●
●
Slide 26
Slide 26 text
26
What attributes does
a countering force
need to have
● Fast feedback cycle
● Be achievable
●
Slide 27
Slide 27 text
27
What attributes does
a countering force
need to have
● Fast feedback cycle
● Be achievable
● Have monetary impact
Slide 28
Slide 28 text
28
Focus on feature delivery
Focus on stability
I delivered 15 story points
this week!
My teams just don’t have the
feature velocity and we don’t
know
The Error Budget
Slide 29
Slide 29 text
29
Errors are Inevitable
Slide 30
Slide 30 text
30
100% Uptime is Impossible
Slide 31
Slide 31 text
31
100% Uptime is Impossible
and undesirable
Slide 32
Slide 32 text
32
Stability-o-meter
Never down
Always down
Slide 33
Slide 33 text
Our customers would leave
us if we had performance in
this region
33
Stability-o-meter
Never down
Always down
Slide 34
Slide 34 text
If we are here we are taking
as much risk as our
customers will tolerate
34
Stability-o-meter
Never down
Always down
Slide 35
Slide 35 text
We need a buffer between
what we aim for and what
our customers need
35
Stability-o-meter
Never down
Always down
Slide 36
Slide 36 text
This is our error budget we
can spend it how we like
36
Stability-o-meter
Never down
Always down
Slide 37
Slide 37 text
Maybe we release a risky
feature
37
Stability-o-meter
Never down
Always down
Slide 38
Slide 38 text
Maybe we do that DB
migration
38
Stability-o-meter
Never down
Always down
Slide 39
Slide 39 text
Maybe something goes really
wrong and we go over
39
Stability-o-meter
Never down
Always down
Slide 40
Slide 40 text
40
Stability-o-meter
Never down
Always down
SLO
Service Level Objective
An internal team target
Slide 41
Slide 41 text
41
Stability-o-meter
Never down
Always down
SLA
Service Level Agreement
An external promise to
customers backed up with
financial penalties
Slide 42
Slide 42 text
42
Stability-o-meter
Never down
Always down
SLA
An external promise to
customers backed up with
financial penalties
The units of this chart
are the SLI
(Service Level Indicator)
Slide 43
Slide 43 text
https://cre.page.link/art-of-slos-slides
Request / Response Availability
Latency
Quality
Data Processing Coverage
Correctness
Freshness
Throughput
Storage Throughput
Latency
SLI Menu
Slide 44
Slide 44 text
44
A running organisation
is a complex system in balance
Slide 45
Slide 45 text
45
What are the balancing
forces in your organisation?
Slide 46
Slide 46 text
46
Are they explicit?
Slide 47
Slide 47 text
armakuni.com
Thank you!
Slide 48
Slide 48 text
48
Questions?!
Now or at the end
More More More!
I Have Questions?