@gbtekkie
DevOpsDays
Zürich
AGENDA
what is high availability (HA)?
the need for high availability
silos: a possible approach
advantages and disadvantages
AVAILABILITY
Ability to access the system:
✤ retrieve information
✤ alter information
✤ send new data
https://flic.kr/p/dkasBz
THE 9s DANCE
Uptime      Downtime (per year)
90.000 %    36.50 days            one nine
99.000 %    3.65 days             two nines
99.900 %    8.76 hrs              three nines
99.950 %    4 hrs 23 mins
99.990 %    52.56 mins            four nines
99.999 %    5.26 mins             five nines
THE 9s DANCE
Uptime      Downtime (per year)
90.000 %    36.50 days
99.000 %    3.65 days
99.900 %    8.76 hrs
99.950 %    4 hrs 23 mins         Amazon SLA
99.990 %    52.56 mins            four nines
99.999 %    5.26 mins             five nines
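The downtime column follows directly from the uptime percentage. A minimal sketch of that arithmetic, assuming a 365-day year (which is what the table's figures imply); the function name `downtime_per_year` is illustrative and not from the talk:

```python
from datetime import timedelta

# Assumption: a 365-day year, matching the 36.50 days shown for 90 % uptime.
HOURS_PER_YEAR = 365 * 24


def downtime_per_year(uptime_percent: float) -> timedelta:
    """Return the yearly downtime allowed by a given uptime percentage."""
    if not 0.0 <= uptime_percent <= 100.0:
        raise ValueError("uptime_percent must be between 0 and 100")
    unavailable_fraction = (100.0 - uptime_percent) / 100.0
    return timedelta(hours=HOURS_PER_YEAR * unavailable_fraction)


if __name__ == "__main__":
    # Print the allowed downtime for each row of the table.
    for uptime in (90.0, 99.0, 99.9, 99.95, 99.99, 99.999):
        print(f"{uptime:7.3f} %  {downtime_per_year(uptime)}")
```

Each extra nine divides the allowed downtime by ten, which is why moving from three to five nines shrinks the budget from hours to minutes.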
DETECT FAILURES AS THEY OCCUR
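The slide does not say how failures are detected. One common technique is a heartbeat timeout, sketched below under that assumption: every node reports in periodically, and a node whose last report is older than a threshold is treated as failed. The class `HeartbeatMonitor`, its methods and the node names are hypothetical.

```python
import time
from typing import Callable, Dict, List


class HeartbeatMonitor:
    """Flags a node as failed when its last heartbeat is older than `timeout` seconds."""

    def __init__(self, timeout: float, clock: Callable[[], float] = time.monotonic):
        self.timeout = timeout
        self.clock = clock  # injectable so the monitor can be tested without sleeping
        self.last_seen: Dict[str, float] = {}

    def beat(self, node: str) -> None:
        """Record that `node` reported in just now."""
        self.last_seen[node] = self.clock()

    def failed_nodes(self) -> List[str]:
        """Return the nodes whose heartbeat has gone stale, sorted by name."""
        now = self.clock()
        return sorted(
            node for node, seen in self.last_seen.items()
            if now - seen > self.timeout
        )
```

In use, each server would call `beat("web-1")` on a timer, and an alerting job would poll `failed_nodes()`. The timeout is a trade-off: too short produces false alarms, too long delays detection.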
HA BEST PRACTICES
1. no single points of failure
2. stateless application design
3. automate infrastructure for consistency & reliability
4. clever monitoring and alerting
5. geographically distribute your machines
6. keep spare capacity to meet increasing demand
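The first practice can be made concrete with the standard availability arithmetic. The sketch below assumes component failures are independent, which real systems only approximate; the function names are illustrative and not from the talk. Components that all have to work (in series) multiply their availabilities, while redundant copies (in parallel) fail only when every copy fails.

```python
from math import prod
from typing import Iterable


def serial_availability(parts: Iterable[float]) -> float:
    """Availability of a chain in which every component must be up."""
    return prod(parts)


def parallel_availability(single: float, copies: int) -> float:
    """Availability of `copies` redundant components, any one of which is enough."""
    if copies < 1:
        raise ValueError("copies must be at least 1")
    return 1.0 - (1.0 - single) ** copies


if __name__ == "__main__":
    # Two 99 % components in a chain are worse than either one alone...
    print(f"{serial_availability([0.99, 0.99]):.4%}")
    # ...while two redundant 99 % components reach roughly four nines.
    print(f"{parallel_availability(0.99, 2):.4%}")
```

Under these assumptions, a single point of failure caps the whole chain at its own availability, whereas duplicating it adds about two nines.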
"A man’s got to know his limitations." - Dirty Harry
SILOS
TRY UPGRADING TO PHP 7
WHAT IS A SILO?
✤ frontend (SPAs, PWAs, etc)
✤ backend (e.g. PHP services)
✤ data (including cache)
1 silo = full setup of servers that deliver the end-to-end functionality
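The definition above can be modelled as a small data structure. This is only a sketch: the `Silo` class, its field names and the host names in the usage note are hypothetical, chosen to mirror the three layers on the slide. A silo counts as able to serve traffic only when all three layers are present.

```python
from dataclasses import dataclass, field
from typing import Iterable, List


@dataclass
class Silo:
    """One full, self-contained stack: frontend, backend and data layers."""
    name: str
    frontend: List[str] = field(default_factory=list)  # e.g. hosts serving the SPA/PWA
    backend: List[str] = field(default_factory=list)   # e.g. hosts running PHP services
    data: List[str] = field(default_factory=list)      # e.g. database and cache hosts

    def is_end_to_end(self) -> bool:
        """True when every layer has at least one server."""
        return bool(self.frontend and self.backend and self.data)


def serving_silos(silos: Iterable[Silo]) -> List[str]:
    """Names of the silos that can deliver the full functionality on their own."""
    return [silo.name for silo in silos if silo.is_end_to_end()]
```

For example, `serving_silos([Silo("a", ["fe-1"], ["be-1"], ["db-1"]), Silo("b", ["fe-2"], ["be-2"])])` returns `["a"]`, because silo `b` has no data layer and so cannot answer a request end to end.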
GEOGRAPHICAL DISTRIBUTION
LIVE UPGRADES
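The slide gives no procedure, so the following is an assumed one: upgrade one silo at a time, taking it out of rotation first so that the remaining silos keep serving traffic. The function `rolling_upgrade`, the `upgrade` callback and the silo and version names are all hypothetical.

```python
from typing import Callable, Dict, List


def rolling_upgrade(
    versions: Dict[str, str],
    target: str,
    upgrade: Callable[[str], bool],
) -> List[str]:
    """Upgrade silos one by one and return a log of the steps taken.

    `versions` maps silo name to its current version and is updated in place.
    `upgrade` performs the upgrade of one silo and returns True on success.
    """
    in_rotation = set(versions)
    log: List[str] = []
    for silo in sorted(versions):
        if len(in_rotation) < 2:
            raise RuntimeError("need at least one other silo to keep serving traffic")
        in_rotation.discard(silo)  # stop routing new requests to this silo
        log.append(f"drain {silo}")
        if upgrade(silo):
            versions[silo] = target
            log.append(f"upgrade {silo} -> {target}")
        else:
            log.append(f"rollback {silo}")  # silo keeps its previous version
        in_rotation.add(silo)  # put the silo back behind the load balancer
        log.append(f"restore {silo}")
    return log
```

Because at most one silo is out of rotation at any moment, a failed upgrade affects only that silo, and users are served by the others throughout.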
ADVANTAGES
✤ reuse familiar technology
✤ real A/B testing
✤ no BHUF requirements
✤ no disruption => brand loyalty
✤ lower Total Cost of Ownership
✤ simplify scalability
DISADVANTAGES
✤ needs razor-sharp DevOps team
✤ small increase in hardware costs on kick-off
✤ adds complexity to the monitoring layer
✤ reconsider traceability
✤ reproducing and hunting bugs works differently
FURTHER READING
✤ Wikipedia HA page
✤ OpenStack’s HA concepts
✤ Merge Hemo report from FDA
✤ USA Presidential Policy Directive 21
✤ “Beyond Legacy Code” book
✤ TechCrunch’s summary of sites affected by Michael Jackson’s death
✤ Netflix lessons learned after AWS outage
✤ Netflix Chaos Monkey source code
✤ Brian Adler’s talk on “Architecting for High Availability and Multi-Cloud”