[SCaLE16x] Silo-Based Architectures for High Availability Applications

SILO-BASED ARCHITECTURES FOR HIGH AVAILABILITY APPLICATIONS
Georgiana Gligor / Tekkie Consulting / @gbtekkie

@gbtekkie SCaLE 16X

GEORGIANA GLIGOR
✤ Geek. Mother. Do-er.
✤ on LAMP/LEMP stack since 2003
✤ Architecture / DevOps consultant
✤ RomaniaPHP Organizer
✤ PhD Student
@gbtekkie [email protected]

AGENDA
✤ what is high availability (HA)?
✤ the need for high availability
✤ silos: a possible approach
✤ advantages and disadvantages

https://youtu.be/MQm5BnhTBEQ

The software industry is built around anticipating change.

anticipate vs accommodate

TYPICAL APPLICATION


ADJUSTING
[Diagram labels: Browser, internet, Load balancer, Frontend (three instances), Business Logic, master, slave, reads, writes]

ADJUSTING: redundancy
[Diagram labels: internet, Load balancer, slave, reads, writes]

ADJUSTING: resilience
[Diagram labels: internet, Load balancer, slave, reads, writes]

TYPICAL LAYERING

APPLICATION ARCHITECTURE

HIGH AVAILABILITY

AVAILABILITY
Ability to access the system:
✤ retrieve information
✤ alter information
✤ send new data

https://flic.kr/p/dkasBz

THE 9s DANCE
Uptime      Downtime (per year)
90.000 %    36.50 days           one nine
99.000 %    3.65 days            two nines
99.900 %    8.76 hrs             three nines
99.950 %    4 hrs 23 mins
99.990 %    52.56 mins           four nines
99.999 %    5.26 mins            five nines

THE 9s DANCE
Uptime      Downtime (per year)
90.000 %    36.50 days
99.000 %    3.65 days
99.900 %    8.76 hrs
99.950 %    4 hrs 23 mins        Amazon SLA
99.990 %    52.56 mins           four nines
99.999 %    5.26 mins            five nines
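The downtime column follows directly from the uptime percentage. A minimal sketch of that arithmetic, assuming a 365-day year (the function name `downtime_minutes_per_year` is made up for this illustration and does not come from the talk):

```python
def downtime_minutes_per_year(uptime_percent: float, days_per_year: float = 365.0) -> float:
    """Return the downtime budget per year, in minutes, for a given uptime percentage."""
    if not 0.0 <= uptime_percent <= 100.0:
        raise ValueError("uptime_percent must be between 0 and 100")
    unavailable_fraction = 1.0 - uptime_percent / 100.0
    return unavailable_fraction * days_per_year * 24 * 60


# Print the downtime budget for the uptime levels listed on the slide.
for pct in (90.0, 99.0, 99.9, 99.95, 99.99, 99.999):
    minutes = downtime_minutes_per_year(pct)
    print(f"{pct:7.3f} %  {minutes:10.2f} mins  ({minutes / 60:8.2f} hrs)")
```

Dividing the result by 60 or by 1440 gives the hours and days shown in the table.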

IMPACT
$ 40 / sec * 3600 = $ 144,000 / hour
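The same multiplication can be applied to any outage length. A small sketch, assuming the slide's figure of $40 lost per second of downtime (the constant and function names are hypothetical, chosen only for this example):

```python
LOSS_PER_SECOND_USD = 40  # figure from the IMPACT slide; adjust for your own business


def downtime_cost_usd(seconds_down: float, loss_per_second: float = LOSS_PER_SECOND_USD) -> float:
    """Estimate the money lost during an outage of the given length."""
    if seconds_down < 0:
        raise ValueError("seconds_down must not be negative")
    return seconds_down * loss_per_second


# One hour of downtime, as on the slide.
print(downtime_cost_usd(3600))

# A full year's downtime budget at four nines (52.56 minutes).
print(downtime_cost_usd(52.56 * 60))
```

This is a linear model. Real outages may cost more, for example through lost customer trust, which this sketch does not try to capture.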

USER BEHAVIOUR
                             amazon        facebook      youtube
Alexa Rank                   6             3             2
daily time on site           12:07 mins    19:27 mins    23:44 mins
daily pageviews / visitor    11.83         9.38          12.84
bounce rate                  21 %          29 %          33 %

HIGH AVAILABILITY TRIANGLE
cost / complexity / risk

DOWNTIME
scheduled
‣ you
unscheduled
‣ you
‣ others

HAPPENS TO THE BEST

MICHAEL JACKSON

H.A. SYSTEM CHARACTERISTICS

https://flic.kr/p/quMmFw NO SINGLE POINT OF FAILURE

https://flic.kr/p/RLKw8z RELIABLE CROSSOVER

DETECT FAILURES AS THEY OCCUR

HA BEST PRACTICES
1. no single points of failure
2. stateless application design
3. automate infrastructure for consistency & reliability
4. clever monitoring and alerting
5. geographically distribute your machines
6. keep spare capacity to meet increasing demand
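Detecting failures as they occur, and the monitoring practice in the list above, can be approximated with a counter of consecutive failed health checks. The sketch below is one possible approach, not the speaker's implementation. The `FailureDetector` class, the `probe` callable and the threshold of 3 are all assumptions made for illustration. In practice `probe` would be something like an HTTP health-check request.

```python
from typing import Callable, Dict


class FailureDetector:
    """Mark a target as down after `threshold` consecutive failed probes."""

    def __init__(self, probe: Callable[[str], bool], threshold: int = 3) -> None:
        self.probe = probe          # returns True when the target answers correctly
        self.threshold = threshold  # consecutive failures tolerated before "down"
        self.failures: Dict[str, int] = {}

    def check(self, target: str) -> bool:
        """Probe the target once and return True while it is still considered up."""
        if self.probe(target):
            self.failures[target] = 0  # any success resets the counter
        else:
            self.failures[target] = self.failures.get(target, 0) + 1
        return self.failures[target] < self.threshold


# Usage with a fake probe that always fails for the hypothetical host "silo-b".
detector = FailureDetector(probe=lambda target: target != "silo-b")
for _ in range(3):
    print("silo-a up:", detector.check("silo-a"), "| silo-b up:", detector.check("silo-b"))
```

Requiring several consecutive failures trades a little detection latency for fewer false alarms caused by a single dropped request.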

“A man’s got to know his limitations.” - Dirty Harry

TRY UPGRADE TO PHP7

WHAT IS A SILO?
1 silo = full setup of servers that deliver the end-to-end functionality
✤ frontend (SPAs, PWAs, etc)
✤ backend (e.g. PHP services)
✤ data (including cache)
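To make the definition concrete, here is a rough sketch of how traffic might be spread across several silos, each carrying its own frontend, backend and data tier. None of this comes from the talk: the `Silo` fields, the host names and the hash-based `pick_silo` routing are assumptions chosen for illustration.

```python
import hashlib
from dataclasses import dataclass
from typing import Sequence


@dataclass
class Silo:
    """One full, independent stack that can serve a request end to end."""
    name: str
    frontend: str   # hypothetical host serving the SPA / PWA
    backend: str    # hypothetical host running the backend services
    data: str       # hypothetical host for the database and cache
    healthy: bool = True


def pick_silo(user_id: str, silos: Sequence[Silo]) -> Silo:
    """Route a user to one healthy silo, consistently for the same user id."""
    healthy = [s for s in silos if s.healthy]
    if not healthy:
        raise RuntimeError("no healthy silo available")
    # hashlib gives a hash that is stable across processes, unlike built-in hash().
    digest = hashlib.sha256(user_id.encode("utf-8")).digest()
    return healthy[int.from_bytes(digest[:8], "big") % len(healthy)]


silos = [
    Silo("silo-a", "fe-a.example", "be-a.example", "db-a.example"),
    Silo("silo-b", "fe-b.example", "be-b.example", "db-b.example"),
]
print(pick_silo("user-42", silos).name)

silos[0].healthy = False  # simulate an outage or a live upgrade of silo-a
print(pick_silo("user-42", silos).name)  # only silo-b is left to serve traffic
```

Taking a silo out of rotation this way moves its users to the remaining silos. That is acceptable only if the application is stateless or the data tiers are kept in sync, and this sketch does not address either.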

WHAT IS A SILO?

SILO-BASED ARCHITECTURE

MULTIPLE CACHES

A/B TESTING

GEOGRAPHICAL DISTRIBUTION

LIVE UPGRADES

ADVANTAGES
✤ reuse familiar technology
✤ real A/B testing
✤ no BHUF requirements
✤ no disruption => brand loyalty
✤ lower Total Cost of Ownership
✤ simplify scalability

DISADVANTAGES
✤ needs razor-sharp DevOps team
✤ small increase in hardware costs on kick-off
✤ adds complexity to the monitoring layer
✤ reconsider traceability
✤ different bug reproducing and hunting

TAKEAWAYS
✤ build situational awareness with clever monitoring
✤ automate outage detection
✤ powerful A/B testing

FURTHER READING
✤ Wikipedia HA page
✤ OpenStack’s HA concepts
✤ Merge Hemo report from FDA
✤ USA Presidential Policy Directive 21
✤ “Beyond Legacy Code” book
✤ TechCrunch’s summary of sites affected by Michael Jackson’s death
✤ Netflix lessons learned after AWS outage
✤ Netflix Chaos Monkey source code
✤ Brian Adler’s talk on “Architecting for High Availability and Multi-Cloud”

Questions?
Efficient architecture. Performance oriented. AI enhanced.
[email protected]
