Chaos Engineering in 5 Minutes

A very short introduction to Chaos Engineering. Presented in March 2018 at HolidayCheck's Lightning Talks.

Pavlos Ratis

March 29, 2018

Transcript

  1. “Testing shows the presence, not the absence of bugs”
    – Edsger W. Dijkstra
    CC BY-SA 3.0 - https://en.wikiquote.org/wiki/Edsger_W._Dijkstra#Quotes_about_Dijkstra
  2. BUT

  3. Resiliency (noun) - re·sil·ien·cy
    • The ability to become strong, healthy, or successful again after something bad happens
    • The ability of something to return to its original shape after it has been pulled, stretched, pressed, bent, etc.
  4. Fault Injection
    “Fault injection is important to evaluating the dependability of computer systems.”
    – Hsueh, M.C., Tsai, T.K. and Iyer, R.K., 1997. Fault injection techniques and tools. Computer, 30(4), pp.75-82.
    Public Domain - https://en.wikipedia.org/wiki/NASA
  5. “Chaos Engineering is the discipline of experimenting on a distributed system in order to build confidence in the system’s capability to withstand turbulent conditions in production.”
    – Principles of Chaos Engineering (http://principlesofchaos.org/)
  6. Chaos as in…
    • Killing random cloud VM instances or containers
    • Killing random Kubernetes pods
    • Killing MySQL Master or Slaves
    • Introducing extra latency or packet loss between microservices
    • Killing a critical supporting service (e.g. a logging server) while serving traffic
    • “Unplugging” a whole datacenter or availability zone
  7. Process
    • Define a steady state hypothesis (e.g. HTTP 2xx & 3xx responses)
    • Run experiments in the production environment (e.g. destroy a critical service) *
    • Try to disprove the hypothesis
    * Minimize the blast radius
  8. Summary
    Chaos Engineering…
    • Uncovers the weaknesses of your system.
    • Builds confidence in your infrastructure.
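
The process from slide 7 can be sketched as a small, self-contained simulation. This is only an illustration, not something from the talk: the `Service` class, the 0.99 success threshold, the 200-request sample size and the `blast_radius` parameter are all assumptions made for the example, and the "production environment" is a toy in-memory pool of instances rather than real infrastructure.

```python
import random
from dataclasses import dataclass


@dataclass
class Instance:
    name: str
    alive: bool = True


class Service:
    """Toy load-balanced service (hypothetical, for illustration only)."""

    def __init__(self, names, health_checks):
        self.instances = [Instance(n) for n in names]
        # With health checks, the balancer only routes to live instances.
        self.health_checks = health_checks

    def handle_request(self, rng):
        pool = self.instances
        if self.health_checks:
            pool = [i for i in pool if i.alive]
        if not pool:
            return 503
        target = rng.choice(pool)
        return 200 if target.alive else 503


def success_rate(service, rng, requests=200):
    """Fraction of requests answered with an HTTP 2xx or 3xx status."""
    ok = sum(1 for _ in range(requests)
             if 200 <= service.handle_request(rng) < 400)
    return ok / requests


def run_experiment(service, rng, threshold=0.99, blast_radius=1):
    """Kill up to `blast_radius` random instances and re-check the steady state."""
    # 1. Steady state hypothesis: success rate stays at or above the threshold.
    baseline = success_rate(service, rng)
    if baseline < threshold:
        # Do not inject faults into a system that is already unhealthy.
        return {"ran": False, "baseline": baseline}

    # 2. Inject the fault, limited to a small blast radius.
    victims = rng.sample(service.instances, blast_radius)
    for victim in victims:
        victim.alive = False
    try:
        # 3. Try to disprove the hypothesis while the fault is active.
        observed = success_rate(service, rng)
    finally:
        # Always roll the fault back, even if the measurement fails.
        for victim in victims:
            victim.alive = True

    return {"ran": True, "baseline": baseline, "observed": observed,
            "hypothesis_holds": observed >= threshold}


if __name__ == "__main__":
    rng = random.Random(7)
    for checks in (True, False):
        service = Service(["a", "b", "c"], health_checks=checks)
        print("health_checks =", checks, run_experiment(service, rng))
```

In this sketch the service with health checks keeps serving successfully after losing one instance, so the hypothesis survives. Without health checks, roughly a third of requests land on the dead instance, the hypothesis is disproved and a weakness is uncovered. The `try`/`finally` rollback and the small `blast_radius` are one possible reading of "Minimize the blast radius".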