Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Failing With Grace - Sean O'Connor

Failing With Grace - Sean O'Connor

One of the biggest challenges of building distributed systems is dealing with failure. In this talk we'll cover a number of approaches and tools to help you build systems that deal with failure as gracefully as possible.

PyGotham 2014

August 17, 2014
Tweet

More Decks by PyGotham 2014

Other Decks in Programming

Transcript

  1. How hard can it be? 270,000,000 redirects 26,000,000 shortens 4,000,000

    page crawls 400servers 2data centers each day we handle and we do it with 20+deploys 20engineers EVERY DAY
  2. A distributed system is a software system in which components

    located on networked computers communicate and coordinate their actions by passing messages. The components interact with each other in order to achieve a common goal. Three significant characteristics of distributed systems are: concurrency of components, lack of a global clock, and independent failure of components. - Distributed Systems: Concepts and Design
 via seanoc.co/ds-cite
  3. A distributed system is a software system in which components

    located on networked computers communicate and coordinate their actions by passing messages. The components interact with each other in order to achieve a common goal. Three significant characteristics of distributed systems are: concurrency of components, lack of a global clock, and independent failure of components. - Distributed Systems: Concepts and Design
 via seanoc.co/ds-cite
  4. A distributed system is a software system in which components

    located on networked computers communicate and coordinate their actions by passing messages. The components interact with each other in order to achieve a common goal. Three significant characteristics of distributed systems are: concurrency of components, lack of a global clock, and independent failure of components. - Distributed Systems: Concepts and Design
 via seanoc.co/ds-cite
  5. A distributed system is a software system in which components

    located on networked computers communicate and coordinate their actions by passing messages. The components interact with each other in order to achieve a common goal. Three significant characteristics of distributed systems are: concurrency of components, lack of a global clock, and independent failure of components. - Distributed Systems: Concepts and Design
 via seanoc.co/ds-cite
  6. A distributed system is a software system in which components

    located on networked computers communicate and coordinate their actions by passing messages. The components interact with each other in order to achieve a common goal. Three significant characteristics of distributed systems are: concurrency of components, lack of a global clock, and independent failure of components. - Distributed Systems: Concepts and Design
 via seanoc.co/ds-cite
  7. A distributed system is a software system in which components

    located on networked computers communicate and coordinate their actions by passing messages. The components interact with each other in order to achieve a common goal. Three significant characteristics of distributed systems are: concurrency of components, lack of a global clock, and independent failure of components. - Distributed Systems: Concepts and Design
 via seanoc.co/ds-cite
  8. A distributed system is a software system in which components

    located on networked computers communicate and coordinate their actions by passing messages. The components interact with each other in order to achieve a common goal. Three significant characteristics of distributed systems are: concurrency of components, lack of a global clock, and independent failure of components. - Distributed Systems: Concepts and Design
 via seanoc.co/ds-cite
  9. A distributed system is one in which the failure of

    a computer you didn't even know existed can render your own computer unusable. - Leslie Lamport