Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Ensuring Success During Disaster - SRECon 2015

Ensuring Success During Disaster - SRECon 2015

Video will be at SRECon Program: https://www.usenix.org/conference/srecon15/program/presentation/barth

Surviving a large scale outage requires more than just standing up a few extra servers. Validation and capacity planning can mean the difference between proper mitigation, or just a bunch of wasted effort. This talk will explore how to ensure DR success, gleaned from PagerDuty's production systems.

Doug Barth

March 16, 2015
Tweet

More Decks by Doug Barth

Other Decks in Technology

Transcript

  1. 3/20/15 ENSURING SUCCESS DURING DISASTER DR vs. HA Data DR

    Failover Active/Active Legacy Systems Q&A
  2. 3/20/15 ENSURING SUCCESS DURING DISASTER A plan for surviving rare

    failure events that threaten our ability to continue operating