Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Disaster Recovery on AWS

Disaster Recovery on AWS

If you're building SaaS, your platform is your business. Component failure, malicious third parties, and operational mishaps can threaten your data, and materially impact the health and longevity of your company. We'll walk you through the ways in which successful hypergrowth SaaS businesses proactively identify and mitigate these risks.

The Scale Factory

April 04, 2023
Tweet

More Decks by The Scale Factory

Other Decks in Technology

Transcript


  1. DISASTERS_
    Photo: NOAA/Unsplash
    (YOUR DATA CENTRE)

    View Slide


  2. NOT JUST
    ACTS OF GOD_
    (YOUR FINGER)
    Photo: -goldy-/iStock

    View Slide

  3. KEY TERMS FOR
    RECOVERY_
    Recovery Time Objective (RTO)
    Recovery Point Objective (RPO)
    Service Level Agreement (SLA)
    Service Level Objective (SLO)

    View Slide

  4. RECOVERY TIME
    OBJECTIVE_
    The maximum amount of time a service can be degraded.
    Measured from start of incident to restoration of service.
    In short: downtime.

    View Slide

  5. RECOVERY POINT
    OBJECTIVE_
    The maximum amount of time since the last data
    recovery point.
    How much data loss is acceptable.

    View Slide

  6. SERVICE LEVEL
    OBJECTIVE_
    SERVICE LEVEL
    AGREEMENT_
    Defines a service level indicator (metric)
    and its desired value.
    E.g. The service must be accessible
    99.95% of the time within a month.
    An agreement between a service
    provider and their customers.
    Defines a service level objective and the
    compensation if this objective is not met.

    View Slide

  7. LET’S MAKE
    A PLAN!_

    View Slide

  8. WHAT’S IN
    A PLAN?_ Triggering events
    RTO/RPO
    Key personnel
    Recovery steps
    Testing procedure

    View Slide


  9. TRIGGERING
    EVENTS_
    Photo: Max LaRochelle / Unsplash

    View Slide


  10. RTO/RPO_
    Photo: Afif Ramdhasuma / Unsplash

    View Slide


  11. Photo: Vlad Hilitanu / Unsplash
    KEY
    PERSONNEL_

    View Slide


  12. Photo: Bruno Nascimento / Unsplash
    RECOVERY
    STEPS_

    View Slide


  13. Photo: Nguyen Dang Hoang Nhu / Unsplash
    TESTING
    PROCEDURE_

    View Slide

  14. Photo: Colton Sturgeon / Unsplash
    DETERMENING
    RTO AND RPO IS
    A BALANCING ACT_
    What disasters do you need to plan for?
    How likely are they to occur?
    How does this fit in with a larger business
    continuity plan?
    How much is it going to cost?

    View Slide

  15. UNDERSTANDING
    YOUR RESPONSIBILITIES_
    Do you have SLAs with your customers?
    Do you have SLAs with your vendors?

    View Slide

  16. MORE
    POLLS!_

    View Slide

  17. DISASTER
    RECOVERY
    STRATEGIES_ COST/COMPLEXITY
    LOW HIGH
    BACKUP
    AND
    RESTORE
    PILOT
    LIGHT
    WARM
    STANDBY
    MULTI
    SITE
    RPO/RTO
    HOURS SECONDS

    View Slide

  18. BACKUP AND
    RESTORE_
    Cost: $$$$
    Simple
    RTO/RPO of hours
    Recover faster with IaC

    View Slide

  19. PILOT
    LIGHT_
    Cost: $$$$
    “Live” data
    Services need time to start
    RPO/RTO < 1 hour

    View Slide

  20. WARM
    STANDBY_
    Cost: $$$$
    “Live” data
    Services can handle some
    traffic immediately
    Testable
    RPO/RTO of minutes

    View Slide

  21. MULTI-SITE
    ACTIVE/ACTIVE_
    Cost: $$$$
    No failover
    RPO ~ seconds
    Real-time RTO

    View Slide

  22. “NO PLAN OF OPERATIONS REACHES WITH ANY
    CERTAINTY BEYOND THE FIRST ENCOUNTER
    WITH THE ENEMY'S MAIN FORCE
    HELMUTH VON MOLTKE
    DON’T FORGET
    TO TEST_

    View Slide

  23. HOW WE
    CAN HELP_
    BACKUP & DISASTER RECOVERY REVIEW
    You’ll get:
    A hands-on assessment.
    A data catalogue.
    A disaster recovery readiness report.
    OR
    Book in a free surgery to discuss your DR strategy

    View Slide

  24. MORE
    POLLS!_

    View Slide

  25. KEY
    TAKEAWAYS_
    Identify your RTOs and RPOs
    You need a plan!
    Architect for success
    Test your plan!

    View Slide

  26. KEEP IN
    TOUCH_
    http:/
    /www.scalefactory.com/
    https:/
    /github.com/scalefactory
    @scalefactory
    [email protected]

    View Slide