Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Tracking Service Infrastructure at Scale

Tracking Service Infrastructure at Scale

Talk from SRECon North America 2017 on tracking and automating service infrastructure at Shopify

John Arthorne

March 13, 2017
Tweet

More Decks by John Arthorne

Other Decks in Technology

Transcript

  1. Still in “double all the things” mode SRE mindset helped

    us get ahead of the growth Concern is more about growth rate than actual #’s
  2. Collective Ownership in common Ability to deliver with high speed

    Works well in small teams No specialized roles Authoritarian No change without permission Bureaucratic, slow, safe The norm in massive orgs Highly specialized roles Shopify 2015 Shopify 2017
  3. Tier Impact Needs 1 Critical Playbooks, defined SLO, resiliency patterns,

    DC failover, scheduled load tests, security reviews 2 Important On call, monitoring with alerts, metrics instrumentation, dedicated DB, load tested, rolling deploy (preboot) 3 Useful >1 owner, deploy automation, CI, standard dev setup, uptime monitor, bugsnag, log retention, backups, SSL 4 Experiments Owner, Security bugs, resolve outages