Tracking Service Infrastructure at Scale
John Arthorne
March 13, 2017
Technology
Talk from SRECon North America 2017 on tracking and automating service infrastructure at Shopify
Transcript
A startup building hosted software for commerce
Facing rapid growth in customers, RPM, devs, deploys

Still growing fast, things on fire all the time
Production Engineering to the rescue!

Still in “double all the things” mode
SRE mindset helped us get ahead of the growth
Concern is more about growth rate than actual numbers
Collective: ownership in common; ability to deliver with high speed; works well in small teams; no specialized roles
Authoritarian: no change without permission; bureaucratic, slow, safe; the norm in massive orgs; highly specialized roles
Shopify 2015, Shopify 2017
Tier | Impact | Needs
1 | Critical | Playbooks, defined SLO, resiliency patterns, DC failover, scheduled load tests, security reviews
2 | Important | On call, monitoring with alerts, metrics instrumentation, dedicated DB, load tested, rolling deploy (preboot)
3 | Useful | >1 owner, deploy automation, CI, standard dev setup, uptime monitor, bugsnag, log retention, backups, SSL
4 | Experiments | Owner, security bugs, resolve outages
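The tier table above could be tracked as a simple per-service checklist. The following is a minimal sketch, not the tooling described in the talk: the names `TIER_NEEDS`, `Service`, and `missing_needs` are hypothetical, and the `cumulative` option (a tier also inheriting the needs of lower-impact tiers) is an assumption the slides do not confirm.

```python
from dataclasses import dataclass, field

# Needs per tier, copied from the tier table in the slides.
TIER_NEEDS = {
    1: ["playbooks", "defined SLO", "resiliency patterns", "DC failover",
        "scheduled load tests", "security reviews"],
    2: ["on call", "monitoring with alerts", "metrics instrumentation",
        "dedicated DB", "load tested", "rolling deploy (preboot)"],
    3: [">1 owner", "deploy automation", "CI", "standard dev setup",
        "uptime monitor", "bugsnag", "log retention", "backups", "SSL"],
    4: ["owner", "security bugs", "resolve outages"],
}


@dataclass
class Service:
    """Hypothetical record for one tracked service."""
    name: str
    tier: int
    satisfied: set = field(default_factory=set)


def missing_needs(service, cumulative=False):
    """Return the needs the service has not yet satisfied.

    By default only the service's own tier is checked. With
    cumulative=True, needs of all lower-impact tiers are checked too
    (an assumption, not something the slides state).
    """
    tiers = range(service.tier, 5) if cumulative else [service.tier]
    return [
        need
        for tier in tiers
        for need in TIER_NEEDS[tier]
        if need not in service.satisfied
    ]
```

For example, `missing_needs(Service("example-app", tier=4, satisfied={"owner"}))` lists the two remaining tier 4 needs, which a tracking system could turn into follow-up tasks for the owning team.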
Office Hours
Keep In Touch