Slide 1
Slide 1 text
The Path to SRE
@dschenkelman, Director of Engineering @auth0
Slide 2
Slide 2 text
SRE
Slide 3
Slide 3 text
Why?
Slide 4
Slide 4 text
"Reliability is the one feature every customer uses" - an @auth0 SRE
Slide 5
Slide 5 text
Auth0 User Auth0 Customer App
Slide 6
Slide 6 text
Context
Slide 7
Slide 7 text
Focused Investment: Like Security, but for Reliability
Slide 8
Slide 8 text
Scale
Slide 9
Slide 9 text
Research
Slide 10
Slide 10 text
Companies
Slide 11
Slide 11 text
Organizations
Slide 12
Slide 12 text
Style
Slide 13
Slide 13 text
Sponsors
Slide 14
Slide 14 text
Who?
Slide 15
Slide 15 text
Spectrum: Systems, Software
Slide 16
Slide 16 text
The Usual Suspects
Slide 17
Slide 17 text
Teachers
Slide 18
Slide 18 text
Advocates
Slide 19
Slide 19 text
Problem solvers
Slide 20
Slide 20 text
Know the system
Slide 21
Slide 21 text
Experience
Slide 22
Slide 22 text
node.js
Slide 23
Slide 23 text
Educate
Slide 24
Slide 24 text
What we do: SRE identifies, develops, refines, and disseminates the libraries, services, practices, and processes key to system reliability.
Slide 25
Slide 25 text
SRE does not force itself on other teams
Slide 26
Slide 26 text
SRE does not handle all incident response
Slide 27
Slide 27 text
Involvement Spectrum: SRE Run Service, Embedding, Consultancy, Office Hours/Workshops
Slide 28
Slide 28 text
Contacting SRE
Slide 29
Slide 29 text
The brand
Slide 30
Slide 30 text
Logo
Slide 31
Slide 31 text
Office Hours
Slide 32
Slide 32 text
Brown bags
Slide 33
Slide 33 text
Investigations
Slide 34
Slide 34 text
Flexibility
Slide 35
Slide 35 text
Incidents
Slide 36
Slide 36 text
Execute!
Slide 37
Slide 37 text
You are selling TRUST
Slide 38
Slide 38 text
SLOs
Slide 39
Slide 39 text
R2
Slide 40
Slide 40 text
No content
Slide 41
Slide 41 text
No content
Slide 42
Slide 42 text
No content
Slide 43
Slide 43 text
Incident Response
Slide 44
Slide 44 text
Distributed Traces
Slide 45
Slide 45 text
Rate limiting
Slide 46
Slide 46 text
CI/CD
Slide 47
Slide 47 text
Complex Issues
Slide 48
Slide 48 text
Today
Slide 49
Slide 49 text
Org: IAM, DX, Platform, SRE
Slide 50
Slide 50 text
Results
• 5/11 teams doing R2s organically
• > 5x more frequent deploys with < 10x duration
• 80% of critical services with tracing
Slide 51
Slide 51 text
Results (2)
• 5 complex issues solved
• > 99.99% reliability for User Management API
• ~8ms -> ~3ms 99th percentile latency for rate limits
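
To put the 99.99% figure in perspective, here is a minimal sketch of the arithmetic behind an error budget. It assumes a time-based target measured over a 30-day window; the slide does not say how reliability was measured, so the window, the time-based interpretation, and the function name `error_budget_minutes` are illustrative assumptions, not the team's actual method.

```python
def error_budget_minutes(slo: float, window_days: int = 30) -> float:
    """Minutes of allowed unavailability for a time-based target.

    Assumption: the target is a fraction of time (e.g. 0.9999) and the
    window is a fixed number of days. Neither is stated in the talk.
    """
    if not 0.0 < slo < 1.0:
        raise ValueError("slo must be strictly between 0 and 1")
    total_minutes = window_days * 24 * 60  # 43,200 minutes in 30 days
    return total_minutes * (1.0 - slo)


budget = error_budget_minutes(0.9999)
print(f"{budget:.2f} minutes per 30 days")  # prints "4.32 minutes per 30 days"
```

Under these assumptions, a 99.99% target leaves roughly four and a half minutes of unavailability per month. A request-based target would count failed requests instead of minutes.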
Slide 52
Slide 52 text
Success
Slide 53
Slide 53 text
Vision
Subject to change :)
IAM DX Platform SRE PR SRE AR SRE AR SRE OX
Slide 54
Slide 54 text
Thanks @dschenkelman