SRE ͱԿ͔
• SRE = Site Reliability Engineering
• ىݯʮαʔϏεӡ༻Λ Software Engineer ʹΑ࣮ͬͯݱ͢
Δ͜ͱʯ
(*1)
• ίΞίϯηϓτͱͯ͠ SLI/SLO(*2)
͕͋ΓɺϢʔβ͕ظ͢Δ
αʔϏεϨϕϧΛࢦඪԽ͠ɺػೳ։ൃͱඇػೳ։ൃͷͲͪ
Βʹࢿ͢Δ͔ͷࢦͱ͢Δ
*1 Site Reliability Engineering: https://sre.google/sre-book/introduction/ our Site Reliability Engineering teams focus on hiring software engineers to run our products and to
create systems to accomplish the work that would otherwise be performed, often manually, by sysadmins.
*2 Service Level Indicator / Service Level Objectives ͷ͜ͱ
༨ஊɿ100% ৴པੑඪؒҧͬͨඪ
• 100% is the wrong reliability target(*3)
• 99.9, 99.99% ͱ 9ͷܻΛ૿͢ͱͦͷͨΊͷίετ͕େ͖͔͔͘Δ
• 100% ෆՄೳ = োى͖Δͷɺͱ͍͏લఏΛ͖࣋ͭ
*3 Site Reliability Engineering: https://sre.google/sre-book/introduction/ The error budget stems from the observation that 100% is the wrong reliability target for basically
everything
༨ஊɿ৴པੑ࠷ॏཁͳػೳͷ1ͭ
• Reliability Is the Most Important Feature(*4)
• γεςϜ͕৴པͰ͖ͳ͚ΕɺϢʔβͦΕΛ৴པ͠ͳ͍
• Ϣʔβ͕γεςϜΛ৴པ͠ͳ͚ΕɺΘͳ͍
• γεςϜωοτϫʔΫޮՌʹΑΓ͕ΔͨΊɺϢʔβ͕͍ͳ͍γε
ςϜՁ͕ͳ͍
• ଌఆ߲৻ॏʹબ͠ͳ͍͞
*4 The Site Reliability Workbook: https://sre.google/workbook/reaching-beyond/