Beyer, Betsy, et al., “Site reliability engineering: How Google runs production systems.”, O'Reilly Media, Inc., 2016. ※1 Figure III-1 ৴པੑ੍ޚͷߏΛ֊Խ͠ɺશମ၆ᛌ
Burgess, Computer Immunology, USENIX LISA 1998. USENIX board: “͍ɺզʑֶऀͩ͠ɺγεςϜཧʹՊֶత ͳ͜ͱݚڀతͳ͜ͱ໘ന͍͜ͱԿͳ͍ɻ”ʢ༁ʣ ※2 ※1 Thomas Limoncelli, “LISA made LISA obsolete (That's a compliment!)”, 2022. https://www.usenix.org/publications/loginonline/lisa- made-lisa-obsolete-thats-compliment ※1
TopologiesʹΑΔιϑτ ΣΞϓϩμΫτͷదԠܕ৫ ઃܭ๏ ※2 Skelton, Matthew, and Manuel Pais, “Team Topologies: Organizing Business and Technology Teams for Fast Flow”, IT Revolution, 2019. ※1 N. Forsgren, H. Jez Humble, and K. Gene, “Accelerate: The science of lean software and devops: Building and scaling high performing technology organizations”, IT Revolution, 2018. ※2 ※1 ΦϒβʔόϏϦςΟ ςϨϝτϦʔʹجͮ͘ ԋ៷ʹΑΔσόοά๏
Τοδέʔεͷ ݟಀ͠ αϯϓϦϯά ※1 Paige Cruz, “99.99% of Your Traces Are (Probably) Trash", SREcon24 Americas, 2024. ※2 Zhang, Lei et al, “The Bene fi t of Hindsight: Tracing Edge-Cases in Distributed Systems.”, NSDI, 2022. ͋Δ͖ঢ়ଶɿোൃੜલޙ͚ͩτϨʔε͢Ε͍͍ͷͰʁ ※2 ※1
through the Life Cycle of Faults in Clouds: Guidelines on Fault Handling”, ISSRE’22. Fig. 2ΑΓసࡌ 1. ϥΠϑαΠΫϧͷ֤ஈ֊Ͱͷॴཁ࣌ؒΛܭଌ͢Δ 2. ֤ஈ֊ͰɺྨࣅͷཁҼͰॴཁ͕࣌ؒେ͖͍ՕॴΛಛఆ͢Δ 3. ࠷େͷՕॴ͔Β༏ઌͯࠜ͠ຊతͳվળΛߦ͏
Lilia et al., “Fail through the Cracks: Cross-System Interaction Failures in Modern Cloud Systems.”, EuroSys 2023. ※1 ݩจࠃࡍձٞͷ EuroSys’23Ͱൃද͞Εͨɻ ஶऀͷҰਓ͕SREconͰൃ ද͍ͯ͠Δɻ ߨԋͰεΩοϓ
Bouskill, “Measuring Reliability Culture to Optimize Tradeoffs: Perspectives from an Anthropologist”, SREcon24 Americas 54%ͷνʔϜ͕ ”Find it hard to identify reliability gaps” ൃදऀਓྨֶͷത࢜߸ͱӸֶͷम࢜߸Λͭɻ ৴པੑ্ͷͨΊͷ۩ମతͳΞΫ γϣϯ͕໌֬Ͱͳ͍ɺ·ͨ༏ઌ ॱҐ͚͕͍͠ͱ͍͏՝
A Comedy in Three Parts”, SREcon19 Asia. @SREcon19 Asia ೝ৺ཧֶऀͷBainbridgeʹ ΑΔ1983ͷจ ※1 L. Bainbridge, “Ironies of Automation”, Automatica, Vol.19, No.6, pp.775–779 1983. ※2 B. Strauch, "Ironies of Automation: Still Unresolved After All These Years". IEEE Transactions on Human-Machine Systems, Vol.48, No.5, pp.419–433 2018. 2018Ͱଓ͘Ͱ͋Δ ※1 ※2 ࣗಈԽγεςϜ͕ਓؒͷೳ ྗෆΛӅṭͯ͠͠·͏৽ ͍͠ൽΛఏࣔ
Woods, Laura Nolan, You've Lost That Process Feeling: Some Lessons from Resilience Engineering, SREcon21 2021. ɾݪࢠྗൃిॴͷΦϖϨʔλʔɺ ੍ޚγεςϜͷΧϯλʔͷҰఆ ϕʔεͷԻͰਖ਼ৗੑΛײ֮తʹཧ ղ͍ͯͨ͠ ɾSLOͷൣғͰਖ਼ৗͰ͋ͬͯɺ ෦తͳҟৗʹ͙͢ʹؾ͚ͮΔ
@SREcon20 Americas Laura Maguire, The Secret Lives of SREs - Controlling the Costs of Coordination across Remote Teams, SREcon20 Americas, 2020. ※1 Laura Maguire, Controlling the Costs of Coordination in Large-scale Distributed Software Systems, Dissertation, The Ohio State University, 2020. ※1 ɾൃදऀIntegrated Systems Engineeringͷത࢜՝ఔݚڀͰ ̐ͭͷ৫ͷ̒̎ݸͷΠϯγσϯτରԠࣄྫΛௐࠪɻ
ͦͷཧ͕௨༻͢Δൣғɻ(boundary condition) ɾwhyɿҼՌ͕ؔͳͥͦ͏ͳͷ͔ʹର͢Δઆ໌ɻ ※1 ”The primary goal of a theory is to answer the questions of how, when, and why, unlike the goal of desciption, which is to answer the question of what". (Bacharach, 1989, pp.498) ※2 ೖࢁ ষӫ, ੈքඪ४ͷܦӦཧ, μΠϠϞϯυࣾ, 2019. ※2 ※1
Sutton, Robert I. and Barry M. Staw. “What Theory is Not.” Administrative Science Quarterly 40, p.371, 1995. ※2 ೖࢁ ষӫ, ੈքඪ४ͷܦӦཧ, μΠϠϞϯυࣾ, 2019. ※̍ ※2