The three-year journey of the SRE team, which started all by myself - Changing skill requirements and what a team should be - (一人から始めたSREチーム3年の歩み - 求められるスキルの変化とチームのあり方 -)
by VTRyo
Slide 1
The three-year journey of the SRE team, which started all by myself - Changing skill requirements and what a team should be -
VTRyo
Money Forward, Inc - Platform and Reliability Engineering Division
SRE Kaigi 2025
Slide 2
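VTRyo
Worked at an SES firm, an in-house development company, and a startup before joining Money Forward in 2021.
Also involved in another startup.
Recently making a game with Ebitengine.
VTRyo -> no relation whatsoever to VTubers.
Money Forward TECH DAY'24
Developers Boost 2023: talk on growing a career of your own that nobody could have imagined (Best Speaker)
SRE NEXT 2022: "一人から始めるプロダクトSRE" (Product SRE starting from one person)
Book: 「ITエンジニアのための 偶然から始める キャリアプラン」 (Career planning that starts from chance, for IT engineers)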
Slide 3
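Qualified as a craft 🍺 judge last year.
Also hold sauna-related certifications.
I have a lot of hobbies, so come and talk to me at the social gathering.
To Germany, Czechia, and Belgium to drink beer.
To Sri Lanka to eat curry.
All over Japan to eat Ramen Jiro.
I used to make technical doujinshi (self-published tech books) as well.
There is more, but too much to write down, so omitted.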
Slide 4
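Notice
SRE Kaigi 2025 is a rare in-person event, so...
The slides are built mainly around diagrams.
What I explain verbally will go into a blog post at a later date.
Live posts and impressions on X with #srekaigi #room_b are very welcome.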
Slide 5
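What you can take home today
For those involved in organization design:
How the role of SRE changes with the phase of the company or team.
Which skills an SRE should have in which phase. A reference for team design.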
Slide 6
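What you can take home today
For individual SREs:
How the required skill set changes with the phase.
Supposing there are early, expansion, and stable periods, at which scale you fit best as an SRE.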
Slide 7
May I brag first? I am going to.
At our company's awards ceremony, the team won a project award!
Slide 8
Contents
Our SRE initiatives in chronological order
What changed in three years, and what did not
What I was thinking as an individual SRE
Summary
Slide 9
About SRE at Money Forward
Slide 10
Our SRE initiatives in chronological order
I will go through them in order:
Solo SRE period
Team expansion period
Stable period
Slide 11
Solo SRE period
Roadmap: what was planned at the outset may change.
Structure: Embedded SRE.
Skill set: shallow but broad experience; ability to execute; ability to cope with chaos.
Slide 12
Solo SRE period
Slide 13
Solo SRE period
Source: SRE NEXT 2022, "一人から始めるプロダクトSRE" (Product SRE starting from one person)
Slide 14
Solo SRE period
Source: SRE NEXT 2022, "一人から始めるプロダクトSRE" (Product SRE starting from one person)
Wrote about 100 documents in one year and made sure many people could read them.
Slide 15
Solo SRE period
Source: SRE NEXT 2022, "一人から始めるプロダクトSRE" (Product SRE starting from one person)
Slide 16
Solo SRE period
Source: SRE NEXT 2022, "一人から始めるプロダクトSRE" (Product SRE starting from one person)
Slide 17
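Solo SRE period
Mission & Vision: worth creating early.
Usable for hiring and PR too!
Shows who we are and sets expectations.
You end up using it a great deal later on.
Source: SRE NEXT 2022, "一人から始めるプロダクトSRE" (Product SRE starting from one person)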
Slide 18
Solo SRE period
Source: SRE NEXT 2022, "一人から始めるプロダクトSRE" (Product SRE starting from one person)
Slide 19
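Solo SRE period: summary of initiatives
Evangelizing what kind of role SRE is and what it does.
Proactively picking up the parts where teams on the ground are struggling and improving those first (typified by monitoring improvements and infrastructure support).
Building up the roadmap while roughly gauging priorities.
Creating the mission and vision.
Starting hiring activities.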
Slide 20
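Team expansion period
Roadmap: hard to strike a good balance between the roadmap and responding to requests from product teams.
Structure: Enabling SRE.
Skill set: shallow but broad experience; ability to execute; task management; ☆ team building; ☆ evaluating others.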
Slide 21
Team expansion period
To raise coverage, we set up a structure in which each member's targets did not overlap with those of other members.
Slide 22
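Team expansion period
If member A takes time off, progress on the products A is in charge of drops to zero.
Even so, Enabling activities progressed across the board.
I think it was an acceptable trade-off for raising the overall baseline.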
Slide 23
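Team expansion period
Why does a team need a mission? Its importance
The approach and the way of solving a problem change depending on the mission.
Having a mission helps as an axis for decision making.
SRE could take over every issue itself.
SRE could also deliver, as its outcome, a state in which people other than SRE can operate the system.
The outcome changes depending on the world we want to realize.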
Slide 24
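Team expansion period
Issues per product at the time
A: Storage limits of a large-scale database
B: The Amazon EKS replacement had not been done
C: Some teams had not yet been able to start SRE activities
Each product had entirely different issues.
The members worked extremely hard.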
Slide 25
On the other hand, there were issues too
Slide 26
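Team expansion period
Each member's individuality was beginning to stand out, but...
Assignments ended up following each member's strengths.
We had no room to maximize each member's expertise as a team.
We did not have enough people to work with any slack.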
Slide 27
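Team expansion period: summary of initiatives
Each member came to cover multiple products rather than just one.
Embedded reached its limit, so we switched to an Enabling structure.
Responded to issues specific to each product.
Continued building the foundations needed for SRE activities (SLOs, fostering culture, and so on).
Initiatives related to team building began.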
Slide 28
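Team stable period
Roadmap: the roadmap progresses as planned, and requested tasks are also reliably resolved.
Structure: Enabling SRE.
Skill set: deep expertise; maximizing team strength; solving problems from a long-term perspective; coordinating with many stakeholders.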
Slide 29
Team stable period
Respond with the combined strength of the team.
Able to address issues specific to each product.
Slide 30
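Team stable period
Taking on challenges while working on what has to be done
Members take on areas where their experience is shallow.
Experienced members move to review and support.
Requests from product teams (Help wanted tasks) are used to gain experience with tasks in every area.
Where advanced expertise is needed, the person who has it takes the lead.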
Slide 31
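Team stable period
Improving the onboarding flow
* Enlarged on the next slide.
Prompted by going global, we reworked it into a form that is easy to read as a diagram.
Above all, I made it thinking people would enjoy it more this way.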
Slide 32
Team stable period
Slide 33
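Team stable period
Developed a set of tools to avoid repetitive work
To run DB engine upgrade verification as quickly as possible:
CLI: Platinum (wraps Percona Toolkit pt-upgrade)
To run Amazon RDS Blue/Green Deployments safely and as quickly as possible:
CLI: Turquoise (named after blue/green; wraps AWS CLI)
Every member enjoyed the development.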
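The wrapper idea on this slide can be sketched roughly as follows. This is a minimal illustration only: the deck does not show how Platinum or Turquoise are implemented, so every function name here, the `run` helper, and all argument values are hypothetical. The sketch only assembles a `pt-upgrade` invocation in the form used in the Percona Toolkit documentation (two DSNs and a query log) and two `aws rds` Blue/Green Deployment subcommands, and it prints them instead of executing them unless told otherwise.

```python
import shlex
import subprocess
from typing import List


def pt_upgrade_command(current_dsn: str, candidate_dsn: str, query_log: str) -> List[str]:
    """Build a pt-upgrade call that compares query results on two servers."""
    # Follows the documented example form: pt-upgrade h=host1 h=host2 slow.log
    return ["pt-upgrade", current_dsn, candidate_dsn, query_log]


def create_blue_green_command(name: str, source_arn: str, target_engine_version: str) -> List[str]:
    """Build the AWS CLI call that creates an RDS Blue/Green Deployment."""
    return [
        "aws", "rds", "create-blue-green-deployment",
        "--blue-green-deployment-name", name,
        "--source", source_arn,
        "--target-engine-version", target_engine_version,
    ]


def switchover_command(deployment_id: str, timeout_seconds: int = 300) -> List[str]:
    """Build the AWS CLI call that switches over to the green environment."""
    return [
        "aws", "rds", "switchover-blue-green-deployment",
        "--blue-green-deployment-identifier", deployment_id,
        "--switchover-timeout", str(timeout_seconds),
    ]


def run(command: List[str], dry_run: bool = True) -> str:
    """Print the command; execute it only when dry_run is False."""
    rendered = shlex.join(command)
    print(rendered)
    if not dry_run:
        subprocess.run(command, check=True)
    return rendered


if __name__ == "__main__":
    # Placeholder values (assumptions); nothing runs because dry_run defaults to True.
    run(pt_upgrade_command("h=old-db.example", "h=new-db.example", "slow.log"))
    run(create_blue_green_command(
        "example-upgrade",
        "arn:aws:rds:ap-northeast-1:123456789012:db:example-db",
        "8.0.36",
    ))
    run(switchover_command("bgd-EXAMPLE"))
```

Defaulting to a dry run is one way to reflect the "safely" goal on the slide: an operator can review the exact command line before anything touches a database.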
Slide 34
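Team stable period
Trying new initiatives and preparing for what will arise in the future
The large-scale database that keeps growing, and how to handle it going forward.
Operation and monitoring of Kafka (Amazon MSK).
Postmortem role-play training.
And others.
Members divided these up and moved them forward.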
Slide 35
On the other hand, there were issues too
Slide 36
Team stable period
SRE: can advance the SRE roadmap internally.
Product teams: bring matters they want to consult on to SRE as inquiries.
Slide 37
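Team stable period
We could no longer keep track of all release information.
We could no longer pick up on product concerns sensitively.
Gradually, it felt as if we were merely accepting work requests.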
Slide 38
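Team stable period
To maintain the relationship
Hold Production meetings regularly.
A forum for mutual information exchange, inviting those in charge of each product and its developers.
From SRE: a metrics report for a fixed period.
From product teams: release information and concerns.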
Slide 39
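Team stable period
Culture does not take hold just by putting it into words
I understood this in my head, but unless a culture is built together with the mechanisms behind it, it is likely to fade away as people come and go.
If you respond to requests without thinking, the feeling of being a mere worker, which I had feared from the start, slowly begins to eat away at your morale.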
Slide 40
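Team stable period: summary of initiatives
Handling EOL updates. Shortened the verification of five DB engine updates in total through tool development.
SRE activities such as redefining each product's SLOs, postmortem role-play training, and Production meetings.
Cross-cutting initiatives (Kafka, Envoy, unifying the Datadog logs format).
The project award mentioned at the start was for this.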
Slide 41
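What changed in three years, and what did not
Roadmap: the final destination does not change. We do not change it.
Structure: the SRE structure changes. People also come and go.
Skill set: differs with the phase of the team, the phase of the organization, the phase of the product, and so on.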
Slide 42
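What changed in three years, and what did not
Same final destination, evolving structure, changing skills in demand
During rapid change and reorganization in the organization, I wondered for a while whether the mission should change. A conviction you believe in from the heart does not change.
SRE's way of responding can simply evolve to match the shape of the organization.
It is inevitable that the people needed differ by phase.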
Slide 43
What I was thinking as an individual SRE
Slide 44
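What I was thinking as an individual SRE
Which phase suits you?
Of course it is better to be able to adapt to any phase, but...
Sometimes you get lost, not knowing how you can contribute.
It does no harm to think about where you shine the most.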
Slide 45
What I was thinking as an individual SRE
People who have all of these abilities are rare.
Slide 46
What I was thinking as an individual SRE
Gradually, I no longer knew how I should contribute.
I thought that [...] of people with strong technical skills [...].
Slide 47
What I was thinking as an individual SRE
Perhaps they and I can have a mutually positive influence on each other.
Slide 48
Summary
Slide 49
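Summary
When the phase of the organization or product changes, SRE changes too.
Change flexibly to match what you are supporting.
The mission, however, does not need to change.
There is a phase in which you can shine.
Launch? Expansion? Stable? Thinking in finer-grained units is fine too (◯).
Each other's strengths can also produce synergy.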
Slide 50
Slide 50 text
No content