機械学習とのつきあいかた / How to get involved with Machine Learning @ Wantedly

機械学習とのつきあいかた / How to get involved with Machine Learning @ Wantedly

2019年新人研修で使った資料です。

以下の内容を話した全体的にはポエムです。

1. Wantedlyの機械学習
2. MLと組織
3. MLとデータ
4. MLプロジェクの進め方

Cecfa78bd810db9409151a7ee9aaa346?s=128

Makoto Tanji

April 24, 2019
Tweet

Transcript

  1. 2.

    ©2019 Wantedly, Inc. • ML Engineer at Wantedly, Inc. •

    Wantedly Visit 2015 - 2016 • Client Growth, Scout • Wantedly People 2016 - current • Server side • Machine Learning • Data Analysis ࣗݾ঺հ Who am I? Makoto Tanji
  2. 4.

    ©2019 Wantedly, Inc. 1. ςʔϚ 2. Wantedlyͷػցֶश 3. Take home

    messages 1. MLͱ૊৫ 2. MLͱσʔλ 3. MLϓϩδΣΫͷਐΊํ 4. ·ͱΊ Contents
  3. 9.

    ©2019 Wantedly, Inc. Wantedlyͷػցֶश׆༻ࣄྫ  7JTJU w ืूͷਪન w 1VTI௨஌ͷύʔιφϥΠζ

    w εΧ΢τϑΟϧλͷվળ w ʜ  1FPQMF w ໊ࢗը૾ͷ0$3ͱ෼ྨ w ਓͷਪન w هࣄͷਪનɺ1VTIͷ։෧཰༧ଌ w ༷ʑͳݻ༗දݱநग़ w ʢػցֶशͷϦϙδτϦ͸.-λά͕͍ͭͯ·͢ʣ w IUUQTHJUIVCDPNXBOUFEMZ VUG&$RNMUZQFMBOHVBHF
  4. 14.

    ©2019 Wantedly, Inc. ֶशࡁΈͷAPIύλʔϯ  ϚΠΫϩαʔϏεͷαʔόͷ"1*ͱͯ͠ఏڙ 1. OCRʢจࣈೝࣝʣAPIɺςΩετ͔Βͷநग़APIɺಉҰੑ൑ఆAPIͳͲ୔ࢁ 2. લ΋ֶͬͯशͨ͠ϞσϧΛಈ͔ͯ݁͠ՌΛฦ͢

    3. e.g. Peopleͷ໊ࢗೝࣝؔ࿈API͸͜ͷྫ͕ଟ͍  ߟ͑Δ͜ͱ • ਫ਼౓͕஌Βͳ͍͏ͪʹམ͍ͪͯͳ͍͔ • ϝτϦΫεΛຖ೔औΔɻ৽͘͠APIΛ࡞Δͱ͖͸νΣοΫϦετʹೖΕΔ • ίετ • CPU཯଎ʹͳ͍ͬͯΔ͜ͱ͕ଟ͘ɺkubernetesͷϦιʔεΛେྔʹ࢖ͬͯͳ͍͔
  5. 15.

    ©2019 Wantedly, Inc. ਪનύλʔϯ  લ΋͓ͬͯ͢͢ΊείΞΛܭࢉɾఏڙ 1. Ϣʔβ͕ϖʔδʹΞΫηεͨ͠ͱ͖ʹҰॠͰΦεεϝͷืूΛग़͍ͨ͠ 2. Ϣʔβͷաڈͷߦಈ͔Βࣄલʹܭࢉ͓ͯ͘͠

    3. Ϣʔβ×ืूͷ݁ՌΛσʔλετΞʹอଘ  ߟ͑Δ͜ͱ • ʮϢʔβ×ืूʯ͕͸͍ΔσʔλετΞ • ຖ೔ͷֶशJob͕ࢭ·͍ͬͯͳ͍͔ʁ • Job͕ଟஈʹͳΔͱಛʹΘ͔Γʹ͘͘ͳΔˠData Pipelineͷ࿩
  6. 16.

    ©2019 Wantedly, Inc. ͦͷଞ  σʔλͷਖ਼نԽ 1. ࡶଟͳ৘ใΛਖ਼نԽɾಉҰੑͳͲΛ൑ఆ͢Δ 2. ࣙॻΛ࡞Δ࢓ࣄ

    3. Ϣʔβʹ͸ݟ͑ͳ͍͚Ͳ಺෦Ͱݡ͘ͳ͍ͬͯΔ  #JH2VFSZ.- 1. ಺෦ͷϩδοΫͰ࢖͍ͬͯΔ 2. ࣮͸࢖ΘΕͯ·͢
  7. 22.

    ©2019 Wantedly, Inc. ૊৫ͷ࿩  Ϣʔβ·Ͱͷڑ཭͕͍ۙ 1. ߦͬͨվળ͕μΠϨΫτʹϢʔβ·Ͱಧ͘  ϓϩμΫτʹՁ஋ͷ͋ΔվળΛߦ͍΍͍͢

    1. ϓϩμΫτͷ޲͔͍ͬͯΔํ޲Λڞ༗͍ͯ͠ΔͷͰᴥᴪ͕ى͜ΓͮΒ͍ 2. ඞཁͰ͋Ε͹ϓϩμΫτͷϏδωεϩδοΫ·ͰखΛೖΕΒΕΔ  վળ͚ͩͰͳ͘৽͍͠ػೳΛࣗવͱٞ࿦͢Δ 1. ։ൃνʔϜͷҰһͳͷͰ਺஋తͳվળ͚ͩͰͳ͘ʮ͜͏͋Δ΂͖ʯํ޲Λٞ࿦ͯ͠ਐΊΒΕΔ
  8. 23.

    ©2019 Wantedly, Inc. ૊৫ͷ࿩ ଞͷ૊৫ͷ͋Γํ σʔλαΠΤϯςΟετ ΤϯδχΞ σβΠφ Ϛωʔδϟ શࣾԣஅ

    MLνʔϜ ϓϩμΫτνʔϜ શࣾԣஅ σʔλ෼ੳνʔϜ MLΤϯδχΞ ґཔ ݁Ռ
  9. 24.

    ©2019 Wantedly, Inc. ૊৫ͷ࿩ ଞͷ։ൃνʔϜͱ͸Ͳ͏ؔΘΔͷ͔ʁ 1. ։ൃελΠϧ: APIΛఆٛͯ͠WEBଆͱMLଆͰฒߦͯ͠։ൃ 1. e.g.

    Ϣʔβʹෳ਺ͷهࣄΛฦ͍ͨ͠ɻهࣄͷϦετ͸WEBଆͰऔಘ͢ΔɻMLͷAPIΛݺͼग़ͯ͠ॱংΛܾఆ͢Δ 2. ؔΘΓํ 1. WEBଆ: લ΋ͬͯAPI spec͚ܾͩΊ͓ͯ͘ͱɺSTUBͯ͠MLଆΛ଴ͨͣʹ։ൃՄೳɻ 2. Πϯϑϥ: service-in͢ΔલʹɺෛՙͷϝτϦΫεͱ࠷ѱࢮΜͰ΋͍͍API͔ΫϦςΟΧϧͳAPI͔Λ͢Γ߹Θͤ 3. ίϛϡχέʔγϣϯ 1. ࣄલʹʮਫ਼౓ͷظ଴஋ʯΛ͢Γ߹ΘͤΔͱ͍͍ɻ 2. ௒ݡ͍API͕αΫοͱͰ͖Δ͜ͱ͸΄΅ແ͍ɻ99.9ˋͷਫ਼౓͕ඞཁͳཁ݅ͳͷ͔ɺݟͤํΛڞ༗͢Δ
  10. 31.

    ©2019 Wantedly, Inc. ؾΛ͚ͭΔ͜ͱ w σʔλपΓͰ࣮ࡍ͋ͬͨ໰୊ 1. RPAͷ׆༻Ͱҙຯͷͳ͍େྔͷΞΫηε͕͋ͬͨͨΊɺݟ͔͚ͷฏۉར༻਺্͕͕ͬ ͨɻίϯόʔδϣϯΛ൐Θͳ͍ϩά͕૿͑ͯ਺஋͕มԽͨ͠ 2.

    ৽͍͠ػೳͷϦϦʔεͰςʔϒϧͷΧϥϜʹ৽͍͠஋͕ೖΔΑ͏ʹͳͬͨɻલ΋ͬͯ ݕ஌ͨͨ͠ΊϦϦʔεલʹमਖ਼Ͱ͖͕ͨɺ஌Βͳ͔ͬͨΒϞσϧ͕յΕ͍͔ͯͨ΋
  11. 38.

    ©2019 Wantedly, Inc. Machine Learning Best Practice • ϧʔϧ1: ػցֶशΛ࢖Θͳ͍Ͱ࢝ΊΒΕͳ͍͔ߟ͑Δ

    • ϧʔϧ2: ઃܭͯ͠ࢦඪΛ࡞ΔʢݱࡏͷγεςϜΛཧղ͔ͯ͠ΒࢦඪΛ࡞Δʣ • ϧʔϧ3: ϩδοΫ͕ෳࡶʹͳͬͬͨΒػցֶशΛબ୒͢Δ ref: https://developers.google.com/machine-learning/guides/rules-of-ml/
  12. 40.

    ©2019 Wantedly, Inc. ϓϩδΣΫτͷਐΊํ ख୳Γظ ϧʔϧ૿Ճظ ػցֶश • ڞ௨ͷ஌ਓ਺͕ଟ͍ਓʢΛਪન͢Δʣ •

    ϝτϦΫε=ϦΫΤετ਺ͱঝೝ཰ • ڞ௨ͷ஌ਓ਺͕ଟ͍ਓ • ಉ͡ձࣾͷਓ • ڞ௨ͷ໊ࢗΛ͍࣋ͬͯΔ਺͕ଟ͍ਓ • ಉ࣌ʹεΩϟϯ͞ΕͨਓͰڞ௨ͷͭ ͳ͕Γ͕͋Δ • ͜ͷ૊Έ߹Θͤ • ɾɾɾ • Deep Learningϕʔεͷਪન • + ABςετ
  13. 44.

    ©2019 Wantedly, Inc. References • Rules of Machine Learning: Best

    Practices for ML Engineering • https://developers.google.com/machine-learning/guides/rules-of-ml/ • Wantedly ͷػցֶशϓϩμΫτ։ൃΛࢧ͑Δػցֶशج൫ / #rejectcon2018 • https://speakerdeck.com/south37/number-rejectcon2018 Photo Credit • https://unsplash.com/photos/PMwu9gfCSbw