Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
toC企業でのデータ活用 (PyData.Okinawa + PythonBeginners沖...
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
takegue
June 15, 2019
Education
1.2k
4
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
toC企業でのデータ活用 (PyData.Okinawa + PythonBeginners沖縄 合同勉強会 2019)
https://pydataokinawa.connpass.com/event/80271/
takegue
June 15, 2019
More Decks by takegue
See All by takegue
不自然言語の自然言語処理: コード補完を支える最新技術
takegue
1
910
つかわれるプラットフォーム 〜デザイン編〜@DPM#2
takegue
2
12k
カルチャーとエンジニアリングをつなぐ データプラットフォーム
takegue
4
6.6k
Rettyにおけるデータ活用について
takegue
0
940
Sparse Overcomplete Word Vector Representations
takegue
0
260
Aligning Sentences from Standard Wikipedia to Simple Wikipedia
takegue
0
250
High-Order Low-Rank Tensors for Semantic Role Labeling
takegue
0
140
Dependency-based empty category detection via phrase structure trees
takegue
0
110
A simple pattern-matching algorithm for recovering empty nodes and their antecedents
takegue
0
160
Other Decks in Education
See All in Education
The Lotus and the Frog
vyadav
0
120
SARA Annual Report 2025-26
sara2023
1
360
偶然のチャンスを掴みに行けるのは君だ!
kotomin_m
2
130
From Days to Minutes: How We Taught an AI to Onboard 50+ Tenants on our AI Features
mfcabrera
0
170
良い塩梅を実現する、AWSネットワーク3分クッキング
masakiokuda
1
260
「機械学習と因果推論」入門 ⑤ 因果効果推定の一般化
masakat0
0
110
[2026前期火5] 論理学(京都大学文学部 前期 第6回)「かつとまたはの規則」
yatabe
0
280
共感から、つくる: 変わり続ける自分と、誰かのための創造
micknerd
1
380
BITCOIN : Les fondamentaux !
rlifchitz
0
170
輻射安全管理系統2.0暨輻防e++學園平台說明會
aecrp
0
640
「機械学習と因果推論」入門 ② 回帰分析から因果分析へ
masakat0
0
710
Catecismo 26 #1 - Aula inaugural
cm_manaus
0
180
Featured
See All Featured
[SF Ruby Conf 2025] Rails X
palkan
2
1.1k
Intergalactic Javascript Robots from Outer Space
tanoku
273
27k
Building AI with AI
inesmontani
PRO
1
1.1k
Into the Great Unknown - MozCon
thekraken
41
2.6k
技術選定の審美眼(2025年版) / Understanding the Spiral of Technologies 2025 edition
twada
PRO
118
120k
Are puppies a ranking factor?
jonoalderson
1
3.6k
The Web Performance Landscape in 2024 [PerfNow 2024]
tammyeverts
12
1.2k
The B2B funnel & how to create a winning content strategy
katarinadahlin
PRO
1
390
Building Adaptive Systems
keathley
44
3.1k
Marketing Yourself as an Engineer | Alaka | Gurzu
gurzu
0
240
How Software Deployment tools have changed in the past 20 years
geshan
0
34k
Code Reviewing Like a Champion
maltzj
528
40k
Transcript
PyData.Okinawa + PythonBeginnersԭೄ ߹ಉษڧձ 2019 फ़ี (@takegue) toCاۀͰͷσʔλ׆༻; αΠΤϯεɺΤϯδχΞϦϯάͦͯ͠σβΠϯɺΞʔτ
8IP"N* फ़ีʢ @takegue ʣ Retty ← म࢜ʢNLP; ػց༁ʣˡ ߴઐ
Core Value: Data Architect σʔλͷՁΛ࠷େԽ͢ΔΈ/ઃܭͷ࣮ݱ ࣥච׆ಈ: ʮ༏ઌֶशʹΑΔਪનจ͔Βͷݟग़͠நग़ʯ ʮͬͯΈΑ͏ʂ ػցֶशʢSotware Designʣʯ ʮࢼֶͯ͠Ϳ ػցֶशೖʯଞ… ߴAIਓࡐ͔ʁ ͦͷଞ: https://shwca.se/takegue
Data Architectͷ͓͠͝ͱ: ྲྀ௨ͱσʔλͷܦࡁݍΛ࡞Δ͜ͱ ΞφϦετ σʔλϚʔτ ϓϩμΫτ σʔλΣΞϋε
σʔλʹؔΘΔδϣϒɺ͍ΖΜͳδϣϒ͕͋Δ • Data Scientists • Data Infrastructure Engineer • ML
Engineer / SysML • BI Engineer / Data Platform Engineer • Data Visualization Engineer / Data Analyst • Data Application Engineer
܅ͲΜͳδϣϒʹͳΓ͍ͨʁʂ
͜ͷઌੜ͖ΔͨΊʹ͜͏ͳΓ͍ͨ ʮ͜ͷձࣾͷ໋ߝԶ͕Ѳ͍ͬͯΔʯʢը૾ུʣ ݴͬͯΈͨ͘ͳ͍ʁ
toCاۀͱͯ͠ͲͷΑ͏ʹσʔλ׆༻ʹऔΓΜͰ͍Δͷ͔ʁ
·ͣαʔϏεհ: Retty
None
https://retty.me/announce/philosophy/
None
σʔλʹؔΘΔλεΫɺ͍ΖΜͳλεΫ͕͋Δ
σʔλʹؔΘΔλεΫɺ͍ΖΜͳλεΫ͕͋Δ ΞϓϦ πʔϧ
σʔλʹؔΘΔλεΫɺ͍ΖΜͳλεΫ͕͋Δ ΞϓϦ πʔϧ ৫نͰͷ εέʔϧϝϦοτ͕ߴ͍ ϢʔβنͰͷ εέʔϧϝϦοτ͕ߴ͍
σʔλʹؔΘΔλεΫɺ͍ΖΜͳλεΫ͕͋Δ நత ۩ମత ΞϓϦ πʔϧ
σʔλʹؔΘΔλεΫɺ͍ΖΜͳλεΫ͕͋Δ நత ۩ମత ΞϓϦ πʔϧ ظˍܧଓత։ൃ͚ʢR&Dʣ ظత/ूதత։ൃ ςί͕ޮ͖͍͢; 3ഒͷੜ࢈ੑˠ
10ഒͷՌʹมΘͬͨΓ͢Δ 10ഒͷੜ࢈ੑͷҧ͍͕ͦͷ··10ഒͷՌͷࠩ
σʔλʹؔΘΔλεΫɺ͍ΖΜͳλεΫ͕͋Δ நత ۩ମత ΞϓϦ πʔϧ Ϩίϝϯυ ݕࡧ ίϯςϯπੜ ଟݴޠରԠ ࠂ
ίϯςϯπࢹ ࢹ (ҟৗݕ) ऩӹ༧ଌ ࣗಈQA ୳ࡧతσʔλੳ ϝτϦΫε։ൃ ύϑΥʔϚϯεੳ Ծઆݕূ
σʔλʹؔΘΔλεΫɺ͍ΖΜͳλεΫ͕͋Δ நత ۩ମత ΞϓϦ πʔϧ Ϩίϝϯυ ίϯςϯπੜ ଟݴޠରԠ ࠂ ίϯςϯπࢹ
ࢹ (ҟৗݕ) ऩӹ༧ଌ ࣗಈQA ୳ࡧతσʔλੳ ϝτϦΫε։ൃ ύϑΥʔϚϯεੳ Ծઆݕূ ݕࡧ
ػցֶशϓϩΤΫτͷҰྫ: ʮ༏ઌֶशʹΑΔਪનจ͔Βͷݟग़͠நग़ʯ from http://www.orsj.or.jp/archive2/or62-11/or62_11_731.pdf
ΩϟονίϐʔΛࣗಈతʹܾΊΕΔΑ͏ʹ͍ͨ͠ తʹ͍͏ͱ ͓ళͷͨΊͷΩϟονίϐʔΛ࡞Δ
͡Δͱ͖ʹԿΛߟ͑Δ͔ʁ •ΤϯδχΞϦϯάγϯΩϯά (ٕज़తʹͳΜͱ͔͢Δ) •ϓϩμΫτγϯΩϯά (Ϣʔεέε/Ձਫ४ΛఆΊΔ) •αΠΤϯεγϯΩϯά (ͷຊ࣭Λ͏)
ϓϩμΫτγϯΩϯά • ࣭ج४ͱͯ͠ʮ৴པʯΛଛͳΘͳ͍͔ʁ • ӕΛ͔ͭͳ͍͜ͱ • ޱޠతͰͳ͍ͳͲʮΒ͠͞ʯΛද͢બੑ͕͋Δ • αʔϏεͷڧΈʹͳΔΑ͏ͳͷ͕·͍͠ •
Ωϟονίϐʔͱͯ͠ͷཱͪҐஔ; ັྗతͳจͰ͋Δ͜ͱ • ͋ͨΓ͞ΘΓͷͳ͍ฏۉతͳจষΛٻΊ͍ͯΔΘ͚Ͱͳ͍ • ϓϩμΫτͷ౷߹ͷੑ͕ߴ͍͜ͱ͕·͍͠ • จࣈ੍ݶͷ (PCεϚϗ)
αΠΤϯεγϯΩϯά • ັྗతʹײ͡ΔจͱԿ͔ʁ ◦ධՁͷઃܭ ▪ આಘྗ͕͋ΔΩϟονίϐʔ → CTR্͕Γͦ͏ʁ ▪ CTR͕͕͋ΔΩϟονίϐʔ
≠ આಘྗʁ • ΩϟονίϐʔΒ͍͠ͱԿ͔ʁ ◦Ωϟονίϐʔͷྲྀெੑ ≠ จͱͯ͠ͷྲྀெੑ ◦จͱͯ͠ଟগ่Ε͍ͯͯྑ͍ʢϦζϜ͕͋Δͱྑ͍ʣ ◦ʮ͜ͷ͓ళͷεύήοςΟඒຯ͍͠Ͱ͢ʯ ◦ʮඒຯͳεύήοςΟΛఏڙʂʯ • Ωϟονίϐʔʮޱίϛʯͷཁͳͷ͔ • ͦͦNLPͱͯ͠Ͳ͜·Ͱ͕Ͱ͖Δൣғͳͷ͔ʁ ◦ ػցతʹྲྀெͳจΛੜ͢Δ͜ͱͰ͖Δ͔ʁ ▪ ػցֶशόοΫάϥϯυͱͯ͠ͷݟ ◦ Ͳ͏͍͏࣮ݧઃఆͩͬͨΒ͏·͘ਐΊΒΕΔ͔ʁ
ΤϯδχΞϦϯάʹ͜ΕΒΛ౿·͑ͯͳΜͱ͔͢Δ • ϓϩμΫτΠϯͷखؒʁ ◦ DBʹಥͬࠐΜͰͪΐͬͱίʔυΛॻ͖͑Δ͚ͩɺ͓खܰ؆୯ʂ ◦ ࢼߦࡨޡͷํʹ͕͖͍࣌ؒ͢͞ •
࠷ߴਫ਼ͷख๏͕ඞཁͳ༁Ͱͳ͍ ◦ ख๏ࣗମʹ৽نੑ͕ͳͯ͘ྑ͍ɻݟͷ৽نੑཉ͍͠ ◦ ࢼͯ͠ධՁͯ͠վળͰ͖Δͷ͕ྑ͍ ◦ ֶशʹ͕͔͔࣌ؒΔେنֶश࠷ॳΒͳ͍ • ݱঢ়͋ΔσʔληοτͷѲ ◦ Ωϟονίϐʔͷจ͔ͳΓ͋Δ (20ສจڧ) ˍ ޱίϛͨ͘͞Μ͋Δʂ ◦ ੜ͢ΔͨΊʹશʹඋ͞Εͨσʔληοτͳ͍ ˍ ظܾઓ (1.0ϱ݄) ◦
ྫ͑ … • ςϯϓϨʔτࢤ ◦ େྔͷൈ͚͕݀͋ΔςϯϓϨʔτΛ༻ҙ͠ ٖࣅతʹେྔͷจΛੜ͠ɺͦ͜ͷத͔Βྑ͍ͷΛબͿ ▪ ΩϟονίϐʔͷݴޠϞσϧͰྲྀெੑධՁͰ͖Δʂ ▪
ΩϟονίϐʔͷςϯϓϨʔτΛ͍͔ʹఏڙͰ͖Δ͔ʁ • શจੜࢤ ◦ GANGAN͍͜͏ͥʂ ◦ ͬͨ͜ͱͳ͍͠ɺָͬͯͯͦ͠͏ • ޱίϛཁࢤ ◦ ޱίϛΛཁ੍ͯ͠ݶ͞ΕͨจࣈͰจΛͭ͘Δ
Ͳ͏͔ͨ͠ʁ • ཁʢநग़ʣࢤͷΞϓϩʔνͱͯ͠ΛϞσϧԽ • ̎ͭͷจʹରͯ͠ɺࣄྫؒͷॱংؔ>= Λֶश͢Δ2ྨثΛߏங͢Δͱͯ͠ϞσϧԽ ɹɹɹ ྑ͍ΩϟονίϐʔΛઈରతͳࢦඪͰܭଌ͢Δͷ͍͕͠ ɹɹɹ
૬ରతͳؔ؆୯ʹఆٛͰ͖Δɻ ɹɹɹ ɹɹɹॱং͕ؔఆٛͰ͖Δͱιʔτ͕Ͱ͖Δʂ f(X1 , X2 ) = F(ϕ(X1 ) − ϕ(X2 ))) = { 1, if X1 ≥ X2 0, otherwise f(“͜ͷ͓ళͷຯḩो͏·͍”, “ࣗՈͷຯḩो͓;͘Ζͷຯʂ”ʣ = “͜ͷ͓ళͷຯḩो͏·͍” =< “ࣗՈͷຯḩो͓;͘Ζͷຯʂ"
Ͳ͏͔ͨ͠ʁ • Ωϟονίϐʔͷจ >= ޱίϛ͔ΒϥϯμϜʹΓग़ͨ͠จɹͰେྔͷٖࣅσʔλΛੜ ɹɹ େྔͷ܇࿅ࣄྫˍग़ྗͷ࣍ݩ2Ͱ͋ΔͨΊɺֶशͰ͖ͦ͏ͳؾ͕͢Δ ΦϯϥΠϯߋ৽͕ՄೳͳϩδεςΟοΫճؼΛྨثʹར༻͢Δ͜ͱͰ ɹɹ σʔλྔʹରͯ͠ͳֶ͘शͰ͖ΔΑ͏ʹ
(sklearn.linear_model.SGDClassifier Λར༻) ɹɹ Ұ؏ੑͷ͋Δσʔλྔ͕ेʹ֬อͰ͖Δͱ NNܥͷػցֶशɺ͍͍ͩͨͲΜͳࣸ૾ͰͰ͖Δ ৄࡉׂѪ (http://www.orsj.or.jp/archive2/or62-11/or62_11_731.pdf)
Ͳ͏͔ͨ͠ʁ ◦ ྑ͍ޱίϛ͔Βྑ͍Ωϟονίϐʔ͕͓ళʹ০ΒΕΔʂ • Ϣʔβͷޱίϛ͕͓ళΛԠԉ͢Δͱ͍͏ɺαʔϏεͷՁ؍ͱϚον ◦ ॊೈੑ͕ߴ͍: ਪϑΣʔζͷࡍͷจͷੜํ๏Λ͢Εɺ ৭ʑͳύλʔϯͰΩϟονίϐʔ͕ੜͰ͖Δ ◦
ղऍੑߴ͍: Ϟσϧ͕ͱͯ୯७ͳͨΊ ▪ ϩδεςΟοΫճؼͷಛྔͷॏΈΛੳ͢Ε ▪ ୯ޠ-unigram: ΩϟονίϐʔʹΘΕ͍͢ಛతͳ୯ޠ͕Θ͔Δ ▪ ୯ޠ-ngram: จମֶ͕शͰ͖ΔɻະޠॲཧΛߦ͏͜ͱͰςϯϓϨʔτ֫ಘͰ͖Δɻ ◦ ੜ͢ΔͷͰͳ͘ ධՁثΛ࡞͍ͬͯΔͷͰɺΦϖϨʔγϣϯʹରͯ͠ੑ͕ߴ͍ ▪ ΫϥυιʔγϯάͰ͋Εɺॳֶऀͷ܇࿅ʹ͑Δ ▪ ՌͷϑΟϧλͱͯ͠ͷԠ༻ߟ͑ΒΕΔ
݁Ռ: Ͳ͏͍͏Ωϟονίϐʔ͕Ͱ͖Δ͔ʁ ࣾͰͷਓखධՁͰ ఆྔతʹਓ͕ؒ࡞ͨ͠ΑΓ༗ҙʹྑ͍Ωϟονίϐʔ͕Ͱ͖Δ͜ͱ͕Θ͔ͬͨ શళฮͰແཧ͕ͩಛఆͷϑΟϧλΛ͔·ͤϓϩμΫτΠϯͰ͖ͨ ◦ (ਓख) ࠷ڧͷ͏ͲΜ ◦ (ػց)
ே͔Β൩·Ͱऄͷྻ͕Ͱ͖Δ໊ళ͏ͲΜ͞Μ ◦ (ػց) ೋށ࢈ͷͦΛళͰค͠ɺṢ͖ͨͯɾଧͪͨͯɾᣐͰͨͯͷʮ̏ͨͯʯͰఏڙ ◦ (ਓख) ͓ംͪΌΜͷՈʹ༡ͼʹདྷͨΑ͏ͳݹຽՈͰ͘ίγͷڧ͍͓ڶഴඒຯ
݁Ռ: Ͳ͏͍͏Ωϟονίϐʔ͕Ͱ͖Δ͔ʁ ࣾͰͷਓखධՁͰ ఆྔతʹਓ͕ؒ࡞ͨ͠ΑΓ༗ҙʹྑ͍Ωϟονίϐʔ͕Ͱ͖Δ͜ͱ͕Θ͔ͬͨ શళฮͰແཧ͕ͩಛఆͷϑΟϧλΛ͔·ͤϓϩμΫτΠϯͰ͖ͨ ◦ (ਓख) ࠷ڧͷ͏ͲΜ ◦ (ػց)
ே͔Β൩·Ͱऄͷྻ͕Ͱ͖Δ໊ళ͏ͲΜ͞Μ ◦ (ػց) ೋށ࢈ͷͦΛళͰค͠ɺṢ͖ͨͯɾଧͪͨͯɾᣐͰͨͯͷʮ̏ͨͯʯͰఏڙ ◦ (ਓख) ͓ംͪΌΜͷՈʹ༡ͼʹདྷͨΑ͏ͳݹຽՈͰ͘ίγͷڧ͍͓ڶഴඒຯ
toCྖҬͰͷσʔλ׆༻ʢػցֶशʣͷݟ ྑ͍σʔλΛγϯϓϧʹͯ͘͠ΕΔ ▪ ྑ͍ઃఆෳͷղܾΛ༩͑ͯ͘ΕΔ (Simple > Easy) ▪
ʢαʔϏεʗۀքʗλεΫʣυϝΠϯಛ༗ͷಛԽ͢Δ͜ͱͰɺΑΓΛγϯϓϧʹͰ͖Δ Ұఆਫ४ͷ୲อʹͱͯۤ࿑͢Δ ◦ ϞσϧʙγεςϜͷ͏·͍ύΠϓϥΠϯͱͯ͠ͷઃܭྗ͕ࢼ͞ΕΔ ◦ ΞΧσϛοΫͰ͋Ε ͻͱͭͣͭͰධՁɾղܾ͢ΔෳͷΛಉ࣌ʹղܾ͢Δඞཁ͕͋Δ ◦ Ωϟονίϐʔͷ߹ ྲྀெੑ / ৴པੑʢղऍੑʣ / ॊೈੑ Λಉ࣌ʹຬͨ͢ඞཁ͕͋ͬͨ ◦ ਓͷؒҧ͍ʹൺֱతڐ༰త͕ͩɺػցతͳؒҧ͍ඇڐ༰త ʢਓΈ͍ͨʹؒҧ͍͑ͨʣ A/BςετͷΑ͏ͳܗͰΠϯϋεͳධՁ͕ར༻Ͱ͖ΔΞυόϯςʔδ ◦ αʔϏεಛ༗ͷ݁Ռʹͳͬͯ͠·͏ͨΊɺଞͷαʔϏεʹ͓͍ͯͷ࠶ݱੑ୲อͰ͖ͳ͍͕…
toCྖҬͰͷσʔλ׆༻ʢػցֶशʣͷΈ ໘ന͍ྖҬͰ͋Δ
toCͱͯ͠ޮՌతʹσʔλΛར༻͢ΔͨΊʹͲ͏͖͔͢ʁ
͍͔ʹσʔλ׆༻Λߦ͏͔ʁ from https://simplystatistics.org/2019/04/17/tukey-design-thinking-and-better-questions/ ղ͚Δͷ қ ղ͖͘ͷ ࣭
͍͔ʹσʔλ׆༻Λߦ͏͔ʁ NNͷ಄ from https://simplystatistics.org/2019/04/17/tukey-design-thinking-and-better-questions/ ղ͚Δͷ қ ղ͖͘ͷ ࣭
͍͔ʹσʔλ׆༻Λߦ͏͔ʁ NNͷ಄ ਅͷGOAL from https://simplystatistics.org/2019/04/17/tukey-design-thinking-and-better-questions/ ղ͚Δͷ қ ղ͖͘ͷ ࣭
͍͔ʹσʔλ׆༻Λߦ͏͔ʁ NNͷ಄ ਅͷGOAL ཧ from https://simplystatistics.org/2019/04/17/tukey-design-thinking-and-better-questions/ ղ͚Δͷ қ ղ͖͘ͷ ࣭
͍͔ʹσʔλ׆༻Λߦ͏͔ʁ NNͷ಄ ਅͷGOAL ཧ ͜͜ʹ͍ΔͭΓʁ from https://simplystatistics.org/2019/04/17/tukey-design-thinking-and-better-questions/ ղ͚Δͷ қ ղ͖͘ͷ
࣭
͍͔ʹσʔλ׆༻Λߦ͏͔ʁ NNͷ಄ ਅͷGOAL ཧ ͜͜ʹ͍ΔͭΓʁ ࣮ࡍ͔ͬͪ͜ʁ from https://simplystatistics.org/2019/04/17/tukey-design-thinking-and-better-questions/ ղ͚Δͷ қ
ղ͖͘ͷ ࣭
͍͔ʹσʔλ׆༻Λߦ͏͔ʁ ղ͚Δͷ қ ղ͖͘ͷ ࣭ NNͷ಄ ਅͷGOAL ཧ ͜͜ʹ͍ΔͭΓʁ ࣮ࡍ͔ͬͪ͜ʁ
from https://simplystatistics.org/2019/04/17/tukey-design-thinking-and-better-questions/ ΪϟοϓΛຒΊΔ ྑ͍͍Λߟ͑Δඞཁ͕͋Δ
ྑ͍σʔλ͕͋ΕγϯϓϧʹͳΔ … ͱ͢Δͱ ྑ͍σʔλΛ͍͔ʹ࡞Δ͔Λߟ͑ΔͨΊʹ ςΫϊϩδʔΤϯδχΞϦϯά͚ͩͰͳ͘ ྑ͍σʔλ͕ಘΒΕΔαΠΫϧΛߟ͑Δͱྑͦ͞͏
Krebs Cycle of Creativity https://jods.mitpress.mit.edu/pub/AgeOfEntanglement
ཧͷʮਓೳʯϧϯό…ʁ https://twitter.com/atochotto/status/1129183985119051776? ref_src=twsrc%5Etfw%7Ctwcamp%5Etweetembed&ref_url=https%3A%2F%2Fpaperusercontent.com%2Fintegrations%2Fembed%2Fiframe%2Ftweet%3Fid%3D1129183985119051776 จԽΛม͑ΒΕΔʮਓೳʯޭ ϧϯόͷத͕ػցֶशͷ༷ͷ༗ແ ۃͲ͏Ͱ͍͍͔͠Εͳ͍
Case Study: Google༁ ਓೳܥͷαʔϏε͕ʮσβΠϯʯ͔ΒʮจԽʯʹӨڹ༩͑ΔྫΛߟͯ͠Έ͍ͨͱࢥ͏ • ͙̏Β͍લ͔ΒΊͪΌΑ͘ͳͬͨɻ • ͪΐ͏Ͳػց༁ͷύϥμΠϜ͕େ͖͘มΘΔλΠϛϯάΛ • ΞΧσϛοΫଆͰݟ͍ͯͨͷͰɺػց༁ͰͷࡐΛ͋͛ͯΈ͍ͨ
Case Study: Google༁ (ௌऺͷօ͞Μʹ࣭) • ༁ීஈ͍͍ͯ͠ΔਓɺखΛ͋͛ͯΈͯཉ͍͠ ◦ தֶߍߴߍͰͷ॓ͷࡍʹར༻͍ͯͨ͠Γ ◦ Θ͔Βͳ͍୯ޠΛࣙॻ͕ΘΓʹࡧҾ͢Δਓ
◦ Πϯλʔωοτ αʔϑΟϯͰӳޠͷχϡʔεΛ༁͢Δ༻్ ◦ શͬͯ͘ͳ͍ਓ
Case Study: Google༁ ωλཁһʁ
Case Study: Google༁ from NLP2017νϡʔτϦΞϧʮθϩ͔Β࢝ΊΔ χϡʔϥϧωοτϫʔΫػց༁ʯ (http://lotus.kuee.kyoto-u.ac.jp/~nakazawa/NLP2017-NMT-Tutorial.pdf) ৽͍͠ൃ໌ or ٕज़తʹઌߦ͍ͯͨ͠ͷ͔ʁ
Krebs Cycle of Creativity https://jods.mitpress.mit.edu/pub/AgeOfEntanglement
Google༁ϓϩμΫτͱͯ͠ͳͥޭ͍ͯ͠Δ͔ʁ Α͏ͳؾ͕͢Δ
ڊਓͨͪͲΜͳࢹΛ͍࣋ͬͯΔͷ͔
ڊਓͨͪͷಈ͖ʹணͯ͠ΈΔ
ڊਓͨͪͷಈ͖ʹணͯ͠ΈΔ - Netflix
ڊਓͨͪͷಈ͖ʹணͯ͠ΈΔ - Netflix: byDevTools
ڊਓͨͪͷಈ͖ʹணͯ͠ΈΔ - Netflix: ίϯςϯπใ
ڊਓͨͪͷಈ͖ʹணͯ͠ΈΔ - Netflix: Πϕϯτใ - ϩέʔϧใ - ࢪࡦ൪߸ใ - ϦϦʔε൪߸ใ
- Ͳͷίϯςϯπ͕ݟΒΕͨ - Ͳͷίϯςϯπ͕දࣔ͞Ε͔ͨ - ͲͷλΠϛϯάͰ ίϯςϯπ͕ಈ͍͔ͨ …
ڊਓͨͪͷಈ͖ʹணͯ͠ΈΔ - AirBnB
ڊਓͨͪͷಈ͖ʹணͯ͠ΈΔ - AirBnB https://www.slideshare.net/mounialalmas/tutorial-on-online-user-engagement-metrics-and-optimization • Ϣʔβߦಈʹؔ͢ΔମܥɺͲͷΑ͏ʹଌఆ͍͔͕ͯ͘͠·ͱ·͍ͬͯΔ
ڊਓͨͪͷಈ͖ʹணͯ͠ΈΔ - AirBnB https://www.kdd.org/kdd2018/accepted-papers/view/real-time-personalization-using-embeddings-for-search-ranking-at-airbnb ϢʔβͷৼΔ͍͔Βݕࡧ݁ՌʢίϯςϯπʣΛ࠷దԽ͢Δ: ηογϣϯϩάΛ Skip-gramϞσϧͰղऍͯ͠ɺίϯςϯπͱͯ͠ͷྨࣅΛߏங → ݕࡧ݁Ռʹө
ݸਓͱͯ͠ΑΓޮՌతͳσʔλ׆༻Λ͍ͯͨ͘͠Ίʹ
υϝΠϯΛཧղ͢Δ͜ͱɺҙຯͷ͋ΔAIΛ࡞Δ͜ͱ࿈ଓత
Data Architectͷ͓͠͝ͱ: ྲྀ௨ͱσʔλͷܦࡁݍΛ࡞Δ͜ͱ ΞφϦετ σʔλϚʔτ ϓϩμΫτ σʔλΣΞϋε
RettyͰͷ׆ಈ͜ΕΒΛͭͳ͛Δ׆ಈ
ϢʔβΛཧղ͢Δ͜ͱɺʹཱͭAIΛ࡞Δ͜ͱ࿈ଓత σʔλͷؔΘΓํͬͱ৭ʑ͍͍͋ͬͯ
·ͱΊ σʔλҰੜ͖ͳ͍
͓ΘΓ