Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
NLP2021 WS2 AI王 〜クイズAI日本一決定戦〜 報告スライド
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
junya-takayama
March 19, 2021
Research
1.1k
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
NLP2021 WS2 AI王 〜クイズAI日本一決定戦〜 報告スライド
言語処理学会第27回年次大会ワークショップ2「AI王 〜クイズAI日本一決定戦〜」
での報告資料です
junya-takayama
March 19, 2021
More Decks by junya-takayama
See All by junya-takayama
[SNLP2021] Prefix-Tuning: Optimizing Continuous Prompts for Generation
tkym1220
0
660
Other Decks in Research
See All in Research
AIを叩き台として、 「検証」から「共創」へと進化するリサーチ
mela_dayo
0
290
Apache Gravitinoで実現する Icebergカタログ統合とアクセスの一元化
matsumooon
0
290
RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent
satai
2
310
2026-01-30-MandSL-textbook-jp-cos-lod
yegusa
1
1.4k
通時的な類似度行列に基づく単語の意味変化の分析
rudorudo11
0
320
定数整数除算・剰余算最適化再考
herumi
1
130
Spatial Active Noise Control Based onSound Field Interpolation Incorporating Physical Constraints
skoyamalab
0
100
正規分布と最適化について
koide3
1
270
機械学習で作った ポケモン対戦bot で 遊ぼう!
fufufukakaka
0
310
CVPR2026論文紹介_VLMにとって良いvision encoderとは何か?Rethinking Model Selection in VLM Through the Lens of Gromov-Wasserstein Distance
kobayashi31
1
140
Ghost in the 7‑Zip: The Shadow of Residential Proxies Creeping into Your Life
nttcom
0
1.2k
計算情報学研究室(数理情報学第7研究室)2026
tomohirokoana
0
570
Featured
See All Featured
How to train your dragon (web standard)
notwaldorf
97
6.7k
Tips & Tricks on How to Get Your First Job In Tech
honzajavorek
1
540
Exploring the relationship between traditional SERPs and Gen AI search
raygrieselhuber
PRO
2
4k
Claude Code どこまでも/ Claude Code Everywhere
nwiizo
65
56k
What the history of the web can teach us about the future of AI
inesmontani
PRO
1
620
Designing Dashboards & Data Visualisations in Web Apps
destraynor
231
55k
RailsConf & Balkan Ruby 2019: The Past, Present, and Future of Rails at GitHub
eileencodes
141
35k
Odyssey Design
rkendrick25
PRO
2
700
Darren the Foodie - Storyboard
khoart
PRO
3
3.4k
Between Models and Reality
mayunak
4
350
Gemini Prompt Engineering: Practical Techniques for Tangible AI Outcomes
mfonobong
2
440
Site-Speed That Sticks
csswizardry
13
1.2k
Transcript
NLP2021 WS2 AIԦ ʙΫΠζAIຊҰܾఆઓʙ େൃදձ γεςϜใࠂ ͓ؾ࣋ͪղઆ 2021/03/19 େࡕେֶେֶӃใՊֶݚڀՊ ߴࢁ
൏
ࣗݾհ Ø໊લ ߴࢁ ൏ Øॴଐ େࡕେֶَ௩ݚڀࣨ % Ø5XJUUFS!ULZN Ø63-IUUQTKVOZBUBLBZBNBHJUIVCJP
ØීஈͷݚڀτϐοΫ ରγεςϜɾࣗવݴޠੜ ØࢀՃͷ͖͔͚ͬ • ࠷ۙΫΠζʹϋϚ͍ͬͯΔ͔Β • ίϯϖͱ͍͏ͷʹग़ͯΈ͔͔ͨͬͨΒ ઈࢍब׆தͰ͢ʂʂ 1
େํ • ϦʔμʔϘʔυΛҙਂ͘؍ͨ݁͠ՌͳΜ͔օͦ͏ͯͨ͠ͷͰ #&35ͱγϯϓϧͳ *3ख๏ͷΞϯαϯϒϧΛ࠾༻ ʢ·͋ײతʹදϕʔεͰdׂղ͚ͦ͏ͳײ͕͢͡Δʣ • ܭࢉࢿݯతʹ #&35ຊདྷͷઃఆతʹೖྗΛ/τʔΫϯʹ͑Δඞཁ͋Γ •
ઌ಄/τʔΫϯͱ͔Ͱͬͯɼղʹඞཁͳ͕ࣝͪΌΜͱೖΔͷ͔ʁ ʢആ༏ͷهࣄͱ͔ɼग़ԋ࡞ΘΓͱޙΖͷํʹॻ͍ͯ͋ΔΑͶʣ • ඞཁͳؚ͕ࣝ·ΕΔΑ͏ʹͪΐͬͱݡ͍ΓํΛ͍ͨ͠ 2
ઌ಄/τʔΫϯͰ͑ΒΕͳͦ͞͏ͳྫ ଉࢠʹആ༏ͷେɺ່ʹঁ༏ͷҍΛ࣋ͭɺʰϥεταϜϥΠʱ ͳͲͷөըͰ͓ͳ͡Έͷຊͷആ༏ͱ͍͑୭Ͱ͠ΐ͏ʁ ਖ਼ղهࣄɿลݠ ˠ ΫΤϦʹԠͯ͡͏·͘هࣄຊจΛཁ͍ͨ͠ʜʜ 3 ʮଉࢠʹആ༏ͷେɺ່ʹঁ༏ͷҍʯʹؔ͢Δॳग़ ʮϥεταϜϥΠʯॳग़ ઌ಄τʔΫϯʢ͍͍ͩͨʣ
ˣ·ͩ·ͩଓ͘
#&35ϕʔεछͱ *3ϕʔεछͷΞϯαϯϒϧʢॏΈ͖ͭฏۉʣ ೖྗσʔλʢڞ௨ʣ γεςϜશମ૾ 4 BERT for ཁ BERT for
લ IR (TF-IDF) *3 $IBSOHSBN จ ީิهࣄू߹ ཁث ॏ Έ ͭ ͖ ฏ ۉ ༧ଌهࣄ BSHNBY
ཁث ϞνϕʔγϣϯจதͷϑϨʔζΛଟؚ͘ΉΑ͏ʹهࣄΛཁ͍ͨ͠ ˠީิهࣄ ! ͷຊจத͔Βɼจ " தͷ୯ޠΛଟ͘ඃ෴͢ΔΑ͏ʹ จΛෳநग़͠ɼ૯୯ޠ # ҎԼͷཁจॻ
̃ ! Λ࡞͢Δ తؔɿ% = '( ∩ ' ̃ * '( ʢͨͩ͠ '( จதͷ୯ޠू߹ɼ' ̃ * ཁจॻதͷ୯ޠू߹ʣ % ྼϞδϡϥੑΛ࣋ͭͨΊɼ্࣮ % ͕࠷େ͖͘ͳΔจΛஞ࣍తʹ ̃ ! ʹՃ͍͑ͯ͘ΞϓϩʔνΛͱΔʢᩦཉ๏ʣ 5
ཁثͷग़ྗྫ จ ଉࢠʹആ༏ͷେɺ່ʹঁ༏ͷҍΛ࣋ͭɺʰϥεταϜϥΠʱͳͲ ͷөըͰ͓ͳ͡Έͷຊͷആ༏ͱ͍͑୭Ͱ͠ΐ͏ʁ ਖ਼ղهࣄʢลݠʣݪจลݠʢΘͨͳ͚Μɺ݄ʣɺຊͷആ༏ɻຊ໊ಉ͡ɻ৽ ׁݝڕপ܊ਆଜʢݱɿڕপࢢʣग़ɻԋܶूஂԁΛܦ͔ͯΒέΠμογϡॴଐɻੈք֤ࠃʹ͓͍ͯөըΛத ৺ʹςϨϏυϥϚɺɺςϨϏίϚʔγϟϧͱ෯͘׆༂͍ͯ͠ΔຊΛද͢Δആ༏ͷҰਓɻDNɺମॏ LHɻͷล྄ҰըՈͱͯ͠׆ಈ͍ͯ͠Δɻ৽ׁݝڕপ܊ਆଜʹͯڞʹڭࢣΛ͍ͯͨ྆͠ͷݩʹੜ·ΕΔɻ ྆ͷసۈͰ༮গظΛೖଜɺकଜʢͱʹڕপࢢʣɺߴాࢢʢ্ӽࢢʣͰա͢͝ɻʜʜʢதུʣʜʜҰ༂શࠃతͳ ਓؾΛ֫ಘɺελʔμϜʹͷ্͕͠Δɻ·ͨɺͦͷࠒ͔ΒՎखͱͯ͠ࠒ·Ͱ׆ಈ͍ͯͨ͠ɻҎ߱ɺɾςϨ
ϏυϥϚͳͲͰ࣍ʑͱେΛԋ͡ɺલ్༸ʑʹݟ͑ͨɺөըॳओԋͱͳΔͣͰ͋ͬͨʰఱͱʢ୯ޠʣ ਖ਼ղهࣄʢลݠʣཁ ลݠʢΘͨͳ͚Μɺ݄ʣɺຊͷആ༏ɻຊ໊ಉ͡ɻ৽ ׁݝڕপ܊ਆଜʢݱɿڕপࢢʣग़ɻຊࠃ֎өըॳग़ԋͱͳͬͨΞϝϦΧөըʰϥεταϜϥΠʱ ʢެ։ʣͰɺลಉͷୈճΞΧσϛʔॿԋஉ༏ͳΒͼʹୈճΰʔϧσϯάϩʔϒॿԋஉ༏ɺ ୈճαλʔϯॿԋஉ༏ʹϊϛωʔτ͞ΕΔߴ͍ධՁΛಘΔɻ·ͨɺөըެ։ͱಉ࣌͡ظʹൃදͨࣗ͠Βͷஶॻ ʰ୭ 8)0".* ʱͰɺ͔ͭͯന݂පͷ࣏ྍதසൟʹड͚ͨ༌݂ʢओʹ݂খ൘༌݂ʣ͕ݪҼͰ$ܕ؊ԌΠϧεʹײછ͠ɺ ʰ໌ͷهԱʱͷࡱӨͦͷ࣏ྍͷ෭࡞༻ʹ·͞Εͳ͕Βߦ͍ͯͨ͜͠ͱΛࠂനɻ࣌Λಉͯ͘͡͠ςϨϏ౦ژͷαε ϖϯευϥϚͷڞԋΛػʹΓ߹ͬͨঁ༏ͷೆՌาͱຊ֨తʹަࡍΛ։࢝͠ɺಉ݄ʹ࠶ࠗɻͳ͓ɺଉࢠͷେ ͱؒతʹͰ͋Δ͕ڞԋྺ͋Δ͕ɺ່ͷҍͱऀۀҎ֎Ͱڞԋͨ͜͠ͱͳ͍ɻ 6
#&35ϕʔεྨث ͋Δબࢶ͕ਖ਼ղ͔Ͳ͏͔ఆ͢Δࡍʹଞͷબࢶߟྀ͍ͤͨ͞ ˠ #&35 4FMG"UUFOUJPO-BZFSͷ֊ܕΞʔΩςΫνϟΛ࠾༻ 7 ࠷ऴ <$-4> ࠷ऴ .BY1PPMJOH
<$-4>จ<4&1>هࣄ<4&1> <$-4>จ<4&1>هࣄ<4&1> ʜ ʜ BERT BERT BERT Self Attention Layer Softmax Linear Linear Linear
*3Ϟσϧ <5'*%'ϕʔε> • จͷ 5'*%'ϕΫτϧͱީิهࣄͷ 5'*%'ϕΫτϧͷ DPTྨࣅ͕ߴ͍هࣄΛਖ਼ղީิͱ͢ΔγϯϓϧͳϞσϧ • ͨͩ͠ *%'ʢίʔύεશମͰͳ͘ʣ֤͝ͱʹ
ͦͷͷީิهࣄશମʢ݅ʣ͔Βܭࢉ <ཧ༝>ީิهࣄू߹ͦͦʢ8JLJQFEJB7FDతʹʣྨࣅ͓ͯ͠Γɼ ίʔύεશମ͔Βܭࢉͨ͠ *%'Λ༻͍Δͱ 5'*%'ϕΫτϧ͕௵Εͦ͏ ʢͳؾ͕͢Δʂʂʣʢະݕূʣ <$IBSBDUFS/HSBNϕʔε> • จͷ /HSBNू߹ͱީิهࣄͷ /HSBNू߹ͷ 4JNQTPO • ୯ޠΑΓจࣈ /HSBNͷํ͕ "DDVSBDZ͕͘Β͍ߴ͔ͬͨ 8
ͦͷଞࡉʑͱͨ͠ʢCVUΫϦςΟΧϧͳʣલॲཧ • <*3 $IBS >ίʔύεதͷස্Ґޠͷ͏ͪʮετοϓϫʔυͳʔʯͱ ࢥͬͨͷΛετοϓϫʔυϦετʹՃɽείΞܭࢉ࣌ʹআ֎ • <*3 ྆ํ >ΤϯςΟςΟ໊͕จதʹؚ·Ε͍ͯͨΒਖ਼ղީิ͔Βআ
ʢʮIPHF GVHB ͱ͋ͱԿͰ͠ΐ͏ʁʯͰ IPHF GVHB ͕બΕ͕ͪ ͩͬͨͨΊʣ • <ཁث>ετοϓϫʔυతؔ ! ͷܭࢉ࣌ʹߟྀ͠ͳ͍ • <ཁث>ɻͰจׂ͢Δ͕ɼ͗͢Δ߹ʢʣ૭෯Λ ୯ޠ ͱͯ͠ɼ૭Λٖࣅతͳจͱ͢Δ • <ཁث>લจ࠷ॳ͔ΒཁจʹՃ 9
%FWͰͷ࣮ݧ݁Ռ <ओཁͳ࣮ݧઃఆ> • #&35Ϟσϧͷ࠷େτʔΫϯɿʢϞσϧڞ௨ʣ • #&35ࣄલֶशࡁΈϞσϧɿcl-tohoku/bert-base-japanese-whole-word-masking • *3 $IBSBDUFS/HSBN ͷ
A/A • ܇࿅σʔλɿ5SBJOͷΈɽΞϯαϯϒϧͷॏΈ %FWͰௐ <࣮ݧ݁Ռ> 10 Ϟσϧ "DDVSBDZ<> %FW %FW *3 5'*%' 64.72 61.79 *3 $IBSCJHSBN 72.66 69.71 *3 $IBSUSJHSBN 74.77 73.82 #&35 લτʔΫϯ 84.62 83.55 #&35 ཁ 88.94 89.67 Ξϯαϯϒϧ 92.05 91.14
ϦʔμʔϘʔυ "DDॱҐҐλΠʢ࣌ʣ Ґ ʢ࣌ʣ ʢʮ·͋ҐҎʹΔΖʯͱ͔ࢥͬͯͨͷʹʜʜʣ 11
ॴײ <ল> • ʮͰ͕͢ʯͷʮͰ͕͢ʯલ෦ͱ͔ฒྻͷྻڍ෦ͱ͔ɼ هࣄݕࡧʹ͍Βͳͦ͏ͳ෦Λؤுͬͯআڈͯ͠ΈΔ͖͔ͩͬͨ • จͱީิهࣄͷؒʹ͏ ϗοϓ͘Β͍ඞཁͦ͏ͳ͕݁ߏ͕͋ͬͨɼ ݟͯݟ͵ৼΓΛͯ͠͠·ͬͨ <ײ>
• ࠷ۙ #&35 #"35ʹͱʹ͔͘ͳΜͰಥͬࠐΉ͜ͱ͕ଟ͔ͬͨͷͰɼ ٱʑʹࣗવݴޠॲཧಓͰటष͍લॲཧΛΕָ͔ͯͬͨ͠Ͱ͢ • ίϯϖָ͍͠Ͱ͢Ͷɽ,BHHMFͱ͔ͬͯΈΑ͏ͱࢥ͍·ͨ͠ ओ࠵ऀͷօ༷ɼָ͍͠ίϯϖΛاըͯͩ͘͠͞Γ͋Γ͕ͱ͏͍͟͝·ͨ͠ʂʂʂ ઈࢍब׆தͰ͢ʂʂ 12