Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Pathologies of Neural Models Make Interpretations Difficult
Search
Yasufumi Taniguchi
December 09, 2018
Research
1
1.7k
Pathologies of Neural Models Make Interpretations Difficult
Yasufumi Taniguchi
December 09, 2018
Tweet
Share
More Decks by Yasufumi Taniguchi
See All by Yasufumi Taniguchi
AllenNLPを使った開発
yasufumy
0
2.1k
Making Neural QA as Simple as Possible but not Simpler
yasufumy
0
84
Other Decks in Research
See All in Research
一般化ランダムフォレストの理論と統計的因果推論への応用
tomoshige_n
10
1.8k
Sosiaalisen median katsaus 02/2024
hponka
0
2.6k
Ground Metric Learning with applications in genomics
gpeyre
0
360
株式会社リクルートホールディングス 企業分析
frandle256
0
130
インタビューだけじゃない!ユーザーに共感しユーザーの目👀を手に入れるためのインプット
moco1013
0
210
Discovering Universal Geometry in Embeddings with ICA
momoseoyama
1
340
200名の育児中男性の声 「僕たちは、キャリアとライフをトレードオフにしたくない」共働き3.0世代の男性が 本当に求める働き方とは【ワーキングペアレンツの転職意識調査2023|XTalent株式会社】
xtalent
0
470
Experiments on ROP Attack with Various Instruction Set Architectures
yumulab
0
320
第14回対話システムシンポジウム EMNLP 2023 参加報告
atsumoto
0
150
床面圧力センサ開発における感圧導電シート分離方式の検討 / WISS2023
yumulab
0
270
10-ot-generic-bio.pdf
gpeyre
0
130
Alternative Photographic Processes Reimagined: The Role of Digital Technology in Revitalizing Classic Printing Techniques【SIGGRAPH Asia 2023】
toremolo72
0
430
Featured
See All Featured
Thoughts on Productivity
jonyablonski
58
3.8k
Typedesign – Prime Four
hannesfritz
36
2.1k
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
19
1.7k
Gamification - CAS2011
davidbonilla
76
4.6k
10 Git Anti Patterns You Should be Aware of
lemiorhan
648
58k
Easily Structure & Communicate Ideas using Wireframe
afnizarnur
187
16k
Rails Girls Zürich Keynote
gr2m
91
13k
Making the Leap to Tech Lead
cromwellryan
124
8.5k
Building Better People: How to give real-time feedback that sticks.
wjessup
355
18k
Embracing the Ebb and Flow
colly
80
4.1k
The Psychology of Web Performance [Beyond Tellerrand 2023]
tammyeverts
6
1.5k
The Invisible Side of Design
smashingmag
294
49k
Transcript
ൃදऀ ୩ޱହ࢙ ҟৗͳڍಈ
!2 Pathological behavior ࣭จ͕did͚ͩͰ Ϟσϧͷग़ྗಉ͡ ֬ߴ͍
֓ཁ w NLPʹ͓͚ΔχϡʔϥϧϞσϧͷղੳख๏ΛఏҊ w Ϟσϧ͕λεΫΛղ্͘Ͱॏཁͳ୯ޠΛநग़͢Δख๏ w நग़͞Εͨ୯ޠਓʹͱͬͯҙຯෆ໌ w ҰํͰϞσϧநग़୯ޠͰਖ਼͘͠༧ଌ(Pathology) w
ղੳ݁Ռʹجͮ͘ਖ਼ଇԽ߲ΛఏҊ w ਖ਼ଇԽ߲ʹΑͬͯϞσϧͷղऍੑ্ !3
࣍ Ϟσϧղੳͷطଘख๏ ఏҊख๏ ࣮ݧ ·ͱΊ !4
Ϟσϧղੳͷطଘख๏
Ϟσϧղੳͷطଘख๏ !6 Adversarial Example Ϟσϧʹਓͷײʹ͢ΔڍಈΛͤ͞Δαϯϓϧ NLPͷλεΫ ओʹQAλεΫ Ͱύλʔϯ ਓʹͱͬͯҙຯͷͳ͍มߋ͕ɺϞσϧͷग़ྗΛܹมͤ͞Δέʔε
ਓʹͱͬͯ໌Β͔ͳมߋͰɺϞσϧ͕ग़ྗΛม͑ͳ͍έʔε
ग़ྗ͕ܹม͢Δέʔε !7 Jia et al., 2017 ΫΥʔλʔόοΫͷྸʹ͍ͭͯͷ จॻʹΫΥʔλʔόοΫͷഎ൪߸ʹ ؔ͢ΔจΛՃ Ϟσϧޡ
ग़ྗΛม͑ͳ͍έʔε !8 Mudrakarta et al., 2018 ݐͷന͍ϨϯΨ͕ରশ͔ʁ spherical (ٿঢ়ͷ) ݐͷന͍ϨϯΨ͕ٿঢ়͔ʁ
࣭จͷҙຯมԽ Ϟσϧͷ༧ଌෆม
2. ఏҊख๏
*OQVU3FEVDUJPO • ॏཁͰͳ͍୯ޠΛೖྗ͔ΒΓɺϞσϧͷڍಈΛੳ • Ϟσϧ͕ਖ਼͍͠ग़ྗΛ͢ΔͨΊʹඞཁͳ࠷୯ޠ (ॏཁ ୯ޠ) • Adversarial ExampleϞσϧʹͱͬͯͷॏཁ୯ޠʹண
*OQVU3FEVDUJPO !11 x y Ϟσϧͷ༧ଌ f( ⋅ ) χϡʔϥϧϞσϧ ೖྗܥྻ
(จจॻ) xi ೖྗܥྻͷ͋Δཁૉ (୯ޠ) g(xi |x) = f(y|x) − f(y|x−i ) ͋Δ୯ޠ ʹର͢Δ ॏཁΛఆٛ xi g i൪ͷ୯ޠΛফͨ͠ೖྗ
*OQVU3FEVDUJPO !12 g(xcontest |x) = f(y|x) − f(y|x−contest ) What
company won free advertisement due to QuickBooks contest ? What company won free advertisement due to QuickBooks contest ? g͕େ͖͚Εɺcontest͕ॏཁͳ୯ޠͱͳΔ Ϟσϧͷग़ྗʹେ͖͘د༩͍ͯ͠ΔͨΊ
*OQVU3FEVDUJPO !13 g(xi |x) = f(y|x) − f(y|x−i ) ॏཁͷ͍୯ޠΛআ
y͕มԽ͠ͳ͍Α͏ʹɺg͕࠷খͱͳΔ୯ޠiΛআ ͍ͯ͘͠
3. ࣮ݧ
ղੳͷରλεΫ 1. SQuAD w จॻͱ࣭จ͕༩͑ΒΕΔˠ࣭จʹରͯ͠Input Reduction w จॻ͔ΒղΛநग़͢ΔλεΫ 2. SNLI
w จ͕༩͑ΒΕΔˠͭͷจʹରͯ͠Input Reduction w จͷؔΛਪఆ͢ΔλεΫ 3. VQA w ը૾ͱ࣭จ͕༩͑ΒΕΔˠ࣭จʹରͯ͠Input Reduction w ղΛੜ͢ΔλεΫ !15
࣮ݧ༰ Input Reduction w Ϟσϧ͕ਖ਼͍͠ग़ྗΛ͢ΔαϯϓϧΛରʹ࣮ݧ w Input ReductionΛద༻ͨ͠ೖྗ(Reduced)ʹର͢ΔਓखධՁ w ReducedͱϥϯμϜʹ୯ޠΛམͱͨ͠߹(Random)ͷࠩҟͷධՁ
Regularization on Reduced Inputs w Input ReductionʹΑΔϞσϧͷPathological behaviorΛܰݮ͢Δਖ਼ଇԽ߲ ޙड़ ͷಋೖ !16
Reducedʹର͢ΔਓखධՁ !17 Reducedʹରͯ͠ ਓਖ਼͍͠༧ଌΛͰ ͖ͳ͍ w Reducedʹର͢Δਓͷਖ਼ w Ϟσϧͷਖ਼͕ͷαϯϓϧΛ༻
Reducedʹର͢ΔਓखධՁ !18 w ReducedͱRandomͷͲͪΒ͕ࣗવͳจ͔ w vs. Randomfifty-fiftyͱׂ͑ͨ߹ Reducedਓʹͱͬ ͯRandomͱಉ͡
Reducedͷࣄྫ !19 ʮͲ͜Ͱ࿅शͨ͠ ͔ʯΛฉ͔Ε͍ͯ ΔͷΘ͔Δ͕ɺ ʮͲͷνʔϜʯ͔ Θ͔Βͳ͍
Reducedͷฏۉ୯ޠ ͭͷλεΫͱɺ ਖ਼͢Δͷʹඞཁͳ୯ޠฏۉd
Reducedʹର͢ΔϞσϧͷ֬ !21 • Input Reductionͷద༻લޙͰϞσϧͷ ֬ʹมԽ΄ͱΜͲͳ͍ • ϞσϧӶ͍ϐʔΫΛ࣋ͭΑ͏ͳ Λֶश͍ͯ͠Δ͜ͱ͕ݪҼ
ਖ਼ଇԽ߲ͷಋೖ !22 ∑ (x,y)∈(X,Y) log(f(y|x)) + λ∑ ¯ x∈ ¯
X H(f(y| ¯ x)) Reducedʹରͯ͠ਖ਼͍͠yΛ ग़ྗ͠ʹ͘͘͢Δ ௨ৗͷతؔ Reducedαϯϓϧ௨ৗͷతؔΛֶͬͯशͨ͠ ϞσϧΛ༻͍ͯੜ
ਖ਼ଇԽ߲ͷޮՌ !23 • Ϟσϧͷਫ਼͕ඍ૿ • ਖ਼ʹඞཁͳ୯ޠ ͕૿Ճ
ਖ਼ଇԽ߲ͷޮՌ !24 ਓखධՁͷਫ਼্ Input Reductionͨ͠ೖྗ ͷղऍੑ্͕
ਖ਼ଇԽͨ͠Ϟσϧͷࣄྫ !25 Input Reductionͨ͠ೖྗ͕ਓͰ ղऍՄೳʹͳͬͨ
·ͱΊ ఏҊख๏ w NLPͷχϡʔϥϧϞσϧղੳख๏ͱͯ͠ɺInput ReductionΛఏҊ w ༧ଌʹد༩͠ͳ͍୯ޠΛೖྗ͔ΒΓɺϞσϧΛղੳ ࣮ݧ݁Ռ w ఏҊख๏Λద༻ͨ͠ೖྗਓʹͱͬͯҙຯෆ໌
w ҰํͰχϡʔϥϧϞσϧਖ਼͍͠༧ଌΛߦ͏ w ਖ਼ଇԽ߲Λಋೖ͢ΔͱϞσϧͷڍಈվળ !26