Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Pathologies of Neural Models Make Interpretatio...
Search
Yasufumi Taniguchi
December 09, 2018
Research
1
1.8k
Pathologies of Neural Models Make Interpretations Difficult
Yasufumi Taniguchi
December 09, 2018
Tweet
Share
More Decks by Yasufumi Taniguchi
See All by Yasufumi Taniguchi
AllenNLPを使った開発
yasufumy
0
2.2k
Making Neural QA as Simple as Possible but not Simpler
yasufumy
0
97
Other Decks in Research
See All in Research
When Learned Data Structures Meet Computer Vision
matsui_528
1
170
地域丸ごとデイサービス「Go トレ」の紹介
smartfukushilab1
0
310
能動適応的実験計画
masakat0
2
910
「どう育てるか」より「どう働きたいか」〜スクラムマスターの最初の一歩〜
hirakawa51
0
990
論文紹介:Not All Tokens Are What You Need for Pretraining
kosuken
0
200
Remote sensing × Multi-modal meta survey
satai
4
530
CoRL2025速報
rpc
1
2.7k
論文紹介:Safety Alignment Should be Made More Than Just a Few Tokens Deep
kazutoshishinoda
0
110
Sat2City:3D City Generation from A Single Satellite Image with Cascaded Latent Diffusion
satai
3
200
EOGS: Gaussian Splatting for Efficient Satellite Image Photogrammetry
satai
4
750
Combinatorial Search with Generators
kei18
0
1.1k
EcoWikiRS: Learning Ecological Representation of Satellite Images from Weak Supervision with Species Observation and Wikipedia
satai
3
310
Featured
See All Featured
Visualizing Your Data: Incorporating Mongo into Loggly Infrastructure
mongodb
48
9.7k
RailsConf 2023
tenderlove
30
1.3k
Code Reviewing Like a Champion
maltzj
526
40k
"I'm Feeling Lucky" - Building Great Search Experiences for Today's Users (#IAC19)
danielanewman
231
22k
Design and Strategy: How to Deal with People Who Don’t "Get" Design
morganepeng
132
19k
It's Worth the Effort
3n
187
28k
Making the Leap to Tech Lead
cromwellryan
135
9.6k
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
31
2.6k
Bash Introduction
62gerente
615
210k
The Web Performance Landscape in 2024 [PerfNow 2024]
tammyeverts
10
900
Performance Is Good for Brains [We Love Speed 2024]
tammyeverts
12
1.3k
BBQ
matthewcrist
89
9.9k
Transcript
ൃදऀ ୩ޱହ࢙ ҟৗͳڍಈ
!2 Pathological behavior ࣭จ͕did͚ͩͰ Ϟσϧͷग़ྗಉ͡ ֬ߴ͍
֓ཁ w NLPʹ͓͚ΔχϡʔϥϧϞσϧͷղੳख๏ΛఏҊ w Ϟσϧ͕λεΫΛղ্͘Ͱॏཁͳ୯ޠΛநग़͢Δख๏ w நग़͞Εͨ୯ޠਓʹͱͬͯҙຯෆ໌ w ҰํͰϞσϧநग़୯ޠͰਖ਼͘͠༧ଌ(Pathology) w
ղੳ݁Ռʹجͮ͘ਖ਼ଇԽ߲ΛఏҊ w ਖ਼ଇԽ߲ʹΑͬͯϞσϧͷղऍੑ্ !3
࣍ Ϟσϧղੳͷطଘख๏ ఏҊख๏ ࣮ݧ ·ͱΊ !4
Ϟσϧղੳͷطଘख๏
Ϟσϧղੳͷطଘख๏ !6 Adversarial Example Ϟσϧʹਓͷײʹ͢ΔڍಈΛͤ͞Δαϯϓϧ NLPͷλεΫ ओʹQAλεΫ Ͱύλʔϯ ਓʹͱͬͯҙຯͷͳ͍มߋ͕ɺϞσϧͷग़ྗΛܹมͤ͞Δέʔε
ਓʹͱͬͯ໌Β͔ͳมߋͰɺϞσϧ͕ग़ྗΛม͑ͳ͍έʔε
ग़ྗ͕ܹม͢Δέʔε !7 Jia et al., 2017 ΫΥʔλʔόοΫͷྸʹ͍ͭͯͷ จॻʹΫΥʔλʔόοΫͷഎ൪߸ʹ ؔ͢ΔจΛՃ Ϟσϧޡ
ग़ྗΛม͑ͳ͍έʔε !8 Mudrakarta et al., 2018 ݐͷന͍ϨϯΨ͕ରশ͔ʁ spherical (ٿঢ়ͷ) ݐͷന͍ϨϯΨ͕ٿঢ়͔ʁ
࣭จͷҙຯมԽ Ϟσϧͷ༧ଌෆม
2. ఏҊख๏
*OQVU3FEVDUJPO • ॏཁͰͳ͍୯ޠΛೖྗ͔ΒΓɺϞσϧͷڍಈΛੳ • Ϟσϧ͕ਖ਼͍͠ग़ྗΛ͢ΔͨΊʹඞཁͳ࠷୯ޠ (ॏཁ ୯ޠ) • Adversarial ExampleϞσϧʹͱͬͯͷॏཁ୯ޠʹண
*OQVU3FEVDUJPO !11 x y Ϟσϧͷ༧ଌ f( ⋅ ) χϡʔϥϧϞσϧ ೖྗܥྻ
(จจॻ) xi ೖྗܥྻͷ͋Δཁૉ (୯ޠ) g(xi |x) = f(y|x) − f(y|x−i ) ͋Δ୯ޠ ʹର͢Δ ॏཁΛఆٛ xi g i൪ͷ୯ޠΛফͨ͠ೖྗ
*OQVU3FEVDUJPO !12 g(xcontest |x) = f(y|x) − f(y|x−contest ) What
company won free advertisement due to QuickBooks contest ? What company won free advertisement due to QuickBooks contest ? g͕େ͖͚Εɺcontest͕ॏཁͳ୯ޠͱͳΔ Ϟσϧͷग़ྗʹେ͖͘د༩͍ͯ͠ΔͨΊ
*OQVU3FEVDUJPO !13 g(xi |x) = f(y|x) − f(y|x−i ) ॏཁͷ͍୯ޠΛআ
y͕มԽ͠ͳ͍Α͏ʹɺg͕࠷খͱͳΔ୯ޠiΛআ ͍ͯ͘͠
3. ࣮ݧ
ղੳͷରλεΫ 1. SQuAD w จॻͱ࣭จ͕༩͑ΒΕΔˠ࣭จʹରͯ͠Input Reduction w จॻ͔ΒղΛநग़͢ΔλεΫ 2. SNLI
w จ͕༩͑ΒΕΔˠͭͷจʹରͯ͠Input Reduction w จͷؔΛਪఆ͢ΔλεΫ 3. VQA w ը૾ͱ࣭จ͕༩͑ΒΕΔˠ࣭จʹରͯ͠Input Reduction w ղΛੜ͢ΔλεΫ !15
࣮ݧ༰ Input Reduction w Ϟσϧ͕ਖ਼͍͠ग़ྗΛ͢ΔαϯϓϧΛରʹ࣮ݧ w Input ReductionΛద༻ͨ͠ೖྗ(Reduced)ʹର͢ΔਓखධՁ w ReducedͱϥϯμϜʹ୯ޠΛམͱͨ͠߹(Random)ͷࠩҟͷධՁ
Regularization on Reduced Inputs w Input ReductionʹΑΔϞσϧͷPathological behaviorΛܰݮ͢Δਖ਼ଇԽ߲ ޙड़ ͷಋೖ !16
Reducedʹର͢ΔਓखධՁ !17 Reducedʹରͯ͠ ਓਖ਼͍͠༧ଌΛͰ ͖ͳ͍ w Reducedʹର͢Δਓͷਖ਼ w Ϟσϧͷਖ਼͕ͷαϯϓϧΛ༻
Reducedʹର͢ΔਓखධՁ !18 w ReducedͱRandomͷͲͪΒ͕ࣗવͳจ͔ w vs. Randomfifty-fiftyͱׂ͑ͨ߹ Reducedਓʹͱͬ ͯRandomͱಉ͡
Reducedͷࣄྫ !19 ʮͲ͜Ͱ࿅शͨ͠ ͔ʯΛฉ͔Ε͍ͯ ΔͷΘ͔Δ͕ɺ ʮͲͷνʔϜʯ͔ Θ͔Βͳ͍
Reducedͷฏۉ୯ޠ ͭͷλεΫͱɺ ਖ਼͢Δͷʹඞཁͳ୯ޠฏۉd
Reducedʹର͢ΔϞσϧͷ֬ !21 • Input Reductionͷద༻લޙͰϞσϧͷ ֬ʹมԽ΄ͱΜͲͳ͍ • ϞσϧӶ͍ϐʔΫΛ࣋ͭΑ͏ͳ Λֶश͍ͯ͠Δ͜ͱ͕ݪҼ
ਖ਼ଇԽ߲ͷಋೖ !22 ∑ (x,y)∈(X,Y) log(f(y|x)) + λ∑ ¯ x∈ ¯
X H(f(y| ¯ x)) Reducedʹରͯ͠ਖ਼͍͠yΛ ग़ྗ͠ʹ͘͘͢Δ ௨ৗͷతؔ Reducedαϯϓϧ௨ৗͷతؔΛֶͬͯशͨ͠ ϞσϧΛ༻͍ͯੜ
ਖ਼ଇԽ߲ͷޮՌ !23 • Ϟσϧͷਫ਼͕ඍ૿ • ਖ਼ʹඞཁͳ୯ޠ ͕૿Ճ
ਖ਼ଇԽ߲ͷޮՌ !24 ਓखධՁͷਫ਼্ Input Reductionͨ͠ೖྗ ͷղऍੑ্͕
ਖ਼ଇԽͨ͠Ϟσϧͷࣄྫ !25 Input Reductionͨ͠ೖྗ͕ਓͰ ղऍՄೳʹͳͬͨ
·ͱΊ ఏҊख๏ w NLPͷχϡʔϥϧϞσϧղੳख๏ͱͯ͠ɺInput ReductionΛఏҊ w ༧ଌʹد༩͠ͳ͍୯ޠΛೖྗ͔ΒΓɺϞσϧΛղੳ ࣮ݧ݁Ռ w ఏҊख๏Λద༻ͨ͠ೖྗਓʹͱͬͯҙຯෆ໌
w ҰํͰχϡʔϥϧϞσϧਖ਼͍͠༧ଌΛߦ͏ w ਖ਼ଇԽ߲Λಋೖ͢ΔͱϞσϧͷڍಈվળ !26