[Reading Group] Learning to Explain: An Information-Theoretic Perspective on Model Interpretation
mei28
September 29, 2020
Reading group material.
Learning to Explain: An Information-Theoretic Perspective on Model Interpretation (ICML2018)
Transcript
Learning to Explain: An Information-Theoretic Perspective on Model Interpretation
Reading group
Paper information
• Authors
  • Jianbo Chen (UC Berkeley; intern at Ant Financial)
  • Le Song (Georgia Institute of Technology; Ant Financial)
  • Martin J. Wainwright (University of California; The Voleon Group)
  • Michael I. Jordan (University of California)
• Source: ICML 2018
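Overview: What kind of paper is this?
• Proposes a method for training an explanation model that explains which features a model's decision (output) was based on.
• Unlike earlier explanation methods, the explainer is trained once and explanations do not have to be recomputed from scratch each time, so it is very fast.
• Why I chose it
  • I want to bring explainable AI into my own research.
  • It has been a hot topic recently.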
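Overview: Research background
• Complex models such as random forests, kernel methods, and deep neural networks have been proposed, and they have raised prediction accuracy.
• However, it has become difficult for humans to interpret the outputs of such complex models.
• This paper tries to improve model interpretability by finding the features that are important for a prediction.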
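Overview: Existing work
• Compute the gradient of the correct output with respect to the input vector, and use that gradient to mask the input as the explanation.
• Build a simple classifier locally around the sample to be explained, and use it to produce the explanation.
• Methods of these kinds include LIME, DeepLIFT, and kernel SHAP.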
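Overview: Differences from existing work
• The local explanation model is learned globally, so the distribution of the inputs can be taken into account.
• No additional local explanation model has to be fitted for each sample.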
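Overview: Contributions
The contributions are as follows:
• Proposed a framework for instance-wise feature selection based on mutual information.
• Proposed an efficient, model-agnostic feature selection algorithm.
• Showed experimentally that the proposed algorithm is effective on both synthetic and real data.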
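Framework
• The proposed method is model-agnostic, so it applies to both regression and classification models (classification is considered here).
• As a premise, the model output is the conditional distribution $\mathbb{P}_m(\cdot \mid x)$ of the response variable $Y$, given the input random variable $X = x \in \mathbb{R}^d$.
• Mutual information is used as the criterion for instance-wise feature selection: $I(X; Y) = \mathbb{E}_{X,Y}\left[\log \frac{p_{XY}(X, Y)}{p_X(X)\,p_Y(Y)}\right]$
• The goal is to maximize the mutual information between the response variable and the selected features.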
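To make the mutual information formula concrete, here is a minimal sketch for a small discrete joint distribution. The function name `mutual_information` and the two example tables are illustrative assumptions, not taken from the slides or the paper.

```python
import math


def mutual_information(joint):
    """Mutual information I(X; Y) in nats for a discrete joint distribution.

    joint[i][j] is p_XY(x_i, y_j); the entries are assumed to sum to 1.
    """
    p_x = [sum(row) for row in joint]          # marginal distribution of X
    p_y = [sum(col) for col in zip(*joint)]    # marginal distribution of Y
    total = 0.0
    for i, row in enumerate(joint):
        for j, p_xy in enumerate(row):
            if p_xy > 0.0:                     # 0 * log 0 is treated as 0
                total += p_xy * math.log(p_xy / (p_x[i] * p_y[j]))
    return total


# Hypothetical examples: independent variables versus a perfect copy.
independent = [[0.25, 0.25], [0.25, 0.25]]
copy = [[0.5, 0.0], [0.0, 0.5]]
mi_independent = mutual_information(independent)
mi_copy = mutual_information(copy)
```

When X and Y are independent the value is zero, and when Y is a copy of a fair binary X it equals log 2, which is why a larger value indicates that the selected features carry more information about the output.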
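Framework: Setup
• $X$: drawn from the marginal distribution, $X \sim \mathbb{P}_X(\cdot)$.
• $Y$: drawn from the conditional distribution, $(Y \mid x) \sim \mathbb{P}_m(\cdot \mid x)$.
• $\wp_k = \{S \subset 2^d \mid |S| = k\}$: the collection of subsets of size $k$ (taken from the power set), used for feature selection.
• $k$: a hyperparameter.
• $\mathcal{E}$: the explanation model, which maps $\mathbb{R}^d$ to this collection of subsets.
• $S$: the subset chosen by $\mathcal{E}$, $S = \mathcal{E}(x)$; it gives the random variable of selected features, $x_S$.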
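Framework: Objective function
• With the new random vector $X_S \in \mathbb{R}^k$, the goal becomes optimizing an objective defined on $X_S$.
• The subset of features that maximizes the mutual information can be presented as the features that are important to the classifier.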
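Written in the notation of the setup slide, the objective described here would read roughly as follows. This is a reconstruction from the surrounding definitions and from the L2X paper as I understand it, not a copy of the slide.

```latex
\max_{\mathcal{E}} \; I(X_S; Y)
\quad \text{subject to} \quad S \sim \mathcal{E}(X)
```

Here $S \sim \mathcal{E}(X)$ expresses that the explainer picks the size-$k$ subset $S$ for each input, and $X_S$ is the sub-vector of $X$ restricted to $S$.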
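Proposed method
• The objective function is hard to solve directly, so it is solved with a variational approximation.
• The goal becomes maximizing a lower bound on the mutual information.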
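Proposed method: Variational lower bound
• For a general model, it is difficult to compute the expectation in the lower bound under the conditional distribution, so the following variational family is defined:
  $\mathcal{Q} := \{\mathbb{Q} \mid \mathbb{Q} = \{x_S \to \mathbb{Q}_S(Y \mid x_S),\ S \in \wp_k\}\}$
• By Jensen's inequality, the lower bound is rewritten using this family.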
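The Jensen step can be spelled out as the inequality below, a sketch assuming $\mathbb{Q}_S(\cdot \mid x_S)$ is any member of the variational family and the expectation is taken under the model's own conditional distribution $\mathbb{P}_m(\cdot \mid x_S)$.

```latex
\mathbb{E}_{Y \mid x_S}\!\left[\log \mathbb{P}_m(Y \mid x_S)\right]
  \;\ge\;
\mathbb{E}_{Y \mid x_S}\!\left[\log \mathbb{Q}_S(Y \mid x_S)\right]
```

The gap between the two sides is the KL divergence from $\mathbb{Q}_S(\cdot \mid x_S)$ to $\mathbb{P}_m(\cdot \mid x_S)$, which is non-negative and vanishes only when the two distributions coincide. That is why maximizing the right-hand side over $\mathbb{Q}$ tightens the bound.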
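Proposed method: Optimization
• Using $\mathcal{Q}$, the goal is restated in terms of the variational family.
• For general $\mathcal{E}$ and $\mathbb{Q}$, the resulting problem is still hard to solve, so it has to be converted into a form that can actually be optimized.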
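In the same notation, the restated goal would look roughly like this. It is a reconstruction that drops the constant term not depending on $\mathcal{E}$ or $\mathbb{Q}$, and it assumes the joint maximization is over both the explainer and the variational family.

```latex
\max_{\mathcal{E},\, \mathbb{Q}} \;
\mathbb{E}\left[\log \mathbb{Q}_S(Y \mid X_S)\right]
\quad \text{subject to} \quad S \sim \mathcal{E}(X)
```

The difficulty mentioned on the slide comes from the discrete choice of $S$: the expectation ranges over subsets, which blocks direct gradient-based optimization.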
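Proposed method: Parameterizing $\mathbb{Q}$
• To parameterize $\mathbb{Q}$, a neural network is defined:
  $g_\alpha : \mathbb{R}^d \times [c] \to [0, 1]$, where $[c] = \{0, 1, \dots, c - 1\}$
• Using this $g_\alpha$, set $\mathbb{Q}_x := g_\alpha(\tilde{x}_S, Y)$.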
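A minimal sketch of this parameterization follows. It assumes that $\tilde{x}_S$ is the input with every coordinate outside $S$ set to zero, and it uses a single linear layer with softmax as a toy stand-in for the network; the names `mask_features` and `g_alpha` and all the numbers are hypothetical.

```python
import math


def mask_features(x, subset):
    """Return x with every coordinate outside `subset` set to zero (assumed form of x~_S)."""
    return [xi if i in subset else 0.0 for i, xi in enumerate(x)]


def g_alpha(x_masked, weights, biases):
    """Toy stand-in for g_alpha: one linear layer followed by a softmax over c classes."""
    logits = [
        sum(w * xi for w, xi in zip(row, x_masked)) + b
        for row, b in zip(weights, biases)
    ]
    peak = max(logits)                          # subtract the max for numerical stability
    exps = [math.exp(v - peak) for v in logits]
    norm = sum(exps)
    return [e / norm for e in exps]


x = [1.0, -2.0, 0.5]
x_masked = mask_features(x, subset={0, 2})      # keep features 0 and 2 only
probs = g_alpha(
    x_masked,
    weights=[[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]],  # two classes, three features
    biases=[0.0, 0.0],
)
```

Entry `probs[y]` plays the role of $g_\alpha(\tilde{x}_S, y)$: the approximate probability of class `y` given only the selected features.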
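Proposed method: Continuous relaxation of subset sampling
• Estimating the objective requires summing over ${}_nC_k$ feature subsets, which is expensive.
• To get around this, the Gumbel-softmax trick is used.
  • It is a technique that relaxes a categorical distribution into a continuous one.
• The Gumbel-softmax trick is used to approximate weighted subset sampling.
Reference on the Gumbel-softmax trick: http://peluigi.hatenablog.com/entry/2018/06/23/192435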
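The relaxation can be sketched in a few lines of plain Python. The function name `gumbel_softmax_sample`, the temperature value, and the example logits are assumptions made for illustration; the slides do not give an implementation.

```python
import math
import random


def gumbel_softmax_sample(logits, temperature, rng):
    """Draw one relaxed one-hot sample from the categorical distribution given by `logits`."""
    noisy = []
    for logit in logits:
        u = rng.random()
        while u == 0.0:                          # avoid log(0)
            u = rng.random()
        gumbel = -math.log(-math.log(u))         # Gumbel(0, 1) noise
        noisy.append((logit + gumbel) / temperature)
    peak = max(noisy)                            # numerically stable softmax
    exps = [math.exp(v - peak) for v in noisy]
    norm = sum(exps)
    return [e / norm for e in exps]


rng = random.Random(0)
logits = [math.log(0.7), math.log(0.2), math.log(0.1)]
sample = gumbel_softmax_sample(logits, temperature=0.5, rng=rng)
```

The result is a point on the probability simplex rather than a hard one-hot vector. Lower temperatures push it closer to one-hot, while the whole computation stays differentiable with respect to the logits.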
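Proposed method: Extracting just $k$ features
• Sampling $k$ features out of $d$ features works as follows:
  • Sample from the $d$-dimensional feature vector independently $k$ times.
  • Drop features that were drawn more than once and keep the rest.
• This leaves at most $k$ features. These steps can be written as equations.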
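The sampling steps above can be sketched as follows, under the assumption that each of the $k$ draws is a relaxed one-hot (Gumbel-softmax) sample and that duplicates are merged with an element-wise maximum. The names `concrete_sample` and `relaxed_k_hot` are hypothetical.

```python
import math
import random


def concrete_sample(logits, temperature, rng):
    """One relaxed one-hot draw over the d features (Gumbel-softmax)."""
    noisy = []
    for logit in logits:
        u = rng.random()
        while u == 0.0:                          # avoid log(0)
            u = rng.random()
        noisy.append((logit - math.log(-math.log(u))) / temperature)
    peak = max(noisy)
    exps = [math.exp(v - peak) for v in noisy]
    norm = sum(exps)
    return [e / norm for e in exps]


def relaxed_k_hot(logits, k, temperature, rng):
    """Element-wise maximum over k independent relaxed one-hot draws."""
    draws = [concrete_sample(logits, temperature, rng) for _ in range(k)]
    # A feature drawn several times is counted once, so at most k entries end up large.
    return [max(column) for column in zip(*draws)]


rng = random.Random(1)
d, k = 6, 3
v = relaxed_k_hot([0.0] * d, k=k, temperature=0.1, rng=rng)
```

Multiplying the input element-wise by `v` gives a soft version of "keep only the selected features", which is what lets gradients flow back into the weights that produce the logits.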
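Proposed method: Final objective function and its optimization
• Combining the steps so far gives the final objective to be optimized.
• The expectation over $X, \zeta$ does not depend on $\theta, \alpha$, so at training time gradient descent can be applied to both sets of parameters simultaneously.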
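Putting the pieces together, the final objective would look roughly like this. It is a reconstruction in which $V(\theta, \zeta)$ is assumed to denote the relaxed $k$-hot vector produced from explainer parameters $\theta$ and auxiliary noise $\zeta$, and $\odot$ is element-wise multiplication.

```latex
\max_{\theta,\, \alpha} \;
\mathbb{E}_{X, Y, \zeta}\left[\log g_\alpha\big(V(\theta, \zeta) \odot X,\; Y\big)\right]
```

Because the randomness enters only through the data and the noise $\zeta$, the expectation can be estimated by sampling, and stochastic gradients with respect to $\theta$ and $\alpha$ can be taken in the same backward pass.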
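Proposed method: Explanation stage
• The trained explanation model maps a sample $X$ to a $d$-dimensional weight vector $w_\theta(X)$.
• The features with the largest scores $p_\theta(X)$ are then selected and used as the explanation.
• For each sample, features are selected with a single pass through the explanation model, so explanations are produced more efficiently than in existing work and without depending on the model being explained.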
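The selection step at explanation time amounts to a top-$k$ lookup. A minimal sketch, with a hypothetical helper `top_k_features` and made-up scores:

```python
def top_k_features(weights, k):
    """Indices of the k largest entries of `weights`, returned in ascending index order."""
    ranked = sorted(range(len(weights)), key=lambda i: weights[i], reverse=True)
    return sorted(ranked[:k])


# Hypothetical per-feature scores produced by a trained explainer for one sample.
scores = [0.05, 0.40, 0.10, 0.30, 0.15]
selected = top_k_features(scores, k=2)
```

With these scores the selected indices are 1 and 3. No optimization or sampling is needed per sample, which is where the speed advantage at explanation time comes from.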
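Experiments: Datasets
• Experiments are run on two kinds of data: synthetic data and real data.
• The synthetic data are generated from a multi-dimensional Gaussian distribution, and several types of datasets are built from it.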
Experiments: Results
• The proposed method (L2X) is compared with existing methods (DeepLIFT, SHAP, LIME, ...).
Experiments: Results
• Reports the running time of each method at explanation time.
Results: Highlights on text data
Movie review dataset
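Results: Evaluation by crowd workers
• For movie review texts, crowd workers are asked to select keywords that relate to the sentiment.
• Multiple texts are evaluated by multiple workers, and the average is reported as the human accuracy.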
Conclusion
• Proposed a framework for instance-wise feature selection based on mutual information.
• L2X makes it possible, for the first time, to explain black-box models in real time.