Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Multimodal Grounding for Language Processing
Search
onizuka laboratory
October 17, 2018
Research
0
44
Multimodal Grounding for Language Processing
弊研究室で行なったCOLING2018読み会の発表資料です。
onizuka laboratory
October 17, 2018
Tweet
Share
More Decks by onizuka laboratory
See All by onizuka laboratory
Phrase-Based & Neural Unsupervised Machine Translation
onilab
0
110
Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions
onilab
0
64
Card-660: A Reliable Evaluation Framework for Rare Word Representation Models
onilab
0
31
A Word-Complexity Lexicon and A Neural Readability Ranking Model for Lexical Simplification
onilab
0
98
Integrating Transformer and Paraphrase Rules for Sentence Simplification
onilab
0
56
An Auto-Encoder Matching Model for Learning Utterance-Level Semantic Dependency in Dialogue Generation
onilab
0
48
Generating More Interesting Responses in Neural Conversation Models with Distributional Constraints
onilab
0
98
Modeling Multi-turn Conversation with Deep Utterance Aggregation
onilab
0
91
Learning Semantic Sentence Embeddings using Pair-wise Discriminator
onilab
0
110
Other Decks in Research
See All in Research
LiDARセキュリティ最前線
kentaroy47
0
280
Combating Misinformation in the age of LLMs
teacherpeterpan
0
130
第4回ナレッジグラフ勉強会:ISWC2023論文読み会
kg_wakate
1
200
[KDD2023論文読み会] BERT4CTR: An Efficient Framework to Combine Pre-trained Language Model with Non-textual Features for CTR Prediction / KDD2023 LY Tech Reading
shunk031
0
440
Generative AI - practice and theory
gpeyre
1
560
説明可能AI:代表的手法と最近の動向
yuyay
1
590
HP (Hitto Point: 筆頭ポイント)
tanichu
0
710
訓練データ作成のためのCloudCompareを利用した点群の手動ラベリング
kentaitakura
0
540
動物倫理学ことはじめ:人間以外の動物との倫理的な付き合い方を考える
takeshit_m
0
290
The Theory behind Vector DB
matsui_528
0
1.6k
床面圧力センサ開発における感圧導電シート分離方式の検討 / WISS2023
yumulab
0
270
Accurate Method and Variable Tracking in Commit History
tsantalis
0
250
Featured
See All Featured
How to name files
jennybc
65
93k
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
17
1.4k
Music & Morning Musume
bryan
41
5.6k
The Success of Rails: Ensuring Growth for the Next 100 Years
eileencodes
30
6k
Clear Off the Table
cherdarchuk
84
310k
Refactoring Trust on Your Teams (GOTO; Chicago 2020)
rmw
25
2.3k
Designing Experiences People Love
moore
136
23k
Side Projects
sachag
451
41k
How to Create Impact in a Changing Tech Landscape [PerfNow 2023]
tammyeverts
14
1.6k
Web development in the modern age
philhawksworth
202
10k
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
19
1.7k
A Modern Web Designer's Workflow
chriscoyier
689
190k
Transcript
Multimodal Grounding for Language Processing 20181017 NomotoEriko 1
### # &0 NLP +$ 65#5- Contents: Ø"!
Ø # &0 '8 Ø # 3/*2 Ø # NLP Multimodal Grounding for Language Processing 2 %7. 4( )35- ,1
Multimodal Grounding for Language Processing 3
HF>T NLP A>O… "/(# '*!.@dQaBX g6O _QR>O… KIEWLS 8e^?2JO
Conceptual grounding: ]`QZ3fDZb7P14N: U ↑ S<),%+/$,M=0 ]` [c 4 !.&- YZC;hVZC;h\ZC;h9ZC;GDZ56M=
Multimodal Grounding for Language Processing 5
# 3 - ($ '+ . ,! &" 3
- ,)* (Cross-modal transfer) ,'+ (Cross-modal interpretation) %# (Joint multimodal processing) 6 #-
@>? (Cross-modal transfer) E*D8F &E*A):F -+ 1! 92
C "G Ø07E;=→4# '45F ØB,(.E→B6F ØE< /#→F 7 $3%C
GAF (Cross-modal interpretation) $)=( /,>3D+ G@6 . K &)B"IL?*B"142;<M !N
Ø9#80LHE 5:M Ø LJ&C-5:M 8 %7'K
;$ 3 (Joint multimodal processing) =- %/ F *'
#27 : $(> I Ø+*CDG6B<* "4H Ø@A7E&)8G5? 6B 3H 9 3 !F 09.1,
Multimodal Grounding for Language Processing 10
,(+ 3 0 ,( $ * 3
.- Ø%#,( '" (Concept representations) Ø !'" (Projection) Ø&/),( '" (Compositional representations) 11 ,(+
.)50/( (Concept representations) .)50 7&;.)9 #, '!( Ø<:.)50 6,:1 +50
'!( Ø 2 8 *"3 %- 12 50$4
,*:21) (Concept representations) (&!+.F Ø"6BA7C%#/D Ø↑ “cat” ;<5 “panther”
>@5 “dog” : E 8$ : Ø04 = 3 ?=- 13 :2'9
2394 (Projection) 86>:4 D 5. &(" 86 +$) A &("@0-
(Mapping) /?>:<@ (Joint representation space) 7;86>:4 B*'!#1= ;C 14 %) &,)>:1= ←Mapping ↓ Joint learning
./40 (Projection) !"=)( (Mapping) Ø!"+!" )( !: #$% → #$'
-9 Ø87>13 @6(*? '& %$2<,: maximize sim(#$% , #$' ), minimize sim(#$% , ./0123$' ) 15 #!%#;5-9
-/70 (Projection) &@?8;B (Joint representation space) Ø%#$ :9C2+35A( ü) ?8'6<E1.
F üB 5A =*E1.F 4D 2 B !,> 16 " ?8,>
;NB KA @5 OE*KA6L/- E* I-6L CD=P U$#
< 8>7: Ø( E* RFS3.90&TJ Ø," MG? E*'U 94 Q2 )+% 17 !KA1H
NLP Multimodal Grounding for Language Processing
18
NLP #&$'&,> 957.A=<4 ØD(6' (simLex-999 etc…) 1 7.A=@2 B*
Ø/0E #&$'&7.A=%!$'&- +)?95 8C?95 :; Ø#&$'&7.A=<4" Ø#&$'&7.A=8C?953 19 #&$'& NLP
NLP !%"'% +1 3,47 Ø0) 3, 6/.( Ø0)-# $&
' 2)-,5 ' (imSitu) *8 20 !%"'% NLP imSitu*- (http://imsitu.org/ )
+. )&$ Ø " >< !,% #&$ Ø!,% #'(%
" >< - % Ø* &$ / 21 NLP
Multimodal Grounding for Language Processing 22
NLP NLP " # !
NLP 23