Literature Review, June 12
gumigumi7
June 12, 2018
A La Carte Embedding: Cheap but Effective Induction of Semantic Feature Vectors
Transcript
A La Carte Embedding: Cheap but Effective Induction of Semantic Feature Vectors
▪ Mikhail Khodak, Nikunj Saunshi, Yingyu Liang, Tengyu Ma, Brandon Stewart, Sanjeev Arora.
▪ A La Carte Embedding: Cheap but Effective Induction of Semantic Feature Vectors.
▪ Proceedings of the Association for Computational Linguistics. 2018.
▪ Synset
▪ N-gram embeddings
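▪ $\mathcal{C}_f \subset \mathcal{C}_V$: the contexts in the corpus $\mathcal{C}_V$ that contain the feature $f$
▪ $v_w \in \mathbb{R}^d$: pretrained word embedding (e.g. Word2Vec)
▪ $A \in \mathbb{R}^{d \times d}$: linear transform applied to the context average, $v_f = A \cdot \frac{1}{|\mathcal{C}_f|} \sum_{c \in \mathcal{C}_f} \sum_{w \in c} v_w$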
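A la carte embedding induces a vector for a new feature by averaging the pretrained vectors found in its contexts and then applying a linear transform A, where A is fit by linear regression on words whose embeddings are already known. The following is a minimal pure-Python sketch of that idea, not the authors' implementation. The function names (`context_average`, `fit_transform`, `induce`), the normal-equations solver, and the choice to skip out-of-vocabulary words are all assumptions made for illustration.

```python
from typing import Dict, List, Sequence

Vector = List[float]
Matrix = List[List[float]]


def context_average(contexts: Sequence[Sequence[str]],
                    vectors: Dict[str, Vector]) -> Vector:
    """Sum the known word vectors inside each context, then average over contexts."""
    if not contexts:
        raise ValueError("need at least one context")
    dim = len(next(iter(vectors.values())))
    total = [0.0] * dim
    for context in contexts:
        for word in context:
            if word in vectors:  # assumption: out-of-vocabulary words are skipped
                for i, x in enumerate(vectors[word]):
                    total[i] += x
    return [x / len(contexts) for x in total]


def solve(G: Matrix, B: Matrix) -> Matrix:
    """Solve G X = B by Gauss-Jordan elimination with partial pivoting."""
    n = len(G)
    aug = [list(G[i]) + list(B[i]) for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("singular system: need more varied training words")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [x / p for x in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def fit_transform(U: Matrix, V: Matrix) -> Matrix:
    """Least-squares A minimising sum_w ||v_w - A u_w||^2.

    Rows of U are context averages u_w; rows of V are the known vectors v_w.
    """
    d = len(U[0])
    G = [[sum(u[i] * u[j] for u in U) for j in range(d)] for i in range(d)]
    B = [[sum(u[i] * v[j] for u, v in zip(U, V)) for j in range(d)]
         for i in range(d)]
    X = solve(G, B)  # X equals the transpose of A
    return [[X[j][i] for j in range(d)] for i in range(d)]


def apply(A: Matrix, u: Vector) -> Vector:
    """Matrix-vector product A u."""
    return [sum(a * x for a, x in zip(row, u)) for row in A]


def induce(contexts: Sequence[Sequence[str]],
           vectors: Dict[str, Vector], A: Matrix) -> Vector:
    """Embedding for a new feature: A times its context average."""
    return apply(A, context_average(contexts, vectors))


if __name__ == "__main__":
    # Toy 2-dimensional vectors, invented purely for this demonstration.
    toy = {"cat": [1.0, 0.0], "dog": [0.0, 1.0]}
    identity = [[1.0, 0.0], [0.0, 1.0]]
    print(induce([["cat", "dog"], ["cat", "unknown"]], toy, identity))
```

For realistic dimensions a numerical library's least-squares routine would be the more practical choice; the hand-written solver here only keeps the sketch dependency-free.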
▪ Similarity Correlation
▪ Rare Words
▪ Learning Embeddings of New Concepts
▪ Nonce
▪ Chimera
▪ Word Sense Disambiguation
▪ Unsupervised Document Classification
▪ Similarity Correlation
▪ Nonce and Chimera
▪ Word Sense Disambiguation
▪ PWN synsets
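A similarity-correlation evaluation such as the one listed above is commonly scored by comparing model similarities with human ratings. The sketch below assumes cosine similarity between embeddings and Spearman rank correlation against the human scores. The function names and the toy data in the demo are hypothetical, and the exact protocol used in the slides is not recoverable from the transcript.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def average_ranks(values: Sequence[float]) -> List[float]:
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation; undefined (division by zero) if either input is constant."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman correlation computed as the Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def similarity_correlation(pairs: Sequence[Tuple[str, str]],
                           human_scores: Sequence[float],
                           vectors: Dict[str, Sequence[float]]) -> float:
    """Correlate model cosine similarities with human similarity ratings."""
    model_scores = [cosine(vectors[a], vectors[b]) for a, b in pairs]
    return spearman(model_scores, human_scores)


if __name__ == "__main__":
    # Toy vectors and ratings, invented purely for this demonstration.
    toy = {"a": [1.0, 0.0], "b": [1.0, 0.1], "c": [0.0, 1.0]}
    word_pairs = [("a", "b"), ("a", "c"), ("b", "c")]
    print(similarity_correlation(word_pairs, [9.0, 1.0, 2.0], toy))
```

Only the ordering of the scores matters for a rank correlation, so the model's similarities and the human ratings do not need to share a scale.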
A la carte embedding