word2vec

Word2vec Word representation in Vector Space Javier Honduvilla Coto

What’s word2vec? • Vector representation of words • Uses neural
networks (more on the training later) • Unsupervised • Published in 2013 by Google researchers and engineers • A companion C implementation was published with the paper

Why? Image and video representation is pretty rich, usually done
with humongous vectors – commonly having a high dimensionality. Meanwhile, words are usually mapped to arbitrary IDs such as the word itself.

Previous work • Counting based methods: probability of a word
happening with some neighbour words • Predictive models: guess using nearby words’ vectors

Cool things of this model • Continuous Bag of Words:
predict a word using previous words (good in small models) • Skip-Gram: predict words which are close, from the context from an input word (good for big models) => • Pretty good performance (100 billions words/day in a single box) • 33 billions: 72% accuracy

Example (distance to Sweden)

Vector operations!!

Appendix • Original paper: http://papers.nips.cc/paper/5021-distributed-representations-of-words- and-phrases-and-their-compositionality.pdf • Original implementation: https://code.google.com/archive/p/word2vec
• Interesting JVM implementation https://deeplearning4j.org/word2vec

word2vec

word2vec

Javier Honduvilla Coto

More Decks by Javier Honduvilla Coto

Other Decks in Programming

Featured

Transcript

Word2vec Word representation in Vector Space Javier Honduvilla Coto

What’s word2vec? • Vector representation of words • Uses neural

Why? Image and video representation is pretty rich, usually done

Previous work • Counting based methods: probability of a word

Cool things of this model • Continuous Bag of Words:

Example (distance to Sweden)

Vector operations!!

Appendix • Original paper: http://papers.nips.cc/paper/5021-distributed-representations-of-words- and-phrases-and-their-compositionality.pdf • Original implementation: https://code.google.com/archive/p/word2vec