A 5 minute talk at PyData London on 7 Feb.
Get the word similarity you need
Community Manager at Gensim
We turn NLP papers into industrial Python code.
University of Delhi, India
RaReTech Incubator program
Added WordRank to Gensim
“What does Elizabeth think about Mr Darcy?”
“Male characters in Pride and Prejudice?”
1) What words are in the topic of “Darcy”?
2) What are the Named Entities in the text?
P&P is only 120k words
Closest word to “king”?
Trained on Wikipedia 17m words
Attribute Interchangeable Both
Tensorflow has awesome viz!
How to get the similarity you need
My similar words must
I want to describes the
I want to Know what doc is about Recognize names
Then I should run Wordrank (even on small
corpus, 1m words)
Word2vec skipgram big
window needs large corpus
Word2vec skipgram small
Rare and Frequent words are
Gensim T-shirt question:
How many words are in
Pride and Prejudice?