Slide 2
Slide 2 text
Sofie Van Landeghem http://www.oxykodit.com
2
spaCy
➢
Focus on production usage
➢
Speed & efficiency
➢
Python + Cython
➢
Comparison to other NLP libraries: https://spacy.io/usage/facts-figures
➢
Open source (MIT license): https://github.com/explosion/spaCy/
➢
Created by Explosion AI (Ines Montani & Matthew Honnibal)
➢
Tokenization (50 languages), lemmatization, POS tagging, dependency parsing
➢
NER, text classification, rule-based matching (API + one implementation)
➢
Word vectors, BERT-style pre-training
➢
Statistical models in 10 languages (v. 2.2): DE, EN, EL, ES, FR, IT, LT, NL, NB, PT
➢
One multi-lingual NER model containing DE, EN, ES, FR, IT, PT, RU