Slide 42
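Language processing libraries compared
● Information collected as of October 2, 2022
○ Collected Python libraries specialized for Japanese language processing (81 libraries)
■ PyPI package download counts from PePy (= number of pip install runs)
■ GitHub star counts
● Morphological analyzers, dependency parsers, string normalizers, sentence segmenters, etc.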
○ jaconv, mecab-python3, SudachiPy, tinysegmenter, pykakasi, janome, mojimoji, natto-py,
konoha, pyknp, fugashi, unidic-lite, unidic-py, ipadic-py, darts-clone-python, neologdn, nagisa,
ginza, Mykytea-python, UniDic2UD, mecab, camphr, shiba, oseti, sengiri, chirptext,
accel-brain-code, showcase, SuParUniDic, cutlet, pymlask, japanese-numbers-python, nlplot,
SudachiTra, pyknp-eventgraph, cabocha, esupar, donut, depccg, aovec, ginza-transformers,
bunkai, jamdict, rakutenma-python, ja-timex, kyoto-reader, asa-python, asari,
ja_sentence_segmenter, jageocoder, budoux, alphabet2kana, manga-ocr, toiro,
python-vaporetto, dango, negima, jawiki-cleaner, AugLy-jp, namedivider-python,
allennlp-shiba-model, daaja, kuzukiri, mokuro, PyKatsuyou, hirakanadic, jinf, furigana4epub,
pygeonlp, zunda-python, noyaki, desuwa, rhoknp, ishi, jel, hasami, japanese-sentence-breaker,
yubin, japanese2phoneme, kwja, mozcpy (81 libraries)
https://github.com/taishi-i/awesome-japanese-nlp-resources/wiki/PyCon-JP-2022
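The slide does not say how the GitHub star counts were gathered. As a minimal sketch of one possible approach, the public GitHub REST API returns a `stargazers_count` field for each repository. The helper names `parse_star_count` and `fetch_star_count` are hypothetical and are not taken from the talk.

```python
import json
import urllib.request

# Public GitHub REST API endpoint for repository metadata.
GITHUB_API = "https://api.github.com/repos/{owner}/{repo}"


def parse_star_count(payload: str) -> int:
    """Extract the star count from a GitHub repository JSON document."""
    data = json.loads(payload)
    return int(data["stargazers_count"])


def fetch_star_count(owner: str, repo: str, timeout: float = 10.0) -> int:
    """Fetch the current star count of owner/repo (hypothetical helper).

    Unauthenticated requests are rate-limited by GitHub, so collecting
    counts for many repositories would likely need an access token.
    """
    url = GITHUB_API.format(owner=owner, repo=repo)
    request = urllib.request.Request(
        url, headers={"Accept": "application/vnd.github+json"}
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return parse_star_count(response.read().decode("utf-8"))
```

For example, `fetch_star_count("taishi-i", "awesome-japanese-nlp-resources")` would return the star count of the repository linked above at the time of the call, which will differ from the October 2, 2022 snapshot used in the talk.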