Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Spacyでお手軽NLP / NLP with spacy
Search
himkt
June 13, 2018
Programming
0
1k
Spacyでお手軽NLP / NLP with spacy
2018/06/13のレトリバセミナーのスライドです
himkt
June 13, 2018
Tweet
Share
More Decks by himkt
See All by himkt
Linformer: paper reading
himkt
0
460
RoBERTa: paper reading
himkt
1
330
NLP SoTA 勉強会 / ner_2019
himkt
2
1.4k
自然言語処理 @ クックパッド / nlp at cookpad
himkt
1
500
Interpretable Machine Learning 6.3 - Prototypes and Criticisms
himkt
2
150
ニューラル固有表現抽出 / Neural Named Entity Recognition
himkt
3
700
ニューラル固有表現抽出器を実装してみる / PyNER
himkt
6
2.1k
Deep Learning Book 10その2 / deep learning book 10 vol2
himkt
2
180
ふわふわ系列ラベリング / ner 2018
himkt
5
850
Other Decks in Programming
See All in Programming
Bedrock×MCPで社内ブログ執筆文化を育てたい!
har1101
6
1k
一緒に働きたくなるプログラマの思想 #QiitaConference
mu_zaru
59
14k
七輪ライブラリー: Claude AI で作る Next.js アプリ
suneo3476
1
110
「影響が少ない」を自分の目でみてみる
o0h
PRO
2
1.1k
趣味全開のAITuber開発
kokushin
0
200
AI時代の開発者評価について
ayumuu
0
180
RubyKaigi Dev Meeting 2025
tenderlove
1
210
Fiber Scheduler vs. General-Purpose Parallel Client
hayaokimura
1
110
DataStoreをテストする
mkeeda
0
290
Amazon CloudWatchの地味だけど強力な機能紹介!
itotsum
0
180
Do Dumb Things
mitsuhiko
0
440
新しいPHP拡張モジュールインストール方法「PHP Installer for Extensions (PIE)」を使ってみよう!
cocoeyes02
0
410
Featured
See All Featured
The World Runs on Bad Software
bkeepers
PRO
67
11k
A Modern Web Designer's Workflow
chriscoyier
693
190k
Build your cross-platform service in a week with App Engine
jlugia
229
18k
Keith and Marios Guide to Fast Websites
keithpitt
411
22k
The Illustrated Children's Guide to Kubernetes
chrisshort
48
49k
Why Our Code Smells
bkeepers
PRO
336
57k
It's Worth the Effort
3n
184
28k
Exploring the Power of Turbo Streams & Action Cable | RailsConf2023
kevinliebholz
32
5.4k
Typedesign – Prime Four
hannesfritz
41
2.6k
Code Review Best Practice
trishagee
67
18k
Bootstrapping a Software Product
garrettdimon
PRO
307
110k
Stop Working from a Prison Cell
hatefulcrawdad
268
20k
Transcript
Ͱ͓खܰ/-1 )JSBNBUTV!ϨτϦόηϛφʔ ը૾IUUQTHJUIVCDPNFYQMPTJPOTQB$ZCMPCNBTUFSXFCTJUFBTTFUTJNHMPHPTWH
Tsukuba, M2, NLP himkt
5-%3 w 1ZUIPOͷࣗવݴޠॲཧϥΠϒϥϦͰ͋Δ4QB$Zͷհ w Wͷ͓͠Ζػೳʹ͍ͭͯ w 4QB$ZͰຊޠςΩετΛॲཧ͢Δ
"CPVU4QB$Z ը૾IUUQTTQBDZJP https://spacy.io
IUUQTTQBDZJP "CPVU4QB$Z “Industrial-Strength NLP” w /POEFTUSVDUJWFUPLFOJ[BUJPO w /BNFEFOUJUZSFDPHOJUJPO w 4VQQPSUGPS
MBOHVBHFT w TUBUJTUJDBMNPEFMTGPSMBOHVBHFT w ʜFUD IUUQTHJUIVCDPNFYQMPTJPOTQB$Z
#BTJDVTBHFPG4QB$Z
.BOZ'FBUVSFT ը૾IUUQTTQBDZJPVTBHFGBDUTpHVSFT
'BTUFTUJOUIFXPSME w %FQFOEFODZQBSTFSͷύϑΥʔϚϯεൺֱ w จIUUQTBDMXFCPSHBOUIPMPHZ111QEG w จʹ͋Δͷ41&&%ͷදͰɼ"DDVSBDZࣗલͰ࡞ΒΕͨͷʁ w $IPJͷϕϯνϚʔΫ࣌ʹTQB$ZWະϦϦʔεͳͷͰOB
ը૾IUUQTTQBDZJPVTBHFGBDUTpHVSFT
4QFFEDPNQBSJTPOXJUIPUIFSMJCSBSJFT w ଞͷࣗવݴޠॲཧϥΠϒϥϦͱͷൺֱ w จͰͳ͘࡞ऀ͕ௐࠪͨ͠ͷ DPOEVDUFEJO w ϦϙδτϦIUUQTHJUIVCDPNFYQMPTJPOTQBDZCFODINBSLT
ը૾IUUQTTQBDZJPVTBHFGBDUTpHVSFT
.PEFMDPNQBSJTPO w ݴޠʹΑͬͯෳͷαΠζͷϞσϧ͕͋Δ FO GS FT w 104UBHHFS /&3UBHHFS
%FQFOEFODZQBSTFS w 8JUIPVUBOZQSFQSPDFTTJOH EBUBTFU ը૾IUUQTTQBDZJPVTBHFGBDUTpHVSFT
/-5,BOE4QB$Z w /-5,P⒎FSTTPNFPGUIFTBNFGVODUJPOBMJUZBTTQB$Z w *ODPNQBSJTPOUPTQB$Z /-5,UBLFTBNVDINPSF CSPBEDIVSDIBQQSPBDI w TQB$ZJTBMTPNVDINPSFQFSGPSNBODFGPDVTTFEUIBO/-5, XIFSFUIFUXPMJCSBSJFTQSPWJEFUIFTBNFGVODUJPOBMJUZ
TQB$ZTJNQMFNFOUBUJPOXJMMVTVBMMZCFGBTUFSBOENPSF BDDVSBUF Ҿ༻IUUQTTQBDZJPVTBHFGBDUTpHVSFT
0UIFSMJCSBSJFTBOETQB$Z 1ZUPSDIIUUQTHJUIVCDPNQZUPSDIQZUPSDICMPCNBTUFSEPDTTPVSDF@TUBUJDJNHQZUPSDIMPHPEBSLTWH "MMFO/-1IUUQTHJUIVCDPNBMMFOBJBMMFOOMQCMPCNBTUFSEPDTUBUJDBMMFOOMQMPHPEBSLQOH (FOTJNIUUQTHJUIVCDPN3B3F5FDIOPMPHJFTHFOTJNCMPCEFWFMPQEPDTTSDSFBENF@JNBHFTSBSFQOH $V1ZIUUQTHJUIVCDPNDVQZDVQZCMPCNBTUFSEPDTJNBHFDVQZ@MPHP@QYQOH JOUPSDIUFYU GPS(16BDDFMFSBUJPO XPSEWFDUPS
QJQFMJOF
4QB$ZWͷݸਓతʹ͖ͳػೳ w EJTQMB$ZͰ͌͢Εʔ͠ʔʁ w 4QB$ZͷՄࢹԽϞδϡʔϧ w ͖Ε͍Ͱ͍͍ײ͡ͳը૾Λ࡞ͬͯ͘ΕΔ w ݻ༗දݱநग़ͱΓड͚ղੳͷ݁ՌΛՄࢹԽͯ͘͠ΕΔ w
4QB$ZͰղੳͨ͠ΦϒδΣΫτΛͦͷ··͑Δ w 47(ܗࣜͷը૾͕ग़ྗ͞ΕΔ
8FCαΠτ্ͷ/&3ͷσϞ ը૾IUUQTFYQMPTJPOBJEFNPTEJTQMBDZFOU
4QB$ZY+VQZUFS/PUFCPPL
4QB$ZBOEຊޠ w WͰͷຊޠରԠ 13 w ຊޠܗଶૉղੳث+BOPNFΛϥοϓ͢ΔܗͰ࣮ w WͰͷܗଶૉղੳثҠߦ *TTVF
13 w ຊޠ6OJWFSTBM%FQFOEFODZσʔλ6OJ%JDͰׂ͞Ε͍ͯΔ w +BOPNFݱࡏͷͱ͜Ζ6OJ%JDʹະରԠ w .F$BCʹҠߦ
4QB$ZBOEຊޠ
4QB$ZBOEຊޠ "OTXFSVTF6OJ%JD
4QB$ZBOEຊޠ
ຊޠ/&3%FQFOEFODZQBSTJOHXJUI4QB$Z w ݁ݱࡏͰ͖ͳ͍ w 4QB$ZʹࣗͰ5BHHFS1BSTFSΛֶशͰ͖Δ ػߏ͕Έࠐ·Ε͍ͯΔ w ຊޠ6OJWFSTBM%FQFOEFODJFTެ։͞Ε͍ͯΔ IUUQTHJUIVCDPN6OJWFSTBM%FQFOEFODJFT6%@+BQBOFTF(4%ͳͲ
Ϟσϧࣗ࡞Ͱ͖ΔͷͰʂʁ
"EEJOH-BOHVBHFTVQQPSU 4QB$ZͷݴޠϞδϡʔϧͷ ίϯϙʔωϯτ ը૾IUUQTTQBDZJPVTBHFBEEJOHMBOHVBHFTTFDUJPOUSBJOJOH
"EEJOH-BOHVBHFTVQQPSU ը૾IUUQTTQBDZJPVTBHFBEEJOHMBOHVBHFTTFDUJPOUSBJOJOH 1PXFSFECZ.F$BC 4QB$ZͷֶशϞδϡʔϧ͕͑ͳ͍ʁ
ຊޠTQB$Zͷݱࡏͷʁ w ࣙॻ͕6OJ%JDͰ͋Δ͜ͱ͕ఆ͞Ε͍ͯΔ JTTVF w ʢ͓ͦΒ͘ʣଟ͘ͷڥͰ.F$BCͷσϑΥϧτࣙॻ*1"EJD w <50%0>+BQBOFTF5PLFOJ[FSͰ5BHHFSΛ࡞͍ͬͯΔ͕ɼ ͜ͷίϯετϥΫλͷҾ֎෦͔Β৮Εͳͦ͏
w ൃԻ͕ະొͷ୯ޠΛղੳ͢ΔͱΤϥʔ *1"EJD 13 w ࣙॻ͝ͱͷग़ྗͷࠩҟΛٵऩ͢ΔΠϯλʔϑΣʔε͕ඞཁʁ w /&3%FQFOEFODZ1BSTJOHͷϞσϧ·ͩଘࡏ͠ͳ͍ w ݱࡏA"MQIBUPLFOJ[BUJPOTVQQPSUA w ͔ͪॻ͖͕ඞཁͳݴޠͷରԠͲ͏͢Εʜ w தࠃޠࣅͨঢ়گͰࢭ·͍ͬͯΔ 13
͜ΕΛΕͱΓ͋͑ͣςετͰ͖ͦ͏ USBWJTͰ.F$BCΛΠϯετʔϧ͢ΔΑ͏ʹ͢Δ TQBDZNPEFMTʹ6OJ%JDΛొ͢Δ QZUIPONTQBDZEPXOMPBEKBΛඋ͢Δ /&3ͱ%FQFOEFODZ1BSTJOH͋ͱͰௐΔʜ
·ͱΊ w ࣗવݴޠॲཧϥΠϒϥϦTQB$Zͷհ w *OEVTUSZൃͷࣗવݴޠॲཧϥΠϒϥϦ w WͰೖͬͨՄࢹԽϞδϡʔϧ͕͍͍ײ͡ w EJTQMB$ZΛ͍͍ͬͨײ͡ͳՄࢹԽ w
TQB$ZͰͷຊޠςΩετॲཧ·ͩෆશ w ݱঢ়ܗଶૉղੳͷΠϯλʔϑΣʔε QJQJOTUBMMTQBDZ 3VO5IJTDPNNBOE