Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Spacyでお手軽NLP / NLP with spacy
Search
himkt
June 13, 2018
Programming
0
1k
Spacyでお手軽NLP / NLP with spacy
2018/06/13のレトリバセミナーのスライドです
himkt
June 13, 2018
Tweet
Share
More Decks by himkt
See All by himkt
Linformer: paper reading
himkt
0
550
RoBERTa: paper reading
himkt
1
350
NLP SoTA 勉強会 / ner_2019
himkt
2
1.4k
自然言語処理 @ クックパッド / nlp at cookpad
himkt
1
530
Interpretable Machine Learning 6.3 - Prototypes and Criticisms
himkt
2
170
ニューラル固有表現抽出 / Neural Named Entity Recognition
himkt
3
760
ニューラル固有表現抽出器を実装してみる / PyNER
himkt
6
2.1k
Deep Learning Book 10その2 / deep learning book 10 vol2
himkt
2
200
ふわふわ系列ラベリング / ner 2018
himkt
5
850
Other Decks in Programming
See All in Programming
Range on Rails ―「多重範囲型」という新たな選択肢が、複雑ロジックを劇的にシンプルにしたワケ
rizap_tech
0
480
CSC305 Lecture 06
javiergs
PRO
0
230
CSC305 Lecture 04
javiergs
PRO
0
270
Go言語の特性を活かした公式MCP SDKの設計
hond0413
1
230
私達はmodernize packageに夢を見るか feat. go/analysis, go/ast / Go Conference 2025
kaorumuta
2
570
登壇は dynamic! な営みである / speech is dynamic
da1chi
0
340
Go Conference 2025: Goで体感するMultipath TCP ― Go 1.24 時代の MPTCP Listener を理解する
takehaya
9
1.7k
AI Coding Meetup #3 - 導入セッション / ai-coding-meetup-3
izumin5210
0
3.3k
スマホから Youtube Shortsを見られないようにする
lemolatoon
27
32k
Web Components で実現する Hotwire とフロントエンドフレームワークの橋渡し / Bridging with Web Components
da1chi
3
2.5k
What Spring Developers Should Know About Jakarta EE
ivargrimstad
0
120
What's new in Spring Modulith?
olivergierke
1
150
Featured
See All Featured
The Success of Rails: Ensuring Growth for the Next 100 Years
eileencodes
46
7.7k
Easily Structure & Communicate Ideas using Wireframe
afnizarnur
194
16k
Optimising Largest Contentful Paint
csswizardry
37
3.5k
A Modern Web Designer's Workflow
chriscoyier
697
190k
The Cult of Friendly URLs
andyhume
79
6.6k
What’s in a name? Adding method to the madness
productmarketing
PRO
24
3.7k
Gamification - CAS2011
davidbonilla
81
5.5k
実際に使うSQLの書き方 徹底解説 / pgcon21j-tutorial
soudai
PRO
189
55k
Bootstrapping a Software Product
garrettdimon
PRO
307
110k
Responsive Adventures: Dirty Tricks From The Dark Corners of Front-End
smashingmag
252
21k
Agile that works and the tools we love
rasmusluckow
331
21k
The Art of Programming - Codeland 2020
erikaheidi
56
14k
Transcript
Ͱ͓खܰ/-1 )JSBNBUTV!ϨτϦόηϛφʔ ը૾IUUQTHJUIVCDPNFYQMPTJPOTQB$ZCMPCNBTUFSXFCTJUFBTTFUTJNHMPHPTWH
Tsukuba, M2, NLP himkt
5-%3 w 1ZUIPOͷࣗવݴޠॲཧϥΠϒϥϦͰ͋Δ4QB$Zͷհ w Wͷ͓͠Ζػೳʹ͍ͭͯ w 4QB$ZͰຊޠςΩετΛॲཧ͢Δ
"CPVU4QB$Z ը૾IUUQTTQBDZJP https://spacy.io
IUUQTTQBDZJP "CPVU4QB$Z “Industrial-Strength NLP” w /POEFTUSVDUJWFUPLFOJ[BUJPO w /BNFEFOUJUZSFDPHOJUJPO w 4VQQPSUGPS
MBOHVBHFT w TUBUJTUJDBMNPEFMTGPSMBOHVBHFT w ʜFUD IUUQTHJUIVCDPNFYQMPTJPOTQB$Z
#BTJDVTBHFPG4QB$Z
.BOZ'FBUVSFT ը૾IUUQTTQBDZJPVTBHFGBDUTpHVSFT
'BTUFTUJOUIFXPSME w %FQFOEFODZQBSTFSͷύϑΥʔϚϯεൺֱ w จIUUQTBDMXFCPSHBOUIPMPHZ111QEG w จʹ͋Δͷ41&&%ͷදͰɼ"DDVSBDZࣗલͰ࡞ΒΕͨͷʁ w $IPJͷϕϯνϚʔΫ࣌ʹTQB$ZWະϦϦʔεͳͷͰOB
ը૾IUUQTTQBDZJPVTBHFGBDUTpHVSFT
4QFFEDPNQBSJTPOXJUIPUIFSMJCSBSJFT w ଞͷࣗવݴޠॲཧϥΠϒϥϦͱͷൺֱ w จͰͳ͘࡞ऀ͕ௐࠪͨ͠ͷ DPOEVDUFEJO w ϦϙδτϦIUUQTHJUIVCDPNFYQMPTJPOTQBDZCFODINBSLT
ը૾IUUQTTQBDZJPVTBHFGBDUTpHVSFT
.PEFMDPNQBSJTPO w ݴޠʹΑͬͯෳͷαΠζͷϞσϧ͕͋Δ FO GS FT w 104UBHHFS /&3UBHHFS
%FQFOEFODZQBSTFS w 8JUIPVUBOZQSFQSPDFTTJOH EBUBTFU ը૾IUUQTTQBDZJPVTBHFGBDUTpHVSFT
/-5,BOE4QB$Z w /-5,P⒎FSTTPNFPGUIFTBNFGVODUJPOBMJUZBTTQB$Z w *ODPNQBSJTPOUPTQB$Z /-5,UBLFTBNVDINPSF CSPBEDIVSDIBQQSPBDI w TQB$ZJTBMTPNVDINPSFQFSGPSNBODFGPDVTTFEUIBO/-5, XIFSFUIFUXPMJCSBSJFTQSPWJEFUIFTBNFGVODUJPOBMJUZ
TQB$ZTJNQMFNFOUBUJPOXJMMVTVBMMZCFGBTUFSBOENPSF BDDVSBUF Ҿ༻IUUQTTQBDZJPVTBHFGBDUTpHVSFT
0UIFSMJCSBSJFTBOETQB$Z 1ZUPSDIIUUQTHJUIVCDPNQZUPSDIQZUPSDICMPCNBTUFSEPDTTPVSDF@TUBUJDJNHQZUPSDIMPHPEBSLTWH "MMFO/-1IUUQTHJUIVCDPNBMMFOBJBMMFOOMQCMPCNBTUFSEPDTUBUJDBMMFOOMQMPHPEBSLQOH (FOTJNIUUQTHJUIVCDPN3B3F5FDIOPMPHJFTHFOTJNCMPCEFWFMPQEPDTTSDSFBENF@JNBHFTSBSFQOH $V1ZIUUQTHJUIVCDPNDVQZDVQZCMPCNBTUFSEPDTJNBHFDVQZ@MPHP@QYQOH JOUPSDIUFYU GPS(16BDDFMFSBUJPO XPSEWFDUPS
QJQFMJOF
4QB$ZWͷݸਓతʹ͖ͳػೳ w EJTQMB$ZͰ͌͢Εʔ͠ʔʁ w 4QB$ZͷՄࢹԽϞδϡʔϧ w ͖Ε͍Ͱ͍͍ײ͡ͳը૾Λ࡞ͬͯ͘ΕΔ w ݻ༗දݱநग़ͱΓड͚ղੳͷ݁ՌΛՄࢹԽͯ͘͠ΕΔ w
4QB$ZͰղੳͨ͠ΦϒδΣΫτΛͦͷ··͑Δ w 47(ܗࣜͷը૾͕ग़ྗ͞ΕΔ
8FCαΠτ্ͷ/&3ͷσϞ ը૾IUUQTFYQMPTJPOBJEFNPTEJTQMBDZFOU
4QB$ZY+VQZUFS/PUFCPPL
4QB$ZBOEຊޠ w WͰͷຊޠରԠ 13 w ຊޠܗଶૉղੳث+BOPNFΛϥοϓ͢ΔܗͰ࣮ w WͰͷܗଶૉղੳثҠߦ *TTVF
13 w ຊޠ6OJWFSTBM%FQFOEFODZσʔλ6OJ%JDͰׂ͞Ε͍ͯΔ w +BOPNFݱࡏͷͱ͜Ζ6OJ%JDʹະରԠ w .F$BCʹҠߦ
4QB$ZBOEຊޠ
4QB$ZBOEຊޠ "OTXFSVTF6OJ%JD
4QB$ZBOEຊޠ
ຊޠ/&3%FQFOEFODZQBSTJOHXJUI4QB$Z w ݁ݱࡏͰ͖ͳ͍ w 4QB$ZʹࣗͰ5BHHFS1BSTFSΛֶशͰ͖Δ ػߏ͕Έࠐ·Ε͍ͯΔ w ຊޠ6OJWFSTBM%FQFOEFODJFTެ։͞Ε͍ͯΔ IUUQTHJUIVCDPN6OJWFSTBM%FQFOEFODJFT6%@+BQBOFTF(4%ͳͲ
Ϟσϧࣗ࡞Ͱ͖ΔͷͰʂʁ
"EEJOH-BOHVBHFTVQQPSU 4QB$ZͷݴޠϞδϡʔϧͷ ίϯϙʔωϯτ ը૾IUUQTTQBDZJPVTBHFBEEJOHMBOHVBHFTTFDUJPOUSBJOJOH
"EEJOH-BOHVBHFTVQQPSU ը૾IUUQTTQBDZJPVTBHFBEEJOHMBOHVBHFTTFDUJPOUSBJOJOH 1PXFSFECZ.F$BC 4QB$ZͷֶशϞδϡʔϧ͕͑ͳ͍ʁ
ຊޠTQB$Zͷݱࡏͷʁ w ࣙॻ͕6OJ%JDͰ͋Δ͜ͱ͕ఆ͞Ε͍ͯΔ JTTVF w ʢ͓ͦΒ͘ʣଟ͘ͷڥͰ.F$BCͷσϑΥϧτࣙॻ*1"EJD w <50%0>+BQBOFTF5PLFOJ[FSͰ5BHHFSΛ࡞͍ͬͯΔ͕ɼ ͜ͷίϯετϥΫλͷҾ֎෦͔Β৮Εͳͦ͏
w ൃԻ͕ະొͷ୯ޠΛղੳ͢ΔͱΤϥʔ *1"EJD 13 w ࣙॻ͝ͱͷग़ྗͷࠩҟΛٵऩ͢ΔΠϯλʔϑΣʔε͕ඞཁʁ w /&3%FQFOEFODZ1BSTJOHͷϞσϧ·ͩଘࡏ͠ͳ͍ w ݱࡏA"MQIBUPLFOJ[BUJPOTVQQPSUA w ͔ͪॻ͖͕ඞཁͳݴޠͷରԠͲ͏͢Εʜ w தࠃޠࣅͨঢ়گͰࢭ·͍ͬͯΔ 13
͜ΕΛΕͱΓ͋͑ͣςετͰ͖ͦ͏ USBWJTͰ.F$BCΛΠϯετʔϧ͢ΔΑ͏ʹ͢Δ TQBDZNPEFMTʹ6OJ%JDΛొ͢Δ QZUIPONTQBDZEPXOMPBEKBΛඋ͢Δ /&3ͱ%FQFOEFODZ1BSTJOH͋ͱͰௐΔʜ
·ͱΊ w ࣗવݴޠॲཧϥΠϒϥϦTQB$Zͷհ w *OEVTUSZൃͷࣗવݴޠॲཧϥΠϒϥϦ w WͰೖͬͨՄࢹԽϞδϡʔϧ͕͍͍ײ͡ w EJTQMB$ZΛ͍͍ͬͨײ͡ͳՄࢹԽ w
TQB$ZͰͷຊޠςΩετॲཧ·ͩෆશ w ݱঢ়ܗଶૉղੳͷΠϯλʔϑΣʔε QJQJOTUBMMTQBDZ 3VO5IJTDPNNBOE