Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Spacyでお手軽NLP / NLP with spacy
Search
Sponsored
·
SiteGround - Reliable hosting with speed, security, and support you can count on.
→
himkt
June 13, 2018
Programming
0
1.1k
Spacyでお手軽NLP / NLP with spacy
2018/06/13のレトリバセミナーのスライドです
himkt
June 13, 2018
Tweet
Share
More Decks by himkt
See All by himkt
Linformer: paper reading
himkt
0
590
RoBERTa: paper reading
himkt
1
400
NLP SoTA 勉強会 / ner_2019
himkt
2
1.5k
自然言語処理 @ クックパッド / nlp at cookpad
himkt
1
570
Interpretable Machine Learning 6.3 - Prototypes and Criticisms
himkt
2
180
ニューラル固有表現抽出 / Neural Named Entity Recognition
himkt
3
800
ニューラル固有表現抽出器を実装してみる / PyNER
himkt
6
2.2k
Deep Learning Book 10その2 / deep learning book 10 vol2
himkt
2
200
ふわふわ系列ラベリング / ner 2018
himkt
5
870
Other Decks in Programming
See All in Programming
CSC307 Lecture 09
javiergs
PRO
1
830
Best-Practices-for-Cortex-Analyst-and-AI-Agent
ryotaroikeda
1
100
20260127_試行錯誤の結晶を1冊に。著者が解説 先輩データサイエンティストからの指南書 / author's_commentary_ds_instructions_guide
nash_efp
1
960
AgentCoreとHuman in the Loop
har1101
5
230
AI時代の認知負荷との向き合い方
optfit
0
160
CSC307 Lecture 03
javiergs
PRO
1
490
Apache Iceberg V3 and migration to V3
tomtanaka
0
160
Oxlint JS plugins
kazupon
1
920
カスタマーサクセス業務を変革したヘルススコアの実現と学び
_hummer0724
0
700
AtCoder Conference 2025
shindannin
0
1.1k
疑似コードによるプロンプト記述、どのくらい正確に実行される?
kokuyouwind
0
380
16年目のピクシブ百科事典を支える最新の技術基盤 / The Modern Tech Stack Powering Pixiv Encyclopedia in its 16th Year
ahuglajbclajep
5
1k
Featured
See All Featured
Speed Design
sergeychernyshev
33
1.5k
Navigating the Design Leadership Dip - Product Design Week Design Leaders+ Conference 2024
apolaine
0
170
Why Our Code Smells
bkeepers
PRO
340
58k
Mozcon NYC 2025: Stop Losing SEO Traffic
samtorres
0
140
Color Theory Basics | Prateek | Gurzu
gurzu
0
200
The Language of Interfaces
destraynor
162
26k
Beyond borders and beyond the search box: How to win the global "messy middle" with AI-driven SEO
davidcarrasco
1
51
Primal Persuasion: How to Engage the Brain for Learning That Lasts
tmiket
0
250
Understanding Cognitive Biases in Performance Measurement
bluesmoon
32
2.8k
Leadership Guide Workshop - DevTernity 2021
reverentgeek
1
200
Digital Projects Gone Horribly Wrong (And the UX Pros Who Still Save the Day) - Dean Schuster
uxyall
0
340
Automating Front-end Workflow
addyosmani
1371
200k
Transcript
Ͱ͓खܰ/-1 )JSBNBUTV!ϨτϦόηϛφʔ ը૾IUUQTHJUIVCDPNFYQMPTJPOTQB$ZCMPCNBTUFSXFCTJUFBTTFUTJNHMPHPTWH
Tsukuba, M2, NLP himkt
5-%3 w 1ZUIPOͷࣗવݴޠॲཧϥΠϒϥϦͰ͋Δ4QB$Zͷհ w Wͷ͓͠Ζػೳʹ͍ͭͯ w 4QB$ZͰຊޠςΩετΛॲཧ͢Δ
"CPVU4QB$Z ը૾IUUQTTQBDZJP https://spacy.io
IUUQTTQBDZJP "CPVU4QB$Z “Industrial-Strength NLP” w /POEFTUSVDUJWFUPLFOJ[BUJPO w /BNFEFOUJUZSFDPHOJUJPO w 4VQQPSUGPS
MBOHVBHFT w TUBUJTUJDBMNPEFMTGPSMBOHVBHFT w ʜFUD IUUQTHJUIVCDPNFYQMPTJPOTQB$Z
#BTJDVTBHFPG4QB$Z
.BOZ'FBUVSFT ը૾IUUQTTQBDZJPVTBHFGBDUTpHVSFT
'BTUFTUJOUIFXPSME w %FQFOEFODZQBSTFSͷύϑΥʔϚϯεൺֱ w จIUUQTBDMXFCPSHBOUIPMPHZ111QEG w จʹ͋Δͷ41&&%ͷදͰɼ"DDVSBDZࣗલͰ࡞ΒΕͨͷʁ w $IPJͷϕϯνϚʔΫ࣌ʹTQB$ZWະϦϦʔεͳͷͰOB
ը૾IUUQTTQBDZJPVTBHFGBDUTpHVSFT
4QFFEDPNQBSJTPOXJUIPUIFSMJCSBSJFT w ଞͷࣗવݴޠॲཧϥΠϒϥϦͱͷൺֱ w จͰͳ͘࡞ऀ͕ௐࠪͨ͠ͷ DPOEVDUFEJO w ϦϙδτϦIUUQTHJUIVCDPNFYQMPTJPOTQBDZCFODINBSLT
ը૾IUUQTTQBDZJPVTBHFGBDUTpHVSFT
.PEFMDPNQBSJTPO w ݴޠʹΑͬͯෳͷαΠζͷϞσϧ͕͋Δ FO GS FT w 104UBHHFS /&3UBHHFS
%FQFOEFODZQBSTFS w 8JUIPVUBOZQSFQSPDFTTJOH EBUBTFU ը૾IUUQTTQBDZJPVTBHFGBDUTpHVSFT
/-5,BOE4QB$Z w /-5,P⒎FSTTPNFPGUIFTBNFGVODUJPOBMJUZBTTQB$Z w *ODPNQBSJTPOUPTQB$Z /-5,UBLFTBNVDINPSF CSPBEDIVSDIBQQSPBDI w TQB$ZJTBMTPNVDINPSFQFSGPSNBODFGPDVTTFEUIBO/-5, XIFSFUIFUXPMJCSBSJFTQSPWJEFUIFTBNFGVODUJPOBMJUZ
TQB$ZTJNQMFNFOUBUJPOXJMMVTVBMMZCFGBTUFSBOENPSF BDDVSBUF Ҿ༻IUUQTTQBDZJPVTBHFGBDUTpHVSFT
0UIFSMJCSBSJFTBOETQB$Z 1ZUPSDIIUUQTHJUIVCDPNQZUPSDIQZUPSDICMPCNBTUFSEPDTTPVSDF@TUBUJDJNHQZUPSDIMPHPEBSLTWH "MMFO/-1IUUQTHJUIVCDPNBMMFOBJBMMFOOMQCMPCNBTUFSEPDTUBUJDBMMFOOMQMPHPEBSLQOH (FOTJNIUUQTHJUIVCDPN3B3F5FDIOPMPHJFTHFOTJNCMPCEFWFMPQEPDTTSDSFBENF@JNBHFTSBSFQOH $V1ZIUUQTHJUIVCDPNDVQZDVQZCMPCNBTUFSEPDTJNBHFDVQZ@MPHP@QYQOH JOUPSDIUFYU GPS(16BDDFMFSBUJPO XPSEWFDUPS
QJQFMJOF
4QB$ZWͷݸਓతʹ͖ͳػೳ w EJTQMB$ZͰ͌͢Εʔ͠ʔʁ w 4QB$ZͷՄࢹԽϞδϡʔϧ w ͖Ε͍Ͱ͍͍ײ͡ͳը૾Λ࡞ͬͯ͘ΕΔ w ݻ༗දݱநग़ͱΓड͚ղੳͷ݁ՌΛՄࢹԽͯ͘͠ΕΔ w
4QB$ZͰղੳͨ͠ΦϒδΣΫτΛͦͷ··͑Δ w 47(ܗࣜͷը૾͕ग़ྗ͞ΕΔ
8FCαΠτ্ͷ/&3ͷσϞ ը૾IUUQTFYQMPTJPOBJEFNPTEJTQMBDZFOU
4QB$ZY+VQZUFS/PUFCPPL
4QB$ZBOEຊޠ w WͰͷຊޠରԠ 13 w ຊޠܗଶૉղੳث+BOPNFΛϥοϓ͢ΔܗͰ࣮ w WͰͷܗଶૉղੳثҠߦ *TTVF
13 w ຊޠ6OJWFSTBM%FQFOEFODZσʔλ6OJ%JDͰׂ͞Ε͍ͯΔ w +BOPNFݱࡏͷͱ͜Ζ6OJ%JDʹະରԠ w .F$BCʹҠߦ
4QB$ZBOEຊޠ
4QB$ZBOEຊޠ "OTXFSVTF6OJ%JD
4QB$ZBOEຊޠ
ຊޠ/&3%FQFOEFODZQBSTJOHXJUI4QB$Z w ݁ݱࡏͰ͖ͳ͍ w 4QB$ZʹࣗͰ5BHHFS1BSTFSΛֶशͰ͖Δ ػߏ͕Έࠐ·Ε͍ͯΔ w ຊޠ6OJWFSTBM%FQFOEFODJFTެ։͞Ε͍ͯΔ IUUQTHJUIVCDPN6OJWFSTBM%FQFOEFODJFT6%@+BQBOFTF(4%ͳͲ
Ϟσϧࣗ࡞Ͱ͖ΔͷͰʂʁ
"EEJOH-BOHVBHFTVQQPSU 4QB$ZͷݴޠϞδϡʔϧͷ ίϯϙʔωϯτ ը૾IUUQTTQBDZJPVTBHFBEEJOHMBOHVBHFTTFDUJPOUSBJOJOH
"EEJOH-BOHVBHFTVQQPSU ը૾IUUQTTQBDZJPVTBHFBEEJOHMBOHVBHFTTFDUJPOUSBJOJOH 1PXFSFECZ.F$BC 4QB$ZͷֶशϞδϡʔϧ͕͑ͳ͍ʁ
ຊޠTQB$Zͷݱࡏͷʁ w ࣙॻ͕6OJ%JDͰ͋Δ͜ͱ͕ఆ͞Ε͍ͯΔ JTTVF w ʢ͓ͦΒ͘ʣଟ͘ͷڥͰ.F$BCͷσϑΥϧτࣙॻ*1"EJD w <50%0>+BQBOFTF5PLFOJ[FSͰ5BHHFSΛ࡞͍ͬͯΔ͕ɼ ͜ͷίϯετϥΫλͷҾ֎෦͔Β৮Εͳͦ͏
w ൃԻ͕ະొͷ୯ޠΛղੳ͢ΔͱΤϥʔ *1"EJD 13 w ࣙॻ͝ͱͷग़ྗͷࠩҟΛٵऩ͢ΔΠϯλʔϑΣʔε͕ඞཁʁ w /&3%FQFOEFODZ1BSTJOHͷϞσϧ·ͩଘࡏ͠ͳ͍ w ݱࡏA"MQIBUPLFOJ[BUJPOTVQQPSUA w ͔ͪॻ͖͕ඞཁͳݴޠͷରԠͲ͏͢Εʜ w தࠃޠࣅͨঢ়گͰࢭ·͍ͬͯΔ 13
͜ΕΛΕͱΓ͋͑ͣςετͰ͖ͦ͏ USBWJTͰ.F$BCΛΠϯετʔϧ͢ΔΑ͏ʹ͢Δ TQBDZNPEFMTʹ6OJ%JDΛొ͢Δ QZUIPONTQBDZEPXOMPBEKBΛඋ͢Δ /&3ͱ%FQFOEFODZ1BSTJOH͋ͱͰௐΔʜ
·ͱΊ w ࣗવݴޠॲཧϥΠϒϥϦTQB$Zͷհ w *OEVTUSZൃͷࣗવݴޠॲཧϥΠϒϥϦ w WͰೖͬͨՄࢹԽϞδϡʔϧ͕͍͍ײ͡ w EJTQMB$ZΛ͍͍ͬͨײ͡ͳՄࢹԽ w
TQB$ZͰͷຊޠςΩετॲཧ·ͩෆશ w ݱঢ়ܗଶૉղੳͷΠϯλʔϑΣʔε QJQJOTUBMMTQBDZ 3VO5IJTDPNNBOE