Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
How to start studying NLP 02
Search
Sponsored
·
SiteGround - Reliable hosting with speed, security, and support you can count on.
→
kabayan55
February 18, 2019
Programming
5.3k
7
Share
How to start studying NLP 02
kabayan55
February 18, 2019
More Decks by kabayan55
See All by kabayan55
My favorite tool 2019
kabayan55
2
1.7k
Escalators are Awesome
kabayan55
2
1.5k
How to start studying NLP
kabayan55
0
360
Other Decks in Programming
See All in Programming
【26新卒研修資料】TDD実装演習
dip_tech
PRO
0
160
(Re)make Regexp in Ruby: Democratizing internals for the JIT
makenowjust
3
970
Programming with a DJ Controller — not vibe coding
m_seki
3
750
書籍「ユーザーストーリーマッピング」が私のバイブル
asumikam
4
470
Claude Codeをカスタムして自分だけのClaude Codeを作ろう
terisuke
0
160
AWSコミュニティ活動は顧客のクラウド推進に効くのか / Do AWS community activities help customers adopt the cloud?
seike460
PRO
0
160
Running Swift without an OS
kishikawakatsumi
0
880
ハーネスエンジニアリングとは?
kinopeee
13
6.7k
Liberating Ruby's Parser from Lexer Hacks
ydah
2
2.5k
mruby on C#: From VM Implementation to Game Scripting (RubyKaigi 2026)
hadashia
2
1.5k
「話せることがない」を乗り越える 〜日常業務から登壇テーマをつくる思考法〜
shoheimitani
4
960
Oxlintとeslint-plugin-react-hooks 明日から始められそう?
t6adev
0
320
Featured
See All Featured
30 Presentation Tips
portentint
PRO
1
290
We Have a Design System, Now What?
morganepeng
55
8.1k
DBのスキルで生き残る技術 - AI時代におけるテーブル設計の勘所
soudai
PRO
65
54k
How to build an LLM SEO readiness audit: a practical framework
nmsamuel
1
730
Save Time (by Creating Custom Rails Generators)
garrettdimon
PRO
32
3k
How STYLIGHT went responsive
nonsquared
100
6.1k
Un-Boring Meetings
codingconduct
0
280
Thoughts on Productivity
jonyablonski
76
5.1k
Leadership Guide Workshop - DevTernity 2021
reverentgeek
1
270
"I'm Feeling Lucky" - Building Great Search Experiences for Today's Users (#IAC19)
danielanewman
231
23k
The Invisible Side of Design
smashingmag
302
52k
Gemini Prompt Engineering: Practical Techniques for Tangible AI Outcomes
mfonobong
2
380
Transcript
ʲॳ৺ऀ͚ʳ ɹ͡ΊͯΈΑ͏ʂࣗવݴޠॲཧ ɹɹࣗવݴޠॲཧͷੈքɺΑ͏ͦ͜ αϙʔλʔζ$P-BCษڧձ ݄ LBCBZBO
LBCBZBO େֶɾେֶӃͷݚڀͰࣗવݴޠॲཧ 8FCܥاۀ৽ଔ σʔλαΠΤϯεΤϯδχΞ ࣗݾհ
Agenda ࣗવݴޠॲཧͰͰ͖Δ͜ͱ ࣗવݴޠॲཧͷษڧ๏
Agenda ࣗવݴޠॲཧͰͰ͖Δ͜ͱ ࣗવݴޠॲཧͷษڧ๏
ࣗવݴޠΛίϯϐϡʔλͰॲཧ͢Δ ࣗવݴޠɿਓ͕ؒৗతʹͬͯΔݴޠ ɹɹɹɹɹྫ ຊޠɺӳޠ ੜ·Εͨͱ͖͔Βۙʹ͋ΔࣗવݴޠΛ ίϯϐϡʔλͰॲཧͰ͖Δͬͯ ͳΜ͔ͩͦ͢͝͏ʂ ʜʜͱ࠷ॳࢲࢥ͍·ͨ͠ ࣗવݴޠॲཧͬͯͳʹʁ
֓ཁਤ ⽂書分類 ⾃動要約 情報抽出 機械翻訳 質問応答 情報検索 評判分析 形態素解析 構⽂解析
意味解析 要素技術 複合技術 etc.
ܗଶૉղੳ ܗଶૉʢ୯ޠʣʹ͚ͯࢺผ .F$BC +6."/ͳͲ $ mecab すもももももももものうち すもも 名詞,⼀般,*,*,*,*,すもも,スモモ,スモモ も 助詞,係助詞,*,*,*,*,も,モ,モ もも 名詞,⼀般,*,*,*,*,もも,モモ,モモ
も 助詞,係助詞,*,*,*,*,も,モ,モ もも 名詞,⼀般,*,*,*,*,もも,モモ,モモ の 助詞,連体化,*,*,*,*,の,ノ,ノ うち名詞,⾮⾃⽴,副詞可能,*,*,*,うち,ウチ,ウチ EOS ཁૉٕज़
ߏจղੳ ,/1 $BCP$IB ͳͲ ཁૉٕज़ Wikipedia より
ҙຯղੳ ߏจతᐆດੑ͕͋Δͱ͖ ҙຯղੳ͕ඞཁ ྫ ʮ಄͕͍ڕΛ৯Δೣʯ தଜ໌༟͞Μ !OLNS@BLJ ͷ5XJUUFSΑΓ ཁૉٕज़
จॻྨ จॻΛΧςΰϦ͝ͱʹ͚Δ ࣗಈཁ จষΛࣗಈͰཁ͢Δ ใநग़ ΩʔϫʔυΛநग़͢Δ ྫʣΠϕϯτใநग़ɺใநग़ ෳ߹ٕज़
ෳ߹ٕज़ ධੳ ྫ ϨϏϡʔจ Positive Negative ͜ͷέʔΩ͍ͪ͝ͷ ͕͞ࡍཱͬͯඒຯͰͨ͠ɻ ·ͨߪೖ͍ͨ͠Ͱ͢ɻ ΫϦʔϜ͕͗ͨ͢ɻ
εϙϯδ͕ύαύαͩͬͨɻ
ෳ߹ٕज़ ػց༁ ใݕࡧ ࣭Ԡ
୯ޠΛϕΫτϧͰදݱͰ͖Δ ୯ޠͷ͠ࢉҾ͖ࢉ͕Ͱ͖Δ ྫ LJOHrNBO XPNBORVFFO ୯ޠͷྨࣅ͕Θ͔Δ χϡʔϥϧωοτϫʔΫ ٕज़հ8PSE7FD King Queen
Woman Man
8PSE7FDͱͷҧ͍ɿ׆༻ܗΛ·ͱΊΒΕΔ ྫ HP HPJOH HPFTˠHP ٕज़հGBTU5FYU
݄ʹ(PPHMF͕ެ։ ൚༻తͳϞσϧ ϑΝΠϯνϡʔχϯάͰߴ͍ਫ਼Λग़͢ ٕज़հ#&35
Agenda ࣗવݴޠॲཧͰͰ͖Δ͜ͱ ࣗવݴޠॲཧͷษڧ๏
ࢲPythonΛ༻͍ͯ͠·͢ Python͕ਓؾʂ ϝϦοτ ! εΫϦϓτݴޠͳͷͰ͙͢ʹ࣮ߦͰ͖Δ ! ๛ͳϥΠϒϥϦ ɹ/VNQZ 4DJQZ /-5, 4DJLJUMFBSO ϓϩάϥϛϯάݴޠʁ
͓͢͢Ίڭࡐ
ݴޠॲཧຊϊοΫ http://www.cl.ecei.tohoku.ac.jp/nlp100/
ݴޠॲཧຊϊοΫ ! ౦େͷԬ࡚ઌੜ͕࡞ͨ͠ νϡʔτϦΞϧ ! Pythonͷ࿅शʹͳΔ ! ݴޠॲཧʹඞཁͳ࣮͜͜ͰֶΔ ! GitHubʹίʔυΛ্͛ͯΔͻͱଟ͘ɺ ଞͷਓͷίʔυΛࢀߟʹͰ͖ΔͷͰ ಠֶ͍͢͠
ݴޠॲཧຊϊοΫ
ݴޠॲཧຊϊοΫ GitHubͰ “NLP100knock” ͱ ݕࡧ͢Δ͚ͩͰɺ 86 ϦϙδτϦ ݟ͔ͭΔ ˞20189݄࣌
ར༻ऀͨ͘͞Μ ͍·͢
/-1ϓϩάϥϛϯάνϡʔτϦΞϧ http://phontron.com/teaching.php
/-1ϓϩάϥϛϯάνϡʔτϦΞϧ http://phontron.com/teaching.php
/-1ϓϩάϥϛϯάνϡʔτϦΞϧ ! ΧʔωΪʔϝϩϯେֶͷ Graham Neubig ઌੜ͕࡞ͨ͠ νϡʔτϦΞϧ ! εϥΠυܗࣜ ! ֤νϡʔτϦΞϧʹԋश͕͋Γɺ ٖࣅίʔυͱߨٛεϥΠυΛࢀߟʹ ࣮͢Δͱཧղ͕ਂ·Δ
! ࣜΑΓίʔυΛݟͨ΄͏͕ ཧղ͍͢͠ਓʹಛʹΦεεϝ
/-1ϓϩάϥϛϯάνϡʔτϦΞϧ ࢿྉɾԋशσʔλ ͔͜͜Β Ұׅμϯϩʔυʂ https://github.com/neubig/nlptutorial
ࣗવݴޠॲཧΛಠश͍ͨ͠ਓͷͨΊʹ http://cl.sd.tmu.ac.jp/prospective/prerequisite
ࣗવݴޠॲཧΛಠश͍ͨ͠ਓͷͨΊʹ टେֶ౦ژͷখொઌੜ͕ ! ֶ ! ӳޠ ! ϓϩάϥϛϯά ! ػցֶश ! ࣗવݴޠॲཧ ͷษڧͷํʹ͍ͭͯ ·ͱΊ͍ͯΔϖʔδ
ࣗવݴޠॲཧΛಠश͍ͨ͠ਓͷͨΊʹ ࠓճॳ৺ऀ͚ͷߨٛͳͷͰ հ͚ͩʹͱͲΊ͓͖ͯ·͕͢ Կͷษڧ͕ඞཁͰ Ͳ͏ษڧ͖͔͢ ஸೡʹΘ͔Γ͘͢·ͱ·͍ͬͯΔͷͰ ੋඇ͝ཡʹͳͬͯ΄͍͠Ͱ͢ʂ
⻑岡技術科学⼤学⾃然⾔語処理研究室(YouTube) IUUQTXXXZPVUVCFDPNVTFSKOMQPSH ʮษڧձʯ͔ΒݟΔͱྑ͍ͱࢥ͍·͢
LBHHMF ࣗવݴޠॲཧܥͷίϯϖ͋Δ Θͨ͠/-1ͷίϯϖग़ͨ͜ͱͳ͍Ͱ͢
ࣗવݴޠॲཧΤϯδχΞʹͳΓ͍ͨਓ ! ػցֶशΤϯδχΞʹͳͬͯ ࣗવݴޠॲཧΔ ! ࣗવݴޠॲཧٕज़ʹಛԽͨ͠اۀʹߦ͘
ػցֶशΤϯδχΞʹͳΓ͍ͨਓ Φεεϝॻ੶ ʰػցֶशΤϯδχΞʹͳΓ͍ͨਓͷ ɹͨΊͷຊ"*Λఱ৬ʹ͢Δʱ ! ԿΛ͢Ε͍͍͔۩ମత
&OKPZ 4UVEZJOH /-1