B4 Study Group: Author Identification
Atsushi
February 27, 2018
Presentation date: February 27
Transcript
B4 Study Group, February 27
93 1. I;)% 1= 2. ,FH4> 3. #+D:
E 1. !-@CJ(AG<" 2. '$&2/B68 4. 7?.5 7?J(0* 2
$! u .+'2"% 3 - 7#
16,515 6 16,547 4 16,810 16,946 1 8( 0 ) 16,702 5, 17,028 & 16,785 3*/ 16,245
1. N-gram 2. !
4
$= $8 %( %: 1. 0/#;,1 2.
2&'.8>) <7- 3. <76!$# 439*5 0/#; "+?) #; 6
N-gram "(< -*-- )9 1. N-gram 2. 3+,27=. ;6 1
4) !#;6, %%;6,;6 3. ;65 $( ' 4) -,/&;6-,:08- 7
[Figure, slide 8 (N-gram): Cluster Dendrogram, hclust (*, "ward.D") on dist(t(res)); vertical axis: Height. Leaves: soseki_eijitsu.txt, soseki_yume.txt, soseki_garasu.txt, soseki_omoidasu.txt, ogai_niwatori.txt, ogai_vita.txt, ogai_gan.txt, ogai_kanoyoni.txt]
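The dendrogram above is built from a distance matrix over per-text N-gram features (`dist(t(res))` passed to `hclust` in R). How the slides construct `res` is not recoverable from this transcript, so the following is only a minimal Python sketch of one plausible feature step: character bigram relative frequencies and pairwise Euclidean distances. The function names, the choice of n = 2, and the toy strings are assumptions for illustration, not the author's data or code.

```python
from collections import Counter
from itertools import combinations
import math


def char_ngrams(text, n=2):
    """Count overlapping character n-grams in text."""
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def relative_freq(counts):
    """Turn raw counts into relative frequencies so text length cancels out."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {gram: c / total for gram, c in counts.items()}


def euclidean(p, q):
    """Euclidean distance between two sparse frequency profiles."""
    grams = set(p) | set(q)
    return math.sqrt(sum((p.get(g, 0.0) - q.get(g, 0.0)) ** 2 for g in grams))


def distance_matrix(texts, n=2):
    """Map each pair of text names to the distance between their profiles."""
    profiles = {name: relative_freq(char_ngrams(body, n))
                for name, body in texts.items()}
    return {(a, b): euclidean(profiles[a], profiles[b])
            for a, b in combinations(sorted(profiles), 2)}


# Toy stand-ins for the real files (hypothetical content, assumption).
texts = {
    "a1": "abababab",
    "a2": "babababa",
    "b1": "xyzxyzxyz",
}
dist = distance_matrix(texts)
```

In this toy input, `a1` and `a2` share the same bigrams and end up much closer to each other than either is to `b1`. A hierarchical clustering step (Ward's method in the slides) would then merge the closest texts first.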
1. N-gram 2. !
9
Principal component analysis (PCA)

$$
\begin{aligned}
z_1 &= a_{11}x_1 + a_{21}x_2 + a_{31}x_3 + a_{41}x_4 \\
z_2 &= a_{12}x_1 + a_{22}x_2 + a_{32}x_3 + a_{42}x_4 \\
z_3 &= a_{13}x_1 + a_{23}x_2 + a_{33}x_3 + a_{43}x_4 \\
z_4 &= a_{14}x_1 + a_{24}x_2 + a_{34}x_3 + a_{44}x_4
\end{aligned}
$$
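The four equations for z1 to z4 say that each component score is a weighted sum of the original variables x1 to x4, with one column of coefficients per component. As a rough illustration, the sketch below evaluates those sums in Python. The `component_scores` helper and the numeric loadings are hypothetical. In an actual PCA the coefficients would come from the eigenvectors of the covariance or correlation matrix, and the inputs would normally be centered first.

```python
def component_scores(x, a):
    """Return z with z[i] = sum over j of a[j][i] * x[j].

    a[j][i] is the weight of variable j in component i, matching
    z_i = a_1i*x_1 + a_2i*x_2 + ... in the equations on the slide.
    """
    n_components = len(a[0])
    return [sum(a[j][i] * x[j] for j in range(len(x)))
            for i in range(n_components)]


# Hypothetical loadings (assumption): column i holds the weights of component i.
loadings = [
    [0.5,  0.5,  0.5,  0.5],
    [0.5, -0.5,  0.5, -0.5],
    [0.5,  0.5, -0.5, -0.5],
    [0.5, -0.5, -0.5,  0.5],
]
x = [1.0, 2.0, 3.0, 4.0]
z = component_scores(x, loadings)
```

Plotting the first two scores of each text against each other gives a scatter plot of the Comp.1 and Comp.2 kind.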
&"#*)@<68//J -12- u H:! ;B07F M u EKH:&"#*)!A M!>C
0G 1. &"#*)!A 2. EKH:=. ,9L3! A 3. 68//J!-68/?:!A 4. 68/?:#*'!(+$% 68/?:DI! 5 2!4(+$% 11
[Figure, slide 12: PCA scatter plot, axes Comp.1 and Comp.2. Points: ogai_gan, ogai_kanoyoni, ogai_niwatori, ogai_vita, soseki_eijitsu, soseki_garasu, soseki_omoidasu, soseki_yume]
[Figure, slide 13: the same PCA scatter plot, axes Comp.1 and Comp.2, with the points numbered 1 to 8]
- #' u % +" u )
$ 247,249-251 u *() - . ! - 261-264 u &, 6600&, 14
[Figure, slide 15: Cluster Dendrogram, hclust (*, "ward.D") on dist(t(res)); vertical axis: Height. Leaves: tensura_247.txt, tensura_249.txt, tensura_250.txt, tensura_251.txt, mushoku_264.txt, mushoku_261.txt, mushoku_262.txt, mushoku_263.txt]
[Figure, slide 16: PCA scatter plot, axes Comp.1 and Comp.2. Points: mushoku_261, mushoku_262, mushoku_263, mushoku_264, tensura_247, tensura_249, tensura_250, tensura_251]
u 3. u ! u N-gram*$> u 6?:(-/+
%'> u 8 u 9" A17 @&= 4402 u #5, ;< 202 44) 17
/*,2 u -$() (2008) R &0, 1'"3+.!# page
143-153 u %4 , <https://www.macromill.com/service/data_analysis/cluster-analysis.html>, 2018/02/27 18