A System to Solve Language Tests for Second Grade Students
Manami Saito, Satoshi Sekine, Hitoshi Isahara, and Kazuhide Yamamoto. A System to Solve Language Tests for Second Grade Students. Proceedings of The Second International Joint Conference on Natural Language Processing (IJCNLP-05), pp.45-50 (2005.10)
Manami SaitoʢNagaoka University of Technology ʣ Kazuhide YamamotoʢNagaoka University of Technology ʣ Satoshi SekineʢLanguage CraftɾNew York Universityʣ Hitoshi IsaharaʢNational Institute of Information and Communications Technologyʣ
for 2nd grade students (http://languagecraft.jp). Two aims: To realize the NLP technologies into the form which can be easily observed by ordinary people. To observe the problems of the NLP technologies by degrading the level of target materials.
Word knowledge, Comprehension, and Composition Kanji Reading, Writing, Order of writing, etc… Word knowledge Anonym, Synonym, Particle, Onomatopeia, etc… Comprehension 5W1H, Fill in the blanks, Progress order of a story, etc…
morphological analysis Writing We got the candidates from Kanji-dictionary, and then choose feasible one using large corpus (38 years of newspapers, 350GB Web corpus)
at different type of questions ʢaʣPattern matching (Ex3) ʢbʣStandard NE and form of grammar ʢcʣPartial matching with keywords ʢdʣUse of frequencies in the large corpus ʢeʣUse of distance between keywords in question and answers
ೋɺࡾͨͭͱɺͦͷՖ ͠΅ΜͰɺͩΜͩ Μ ࠇͬΆ͍ ৭ʹ ͔Θͬͯ ͍͖·͢ɻ(In a few days, the flower withers and gradually changes its color to black.) Expression: Ֆ (1), (2) ৭ʹ ͔ΘΔɻ (The flower (1) and changes its color to (2).) Answer: (1) ͠΅ΜͰ (withers) (2) ࠇͬΆ͍ (black)
grade level e.g. “A student enters junior high school after graduated from elementary school”, “A person become happy, if he receives something nice from someone ”
It should be flexibility of Named Entity “Who” = “raccoon dog behind our house ”, “the moon” The answer can be a clause “When” = “the time when new leaves growing on a branch ”
for 2nd grade students Test Data Targeted questions suggest the questions prepared the subsystem to solve it All questions suggest targeted questions and the questions that this system can’t even try to solve it Rate of the questions targeted by this system Kanji 97.4%, Word knowledge 57.1%, Reading comprehension 64.7%
100 minor types Is this classification right? Are there any unknown types? We can’t solve the questions of unknown type Reclassification is in review Accuracy of the classification program We are building the system which classify the questions automatically