Context is Key- Grammatical Error Detection with Contextual Word Representations

長岡技術科学大学自然言語処理研究室小川耀一朗文献紹介（2019-12-16） Context is Key- Grammatical Error Detection
with Contextual Word Representations

Paper 2

3 Abstract • 誤り検出 (Grammatical Error Detection: GED) タスクの研究 •
既存のモデルに文脈を考慮した単語埋め込み (contextualized embeddings) を組み込むことでスコアの向上を確認した • ELMo, BERT, Flair を比較し、それぞれの傾向を調査した

4 Error Detection Model ベースモデル (Rei 2017) • bi-LSTMを用いて各トークンに対して correct
/ incorrect ラベルを予測する、sequence labelingタスクとして解く • Language modelを同時にマルチタスク学習する ◦ forward LSTMは次のトークンを予測するLM ◦ backward LSTMは前のトークンを予測するLM • character-level LSTMの出力を結合してword embeddingsとする

5 Error Detection Model 提案モデル word embeddingsにcontextualized embeddingsを結合して入力する

6 Contextualized word embeddings • BERT, ELMo, Flairの公開されている学習済みのモデルを使う • Flair:
Contextual String Embeddings for Sequence Labeling (Akbik et al., 2018)

BERT • masked LM と next sentence predictionを学習 • BooksCorpus
(0.8 billion words) + English Wikipedia (2.5 billion words) ELMo • 3つのlayerの出力を合計したembeddings • One Billion Word Benchmark corpus (0.8 billion words) Flair • One Billion Word Benchmark corpus (0.8 billion words) 7 Contextualized word embeddings

8 Results Rei (2017): ベースモデルを提案 Rei et al. (2017), Kasewa
et al. (2018) : 擬似データを使用提案手法は擬似データ未使用だが、 contextualized emneddingsを使うことで高いスコア BERTが最も高い

9 Integration method contextualized embeddingsをbi-LSTMの（左）入力に結合するか（右）出力に結合するかを比較

10 Integration method ベストスコアはデータセットによって異なるが、全体的には入力に結合する方がいい

11 Error type performance 各エラータイプでのRecallをベースラインと比較（全データセットの平均スコア）

12 Error type performance 各エラータイプでのRecallをベースラインと比較（全データセットの平均スコア） conjugation, spellingは向上が小さい →これらのエラーは単言語コーパスでは未知語になる

13 Conclusion • 文脈を考慮した単語の埋め込み (contextual embeddings) を使うことで誤り検出タスクの性能を向上させることができた • contextual
embeddingsは学習者コーパスを必要としないため実用的である • ELMo, Flair, BERT(base/large) の各エラータイプにおける長所と短所を分析した

Context is Key- Grammatical Error Detection wit...

Context is Key- Grammatical Error Detection with Contextual Word Representations

youichiro

More Decks by youichiro

Other Decks in Research

Featured

Transcript

長岡技術科学大学自然言語処理研究室小川耀一朗文献紹介（2019-12-16） Context is Key- Grammatical Error Detection

Paper 2

3 Abstract • 誤り検出 (Grammatical Error Detection: GED) タスクの研究 •

4 Error Detection Model ベースモデル (Rei 2017) • bi-LSTMを用いて各トークンに対して correct

5 Error Detection Model 提案モデル word embeddingsにcontextualized embeddingsを結合して入力する

6 Contextualized word embeddings • BERT, ELMo, Flairの公開されている学習済みのモデルを使う • Flair:

BERT • masked LM と next sentence predictionを学習 • BooksCorpus

8 Results Rei (2017): ベースモデルを提案 Rei et al. (2017), Kasewa

9 Integration method contextualized embeddingsをbi-LSTMの（左）入力に結合するか（右）出力に結合するかを比較

10 Integration method ベストスコアはデータセットによって異なるが、全体的には入力に結合する方がいい

11 Error type performance 各エラータイプでのRecallをベースラインと比較（全データセットの平均スコア）

12 Error type performance 各エラータイプでのRecallをベースラインと比較（全データセットの平均スコア） conjugation, spellingは向上が小さい →これらのエラーは単言語コーパスでは未知語になる

13 Conclusion • 文脈を考慮した単語の埋め込み (contextual embeddings) を使うことで誤り検出タスクの性能を向上させることができた • contextual