Speech and Language Processing, Chapter 9: Sequence Processing with RNNs

Slide 1

Slide 1 text

Speech and Language Processing, Chapter 9: Sequence Processing with RNNs +α
Text: Dan Jurafsky and James H. Martin. Speech and Language Processing (3rd ed. draft). https://web.stanford.edu/~jurafsky/slp3/
June 30, 2020
三原千尋

Slide 3

Slide 3 text

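Language is sequential data made up of words (or characters, phonemes, ...), and we process these units in order as they arrive.
→ Do language processing models take this flow into account?

Naive Bayes classifiers (Chapter 4) and logistic regression classifiers (Chapter 5):
• These models do not consider which word appears at which position (although, depending on how the features are designed, word order can be partially captured). Nor do they produce features for a text that has only been read partway through.

Neural network models that take a fixed-length word sequence as input and make a prediction:
• These do consider which word appears at which position, but they do not produce features for a text that has only been read partway through. They also cannot handle texts of arbitrary length.

Neural network word prediction models with a fixed window width as input (Chapter 7):
• These move through the text in order, looking at the words in the window and their arrangement, and predict the next word at each step.
• However, past words that fall outside the window cannot be consulted.
• Sliding the window along makes it hard to capture the meaning of groups of words (※).

constituency: a constituent, that is, a group of words.

[Figure (right): a fixed-width window at two consecutive steps over a sentence containing "hole", "in", "the", "ground", "there", "lived"; labels: input words, prediction target]

※ The phrase "the ground" occupies the second and third input positions at one step, but the first and second positions at the next step (figure at right). The network has to learn both patterns.

Introducing recurrent neural networks (RNNs) makes it possible to handle the flow of a text (its context) and also to accept variable-length input.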
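The contrast on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the example sentence, the window width of 3, the function names (`sliding_windows`, `phrase_offsets`, `rnn_final_state`), and the scalar weights are all made up for the sketch and are not taken from the textbook. The first part shows how "the ground" shifts position between consecutive windows; the second part is a toy scalar recurrence that carries one state value across an input of any length.

```python
import math


def sliding_windows(tokens, width):
    """Yield (context, target) pairs for a fixed-window next-word predictor."""
    for start in range(len(tokens) - width):
        yield tokens[start:start + width], tokens[start + width]


def phrase_offsets(tokens, width, phrase):
    """Return the 1-based offset of `phrase` in each window that contains it."""
    offsets = []
    for context, _target in sliding_windows(tokens, width):
        for i in range(width - len(phrase) + 1):
            if context[i:i + len(phrase)] == phrase:
                offsets.append(i + 1)
    return offsets


def rnn_final_state(inputs, w_in, w_rec, h0=0.0):
    """Toy scalar recurrence: h_t = tanh(w_in * x_t + w_rec * h_{t-1}).

    The loop runs once per input, so any sequence length is accepted,
    and each new state depends on everything seen so far.
    """
    h = h0
    for x in inputs:
        h = math.tanh(w_in * x + w_rec * h)
    return h


# Assumed example sentence (chosen for illustration only).
tokens = "in a hole in the ground there lived".split()

# "the ground" sits at offset 2 in one window and offset 1 in the next.
print(phrase_offsets(tokens, 3, ["the", "ground"]))  # [2, 1]

# The same recurrence handles sequences of different lengths.
short_state = rnn_final_state([0.5, -1.0], w_in=0.8, w_rec=0.5)
long_state = rnn_final_state([0.5, -1.0, 0.3, 0.9, -0.2], w_in=0.8, w_rec=0.5)
```

A real RNN would use vector-valued states and weight matrices learned from data; the scalar version here is only meant to show that the state is updated step by step rather than recomputed from a fixed-size window.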

Slide 24

Slide 24 text

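★ Origins and early use of RNNs
• 1980s: The Parallel Distributed Processing (PDP) group at UC San Diego studies RNNs as models of human cognition (Rumelhart et al., 1986b) (McClelland et al., 1986).
• 1994: RNN research also becomes active in signal processing and language processing (Giles et al., 1994).

★ Development of RNN architectures
• 1986: A model that feeds the output layer back as recurrent input is proposed (Jordan, 1986).
• 1986: Unrolling an RNN into a feedforward network (FFN) is discussed (Rumelhart et al., 1986b).
• 1990: The Elman network is proposed (Elman, 1990).
• 1995: A model that adds a recurrent context layer before the hidden layer (Mathis and Mozer, 1995).
• 1997: The bidirectional RNN (Bi-RNN) is proposed and its performance is demonstrated on TIMIT speech recognition (Schuster and Paliwal, 1997).
• 1997: The LSTM is proposed (Hochreiter and Schmidhuber, 1997) and later shows notable performance on a variety of tasks:
  • tasks at the boundary between signal processing and language processing (Graves and Schmidhuber, 2005)
  • handwriting recognition (Graves et al., 2007)
  • speech recognition (Graves et al., 2013)

★ RNNs and natural language processing
• 2008-2011: A range of standard natural language processing tasks are solved with word embeddings plus convolutional networks, without hand-crafted features (Collobert and Weston, 2008) (Collobert et al., 2011).
• 2013: word2vec embeddings are introduced (Mikolov et al., 2013) and go on to be combined with LSTMs.
• 2014: GloVe embeddings are introduced (Pennington et al., 2014) and go on to be combined with LSTMs.
• 2015-2016: The combination of word embeddings and LSTMs comes to dominate a wide range of tasks:
  • part-of-speech tagging (Ling et al., 2015)
  • syntactic chunking (Søgaard and Goldberg, 2016)
  • named entity recognition via IOB tagging (Chiu and Nichols, 2016) (Ma and Hovy, 2016)
  • opinion mining (Irsoy and Cardie, 2014)
  • semantic role labeling (Zhou and Xu, 2015)
  • AMR parsing (Foland and Martin, 2016)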
