Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms

Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated
Pooling Mechanisms Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Long Papers), pages 440–450 Melbourne, Australia, July 15 - 20, 2018. 文献紹介：長岡技術科学大学勝田哲弘

Abstract • Simple Word-Embedding-based Models (SWEMs)と word-embedding-based RNN/CNN modelsの比較 ◦
SWEMsが多くの場合で同等、優れた精度を示す • Parameter freeのpoolingを活用するモデル ◦ hierarchical pooling ◦ parameter数が少なく済む 2

Introduction • Word embeddingは各単語を固定長のベクトルとして表現し、可変長テキストのモデル化によく利用されている ◦ 加算などの簡易的なものからRNN、CNNなど • RNN、CNNはパラメータが多く、計算コストが高い •
SWEMは語順情報が明示的でない、計算コストは低い • 計算コストと表現力はトレードオフ 3

Introduction • 単語分散表現で実行される単純なpooling処理が自然言語処理にいつ、なぜ有効なのかを調査する • ３つの異なるタスク（17のデータセット）で評価 4

Simple Word-Embedding Model (SWEM) パラメータを持たないモデル • Average-Pooling（一番単純なモデル） • Max Pooling（CNNでのmax-over-time
pooling に近い） • Hierarchical Pooling ◦ ウィンドウ幅nでavg-poolingを行い、その上にmax-pooling 5

Parameters & Computation Comparison 6

Experiments • タスク： ◦ 文書分類（トピック分類、感情分類、オントロジー分類） ◦ テキストマッチング ◦ 文分類
◦ 17データセット • モデル ◦ GloVe ◦ MLP ◦ Adam 7

Document Categorization 8

Interpreting model predictions 殆どの値が0付近に集中するタスクがテキスト中のあるキーワードに依存していることを示唆各次元ごとに選択された単語は関連性や共通のトピックに対応する 9

Interpreting model predictions 10

Importance of word-order information 11

Text Sequence Matching 12

Short Sentence Processing 13

Extension to other languages • Sogou news corpus(a Chinese dataset
represented by Pinyin) ◦ SWEM-concat accuracy : 91.3% ◦ SWEM-hier (window size of 5) accuracy : 96.2% ◦ CNN (95.6%) and LSTM (95.2%) • より語順に敏感な中国語においても最高精度に匹敵する 14

Conclusions 17のデータセットでSWEM、CNN、LSTMのモデル間の比較を行った • 単純なプーリングは長い文書の表現に効果的、短い文にはCNN/LSTMが最適 • 感情分類はトピック分類よりも語順に敏感である、hierarchical poolingは CNN/LSTMと同等の結果が得られる •
NLI、QAでは単純なpoolingが優れた精度を出す • SWEM Max Poolingでは、分散表現の各次元にトピックと対応付けられるような意味的パターンが見られた 15

Baseline Needs More Love: On Simple Word-Embedd...

Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms

katsutan

More Decks by katsutan

Other Decks in Technology

Featured

Transcript