Upgrade to Pro — share decks privately, control downloads, hide ads and more …

SGM: Sequence Generation Model for Multi-Label Classification

SGM: Sequence Generation Model for Multi-Label Classification

弊研究室で行なったCOLING2018読み会の発表資料です。

onizuka laboratory

October 23, 2018
Tweet

More Decks by onizuka laboratory

Other Decks in Research

Transcript

  1. #" n Multi Label Classification(MLC) (,$. >2 !'& -E n

    ?6:  MLC-E>2Single Label Classification)+ Binary Relevancepairwise ranking loss; $ ! 6: n %0:  !C<D3B) ! *574=/  n A3 seq2seq; sequence generation! 18%@-E?9 4
  2. Encoder n Bi-LSTM  n $!  #  n

    $! # "#  !" = LSTM !"() , +" !" = LSTM !",) , +" !" = !" ; !" 8
  3. Attention n *  &2#,.+ * '% n Attention *(

    3 4" /1  n !" , $" , %"&'-$( decoder!40) 10
  4. Decoder n LSTM  n %#" n !"#$% − 1

     ! n ( !"#$ % − 1 &!  global embedding($) 13
  5. Decoder n $(& !%  )' n !" , !$

    , %$ n &'  !!   " #  &' ( = * −∞ ( .  ) 0 12ℎ456.74 14
  6. Global Embedding n  #!* %  ) (!* n

    #!*-+ ".!* '&/ (exposure bias) n Global embedding $, n !" , !$ ∈ ℝ'×'  15
  7. "- n  l Reuters Corpus Volume I (RCV1-V2) l

    '800,000 (  l Arxiv Academic Paper Dataset (AAPD) l 55,840 )$ !* l   &,#.+  % 17
  8.  n  l Hamming-loss l! ", $ " =

    & '()*+( ∑ -./ '()*+(0& 1("- ≠ $ "- ) l Micro-F1 n   l Binary Relevance(BR) l Classifier Chains(CC) l Label Powerset(LP) l CNN l CNN-RNN 18
  9.   n Global Embedding    ! "#$%

     &  n  21
  10. 9; n Multi-label classification"68&3/0  =5> ( n 1 decoder2<4

    sequence generation  4%@7,)# /' n *   ! +. $-:? 25