Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Delete, Retrieve, Generate: A Simple Approach t...

Delete, Retrieve, Generate: A Simple Approach to Sentiment and Style Transfer

弊研究室で行なった NAACL読み会の発表資料です。

Avatar for onizuka laboratory

onizuka laboratory

July 25, 2018
Tweet

More Decks by onizuka laboratory

Other Decks in Research

Transcript

  1. Delete, Retrieve, Generate: 
 A Simple Approach to 
 Sentiment

    and Style Transfer 2018.7.25 B4 Miura Biwa  
  2. Overview • Propose simple models for style transfer • Process

    in 3 steps : Delete, Retrieve, Generate • Evaluate by Yelp, Amazon, Caption dataset • Best method is stronger than previous system  
  3. Background • Control attributes (such as sentiment and style) is

    very important for neural language generation(NLG) • Goal : transfer specific attribute while preserving all attribute-independent content • Difficulty : aligned sentences with the same content but different attributes are not available  
  4. Method • Delete : delete attribute marker • Retrieve :

    retrieve similar sentence 
 with target attribute • Generate : generate output   %FMFUF 3FUSJFWF (FOFSBUF Y WTSD Z
  5. Delete • Identify attribute markers • Delete attribute markers •

    (“i have had this mount for about a year
 and it works great .” , “positive”)
 → “i have had this mount for about
 ɹ a year and it .”   %FMFUF 3FUSJFWF (FOFSBUF Y WTSD Z
  6. Retrieve • Look at similar sentence with target attribute •

    “i have had this mount for about
 a year and it .”
 →“i have had it for a while but barely
 ɹ used it .”   %FMFUF 3FUSJFWF (FOFSBUF Y WTSD Z
  7. Generate • Generate final output • RETRIEVEONLY
 - output retrieved

    sentence
 →“i have had it for a while but barely used it .” • TEMPLETEBASED
 - insert target attribute marker
 “i have had it for a while but barely used it .”
 →”i have had this mount for about
 ɹ a year and it barely used.”   %FMFUF 3FUSJFWF (FOFSBUF Y WTSD Z
  8. Generate • DELETEONLY
 - concat and decode extracted sentence
 and

    target label
 →G(“i have had this mount for about a year
 and it .”, “negative”) • DELETEANDRETRIEVE
 - concat and decode extracted sentence
 and attribute marker in retrieved sentence
 →G(“i have had this mount for about a year
 and it .”, “beraly used”)   %FMFUF 3FUSJFWF (FOFSBUF Y WTSD Z
  9. Training • DELETEONLY • AE : (extracted sentence, label)→source sentence

    • DELETEANDRETRIEVE • Denoising-AE : 
 (extracted sentence, attribute marker+noise)
 →source sentence  
  10. Training setting • word vector : 128 • GRU (both

    Decoder and Encoder) • hidden size : 512 • layer : 1 • activation function : maxout • init parameters : uniform distribution between [-0.1,0.1] • minibatch size : 256  
  11. Training setting • for attribute marker extraction • span :

    4 word • λ = 1 • γ : 15(Yelp) , 5.5(Amazon) , 5(Caption)  
  12. Training setting • Distance metric for retrieve • TF-IDF weighted

    word overlap score
 for DELETEANDRETREIVE and TEMPLETEBASED • Euclidian distance
 for RETRIEVEONLY • beamsize : 10  
  13. Conclusion • Proposed simple models for style transfer • Process

    in 3 steps : Delete, Retrieve, Generate • Evaluated by Yelp, Amazon, Caption dataset • Best method is stronger than previous system