Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Delete, Retrieve, Generate: Approach for Text Style Transfer

Scatter Lab Inc.
September 18, 2019

Delete, Retrieve, Generate: Approach for Text Style Transfer

Scatter Lab Inc.

September 18, 2019
Tweet

More Decks by Scatter Lab Inc.

Other Decks in Research

Transcript

  1. ݾର ݾର 1. ࣁ޷ա ѐਃ 2. Delete,Retrieve,Generate: 
 A Simple

    Approach to Sentiment and Style Transfer (NAACL 2018) 3. Transforming Delete,Retrieve,Generate Approach 
 for Controlled Text Style Transfer (EMNLP 2019)
  2. • Style Transfer • ࠺੹ ࠙ঠীࢲח ੜ ঌ۰૓ ӝࣿ۽
 ੉޷૑

    ղ੄ ௑బஎ (য়࠳ં౟)ח ਬ૑ೞݶࢲ, झఋੌ݅ਸ ߸ജೞৈ ੉޷૑ܳ ੤ҳࢿೞח ӝࣿ ࣁ޷ա ѐਃ What is Style Transfer?
  3. • Text Style Transfer • ఫझ౟੄ ղਊ਷ ਬ૑ೞݶࢲ, ఫझ౟੄ झఋੌਸ

    ߸ജೞৈ ఫझ౟ܳ ੤ҳࢿೞח ӝࣿ • Ӓ۞ա, ఫझ౟ীࢲ ݈ೞח ղਊҗ झఋੌ੉ۆ…? • গݒೞҊ ੿੄ೞӝ য۰਍ ѐ֛ • ٮۄࢲ, ೟ࣿ੸ੋ োҳٜ਷ ఫझ౟੄ хࢿ ߂ दઁ ١ ౠ੿ attributeী ୡ੼ਸ ݏ୶ח ҃਋о ݆਺ ࣁ޷ա ѐਃ What is Text Style Transfer? The chicken was delicious The chicken was terrible positive negative
  4. • Delete,Retrieve,Generate (DRG)ۄח ߑߨۿਸ ੉ਊೠ Text Style Transferী ҙೠ ֤ޙਸ

    2ѐ ࣗѐ • ఫझ౟੄ Style Transferܳ ా೧ Controllableೠ ޙ੢੄ ࢤࢿਸ ԫ೧ࠅ ࣻ ੓਺ • ઁয оמೠ (Controllable) ޙ੢੄ ࢤࢿ਷ ਋ܻীѱ ੓যࢲ рҗೡ ࣻ হח ࠗ࠙ 1. ೝಯ੉о ޖ঺ਸ ݈ೞח૑ب ઺ਃೞ૑݅ যڌѱ ݈ೞח૑ী ٮۄ ؀ച੄ ݅઒بח ௼ѱ ׳ۄ૗ • झ௼݀౴੄ ੗زച, ಕܰࣗա੄ ాੌ ١ 2. ೝಯ੉੄ ݈ਸ য়۽૑ ݽ؛ী݅ ੄ઓೞח Ѫ੉ ইפۄ ӝദ੗੄ ੄بী ٮۄ ੸੺൤ ઁয оמ • ੉ߣ ఢ਷ negativeೠ ׹߸݈Ҋ positiveೠ ׹߸ਸ ೞ੗! ۄ؍о • ੉ߣ ఢ਷ ಣࢲޙ੉ ইפۄ ૕ޙਸ ؍ઉࠁ੗! ۄ؍о ࣁ޷ա ѐਃ ࣁ޷ա ѐਃ
  5. 1. Delete,Retrieve,Generate: 
 A Simple Approach to Sentiment and Style

    Transfer (NAACL 2018) Delete,Retrieve,Generate: A Simple Approach to Sentiment and Style Transfer
  6. • ಕয ؘ੉ఠࣇ੉ ইצ non-parallel ؘ੉ఠࣇਸ ੉ਊೞৈ झఋੌਸ ߸ജೞח ߑߨۿਸ

    ઁউ • ੌ߈੸ਵ۽ زੌೠ ஶబஎী झఋੌ݅ ׮ܲ ఫझ౟ ಕযहਸ ҳೞӝח য۰਑ • ੉ ֤ޙ ੉੹ীח ఫझ౟۽ࠗఠ content latent representations৬ style latent representationsܳ implicit ೞѱ ܻ࠙ೞח ݽ؛ਸ ೟णೞח ߑधী ؀ࣁ৓਺. Ӓ۞ա… 1. ӝࠄ੸ਵ۽ ࢤࢿػ ޙ੢੄ fluencyо ڄয૑Ѣա झఋੌ ߸ജ੉ ੜ উؽ 2. content ࠁઓਯ ߂ style ߸ജਯ੄ ౟ۨ੉٘ য়೐ܳ ઑ੺ೞӝо ൨ٝ • ୭Ӕ Text Style Transfer ҙ۲ ؀ࠗ࠙੄ ֤ޙٜীࢲ ߬੉झۄੋਵ۽ ॳ੉ח ݽ؛ Delete,Retrieve,Generate: A Simple Approach to Sentiment and Style Transfer ֤ޙ ѐਃ
  7. • ߑߨۿ 1. Delete: ޙ੢ਵ۽ࠗఠ ਗې झఋੌী ೧׼ೞח 
 attribute

    words݅ਸ ઁѢ 2. Retrieve: target style corpus۽ࠗఠ 
 ਗ ޙ੢җ ਬࢎೠ ஶబஎܳ ׸਷ ޙ੢ਸ Ѩ࢝ೞৈ, 
 Ӓ ޙ੢ ղ੄ attribute wordsܳ ୶୹ 3. Generate: Delete җ੿ীࢲ झఋੌ ࣘࢿ੉ ઁѢػ ޙ੢җ 
 Retrieve җ੿ীࢲ Ѩ࢝ػ झఋੌ ࣘࢿ ױযٜਸ ੉ਊೞৈ 
 target झఋੌ۽ ߸ജػ ޙ੢ਸ ࢤࢿ Delete,Retrieve,Generate: A Simple Approach to Sentiment and Style Transfer Method ӛ੿ ࠗ੿ Delicious So good Friendly Awesome Loves … Ugly Very rude Sucks Not good … Great food but horrible staff and very rude workers Great food but staff and workers 1.Delete 2.Retrieve Awesome, Friendly 3.Generate Great food, awesome staff and friendly workers
  8. Delete,Retrieve,Generate: A Simple Approach to Sentiment and Style Transfer Delete

    • ௏ಌझ ղীࢲ झఋੌী ؀ೠ ध߹۱੉ о੢ ֫਷ (झఋੌ ӓࢿ੉ о੢ ௾) word-ngramsܳ ઁѢ • п n-gramsо пп੄ ௏ಌझী ١੢ೞח ࠼بܳ ష؀۽ ੼ࣻܳ ҅࢑ • ౠ੿ thresholdܳ ֈ਷ n-gramী ؀೧ࢲ attribute words۽ р઱
  9. Delete,Retrieve,Generate: A Simple Approach to Sentiment and Style Transfer Retrieve

    • ੸੺ೠ target style attribute wordsܳ ୶୹ೞӝ ਤ೧
 source sentence৬ ਬࢎೠ ޙ੢ਸ target style corpusীࢲ Ѩ࢝ • This food is (delicious) -> This food is ugly (X), This food is bland (O) • ਤ धীࢲ ف ޙ੢ ࢎ੉੄ Ѣܻܳ ੤ח ೣࣻ dח ޙ੢ р Ѣܻܳ ੤ח ইޖ metric੉ա ࢚ҙ হ਺ 1. TF-IDF weighted word overlap 2. Euclidean distance using sentence embedding 3. Edit distance ١١…
  10. Delete,Retrieve,Generate: A Simple Approach to Sentiment and Style Transfer Generate

    • ୨ 4о૑ ߑߨਸ ઁউ 1. RetrieveOnly: Retrieve Ѿҗ Ѩ࢝ػ target sentenceܳ Ӓ؀۽ ੉ਊ 2. TemplateBased: Deleteػ ޙ੢җ Retrieveীࢲ Ѩ࢝ػ target attribute markersܳ ױࣽ concat 3. DeleteOnly: Deleteػ ޙ੢җ target style ࣘࢿਸ ੑ۱ೞݶ 
 target style sentenceо ࢤࢿغب۾ ೞח LSTMӝ߈ encoder-decoder ݽ؛ 4. DeleteAndRetrieve: Deleteػ ޙ੢җ Retrieveীࢲ Ѩ࢝ػ target attribute markersܳ ੑ۱ೞݶ
 target style sentenceо ࢤࢿغب۾ ೞח LSTMӝ߈ encoder-decoder ݽ؛
  11. Delete,Retrieve,Generate: A Simple Approach to Sentiment and Style Transfer Experiments

    • ؘ੉ఠࣇ • Yelp, Amazon ܻ࠭੄ positive, negative ؘ੉ఠࣇ • ੉޷૑ ஭࣌੄ romantic, humorous ؘ੉ఠࣇ • ಣоח ௼ۄ਋٘ ࣗयਸ ੉ਊೞৈ 5ױ҅ ಣо • ղਊ ࠁઓਯ • ࣘࢿ (झఋੌ) ߸҃ਯ • ޙ੢੄ ੗োझ۞਑
  12. Delete,Retrieve,Generate: A Simple Approach to Sentiment and Style Transfer Experimental

    Results • 4 ഑਷ 5੼ਸ ߉਷ ҃਋ Success۽ р઱ • RetrieveOnly, TemplateBased ١੄ ߬੉झۄੋ ݽ؛ٜ੉ ӝઓ ݽ؛ٜࠁ׮ જ਷ ࢿמ • DeleteAndRetrieve ݽ؛੄ ࢿמ੉ о੢ ਋ࣻೞա Human Ѿҗীח ݆੉ ޅ ޷ஜ
  13. 2. Transforming Delete,Retrieve,Generate Approach 
 for Controlled Text Style Transfer

    (EMNLP 2019) Transforming Delete,Retrieve,Generate Approach for Controlled Text Style Transfer
  14. • Pre-trained language modelҗ Transformer੄ ౠࢿਸ ੉ਊ೧ࢲ ӝઓ ֤ޙ੄ ױ੼ਸ

    ࠁ৮ೞ੗ 1. Delete җ੿ীࢲ ղਊী ೧׼ೞח ೨ब ױযܳ ઁѢೣ 2. Source style ױযܳ ઁѢೞ૑ ޅೣ 3. LSTMӝ߈ encoder-decoder ݽ؛੉ Ӓ׮૑ ۽ߡझ౟ೞ૑ ޅ೧ ੗োझ۞਍ ޙ੢ਸ ղߜ૑ ޅೣ 4. ӟ ޙ੢ী ஂডೣ Transforming Delete,Retrieve,Generate Approach for Controlled Text Style Transfer ֤ޙ ѐਃ
  15. • Input reduction (Feng et al., 2018) • style classificationਸ

    ೡ ٸ о੢ ӝৈبо ௾ ױযٜ੉ attribute wordsੌ Ѫ • BERT-based transformerܳ Delete Transformer۽ ࢎਊ • [CLS] ష௾ী ؀ೠ attention scoreо ࢚ਤੋ ష௾ٜਸ attribute words۽ р઱ೞৈ ઁѢ • BERTח ӝࠄ੸ਵ۽ (Multi-head, Multi-layer)੉Ҋ п க, ೻٘݃׮ ׮ܲ ৉ೡਸ ыਵ޲۽ 
 attribute words ઁѢী ੸೤ೠ attention headܳ ଺ח җ੿੉ ೙ਃೣ Transforming Delete,Retrieve,Generate Approach for Controlled Text Style Transfer Delete
  16. • attribute words ઁѢী ੸೤ೠ attention headܳ ଺ח җ੿ •

    ޙ੢ ղীࢲ ੐੄۽ ѐ੄ ష௾ਸ ઁѢೠ ޙ੢ਸ ۽ ಴അ • п (h, l)ী ؀೧ (4)৬ э੉ scoreܳ ݒӣ • (5)৬ э੉ validation datasetী ؀೧ ੉ झ௏য੄ ୨೤੉ ઁੌ ੘਷ (h, l)ਸ 
 о੢ ੸೤ೠ attention head۽ Ѿ੿ Transforming Delete,Retrieve,Generate Approach for Controlled Text Style Transfer Delete γ|x| x′h,l
  17. • ӝઓ ࣻߨҗ زੌೞѱ, source sentence৬ ਬࢎೠ ஶబஎܳ ׸Ҋ ੓ח

    target sentenceܳ Ѩ࢝ • ف ޙ੢ ࢎ੉੄ Ѣܻܳ ੤ח ೣࣻ d 1. TF-IDF weighted 2. Averaged-GloVe over all tokens of a sentence 3. Universal Sentence Encoder (Cer et al., 2018) • 3о૑ metric ઺ 1.੉ о੢ retrieval Ѿҗо જও׮Ҋ ೣ Transforming Delete,Retrieve,Generate Approach for Controlled Text Style Transfer Retrieve
  18. • GPTܳ ӝ߈ਵ۽ ೞח Generative Style Transformer (GST)ܳ ઁউ •

    Blind Generative Style Transformer (B-GST), Guided Generative Style Transformer(G-GST) • B-GST • Deleteػ ޙ੢җ target styleਸ ੑ۱ • ୹۱ ޙ੢ਸ ࢤࢿೞחؘী ੓যࢲ ੗ਬبо ੓Ҋ
 ਗ ޙ੢җ ਬࢎೠ ޙ੢ਸ target corpus۽ࠗఠ retrieveೞӝ য۰਍ ҃਋ী ਬਊೣ • G-GST • Deleteػ ޙ੢җ target attribute wordsܳ ੑ۱ • ਗ ޙ੢җ ਬࢎೠ ޙ੢ਸ retrieveೠ ҃਋ ݽ؛ীѱ target attributes੄ ੿ࠁܳ ઁҕ೧઱ח Ѻ • ୹۱ ޙ੢ী ؀ೠ Fine-grained control੉ оמ Transforming Delete,Retrieve,Generate Approach for Controlled Text Style Transfer Generate
  19. • झಕ࣍ ష௾ٜਸ ୶о۽ ੉ਊೞৈ ޙ੢ਸ ੑ۱ • B-GST :

    <ATTR> <CONT_START> Contents <OUTPUT_START> • G-GST : <ATTR_START> Attribute Words <CONT_START> Contents <OUTPUT_START> Transforming Delete,Retrieve,Generate Approach for Controlled Text Style Transfer Generate
  20. Delete,Retrieve,Generate: A Simple Approach to Sentiment and Style Transfer Experiments

    • ؘ੉ఠࣇ • Yelp, Amazon ܻ࠭੄ positive, negative ؘ੉ఠࣇ • ੉޷૑ ஭࣌੄ romantic, humorous ؘ੉ఠࣇ • Political ؘ੉ఠࣇ - ಕ੉झ࠘ ҕച׼ ૑૑੗, ޹઱׼ ૑૑੗੄ ؆Ӗ • Gender ؘ੉ఠࣇ - Yelp ਺ध੼ ܻ࠭੄ ࢿ߹ • ಣоח ௼ۄ਋٘ ࣗयਸ ੉ਊೞৈ ߬੉झۄੋҗ ઁউ ݽ؛੄ ਋ਤܳ ࠺Ү • ղਊ ߸҃ਯ • ࣘࢿ ߸҃ਯ • ޙ੢੄ ੗োझ۞਑
  21. Delete,Retrieve,Generate: A Simple Approach to Sentiment and Style Transfer Experimental

    Results • ӝઓ੄ Delete&Retrieve ݽ؛җ B-GST ݽ؛ ઺ যו ଃ੉ ؊ ੸੺ೠ૑ܳ ௼ۄ਋٘ ࣗयਵ۽ ಣо • D&R: Delete & Retrieve, BT:Back-Translation • ੿۝੸ੋ ಣо Ѿҗب ֤ޙী ঱әغয੓૑݅ ৈӝࢲח ࢤۚ