• Skip-Thought [06], SCDV [07], InferSent [08], USE [09], etc. [01] Ru ̈ckle ́+: Concatenated Power Mean Word Embeddings as Universal Cross-Lingual Sentence Representations, arXiv ’18
[02] Shen+: Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms, ACL ’18
[03] Zhelezniak+: Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors, ICLR ’19
[04] Arora+: A Simple but Tough-to-Beat Baseline for Sentence Embeddings, ICLR '17
[05] Ethayarajh: Unsupervised Random Walk Sentence Embeddings: A Strong but Simple Baseline, Rep4NLP ’18
[06] Kiros+: Skip-Thought Vectors, NIPS ’15
[07] Mekala+: SCDV : Sparse Composite Document Vectors using soft clustering over distributional representations, ACL ’17
[08] Conneau+: Supervised Learning of Universal Sentence Representations from Natural Language Inference Data, EMNLP '17
ಋೖ: STSʹ͓͚ΔSpearmanͷॱҐ૬ؔͷܭࢉ 20 จA จB ਓؒධՁ Model 1 Model 2 A man is playing a guitar. The man is playing the guitar. 4.909 0.985 0.978 A man is playing a guitar. A guy is playing an instrument. 3.800 0.646 0.895 A man is playing a guitar. A man is playing a guitar and singing. 3.200 0.874 0.977 A man is playing a guitar. The girl is playing the guitar. 2.250 0.747 0.831 A man is playing a guitar. A woman is cutting vegetable. 0.000 0.290 0.595
ಋೖ: STSʹ͓͚ΔSpearmanͷॱҐ૬ؔͷܭࢉ 21 จA จB ਓؒධՁ Model 1 Model 2 A man is playing a guitar. The man is playing the guitar. 4.909 0.985 0.978 A man is playing a guitar. A guy is playing an instrument. 3.800 0.646 0.895 A man is playing a guitar. A man is playing a guitar and singing. 3.200 0.874 0.977 A man is playing a guitar. The girl is playing the guitar. 2.250 0.747 0.831 A man is playing a guitar. A woman is cutting vegetable. 0.000 0.290 0.595
ಋೖ: STSʹ͓͚ΔSpearmanͷॱҐ૬ؔͷܭࢉ 22 จA จB ਓؒධՁ Model 1 Model 2 A man is playing a guitar. The man is playing the guitar. 4.909 0.985 0.978 A man is playing a guitar. A guy is playing an instrument. 3.800 0.646 0.895 A man is playing a guitar. A man is playing a guitar and singing. 3.200 0.874 0.977 A man is playing a guitar. The girl is playing the guitar. 2.250 0.747 0.831 A man is playing a guitar. A woman is cutting vegetable. 0.000 0.290 0.595 1 1 4 3 2 2 3 4 5 5 1 2 3 4 5
ಋೖ: STSʹ͓͚ΔSpearmanͷॱҐ૬ؔͷܭࢉ 23 จA จB ਓؒධՁ Model 1 Model 2 A man is playing a guitar. The man is playing the guitar. 4.909 0.985 0.978 A man is playing a guitar. A guy is playing an instrument. 3.800 0.646 0.895 A man is playing a guitar. A man is playing a guitar and singing. 3.200 0.874 0.977 A man is playing a guitar. The girl is playing the guitar. 2.250 0.747 0.831 A man is playing a guitar. A woman is cutting vegetable. 0.000 0.290 0.595 1 1 4 3 2 2 3 4 5 5 1 2 3 4 5 จϖΞͷྨࣅॱҐ
ಋೖ: STSʹ͓͚ΔSpearmanͷॱҐ૬ؔͷܭࢉ 24 จA จB ਓؒධՁ Model 1 Model 2 A man is playing a guitar. The man is playing the guitar. 4.909 0.985 0.978 A man is playing a guitar. A guy is playing an instrument. 3.800 0.646 0.895 A man is playing a guitar. A man is playing a guitar and singing. 3.200 0.874 0.977 A man is playing a guitar. The girl is playing the guitar. 2.250 0.747 0.831 A man is playing a guitar. A woman is cutting vegetable. 0.000 0.290 0.595 1 1 4 3 2 2 3 4 5 5 1 2 3 4 5 r1 = 1 − 6 5(52 − 1) {(1−1)2 + (2−4)2 + (3−2)2 + (4−5)2 + (5−5)2} = 1 − 6 120 (0 + 4 + 1 + 1 + 0)
ಋೖ: STSʹ͓͚ΔSpearmanͷॱҐ૬ؔͷܭࢉ 25 จA จB ਓؒධՁ Model 1 Model 2 A man is playing a guitar. The man is playing the guitar. 4.909 0.985 0.978 A man is playing a guitar. A guy is playing an instrument. 3.800 0.646 0.895 A man is playing a guitar. A man is playing a guitar and singing. 3.200 0.874 0.977 A man is playing a guitar. The girl is playing the guitar. 2.250 0.747 0.831 A man is playing a guitar. A woman is cutting vegetable. 0.000 0.290 0.595 1 1 4 3 2 2 3 4 5 5 1 2 3 4 5 r1 = 1 − 6 5(52 − 1) {(1−1)2 + (2−4)2 + (3−2)2 + (4−5)2 + (5−5)2} = 1 − 6 120 (0 + 4 + 1 + 1 + 0) ਖ਼ղͷॱҐͱ༧ଌʹΑΔ ॱҐͷ૬ؔΛܭࢉ(ެࣜʹಥͬࠐΉ)
ಋೖ: STSʹ͓͚ΔSpearmanͷॱҐ૬ؔͷܭࢉ 26 จA จB ਓؒධՁ Model 1 Model 2 A man is playing a guitar. The man is playing the guitar. 4.909 0.985 0.978 A man is playing a guitar. A guy is playing an instrument. 3.800 0.646 0.895 A man is playing a guitar. A man is playing a guitar and singing. 3.200 0.874 0.977 A man is playing a guitar. The girl is playing the guitar. 2.250 0.747 0.831 A man is playing a guitar. A woman is cutting vegetable. 0.000 0.290 0.595 1 1 4 3 2 2 3 4 5 5 1 2 3 4 5 r1 = 1 − 6 5(52 − 1) {(1−1)2 + (2−4)2 + (3−2)2 + (4−5)2 + (5−5)2} = 1 − 6 120 (0 + 4 + 1 + 1 + 0) r1 = 0.7 r2 = 0.9 ਖ਼ղͷॱҐͱ༧ଌʹΑΔ ॱҐͷ૬ؔΛܭࢉ(ެࣜʹಥͬࠐΉ)
ಋೖ: STSʹ͓͚ΔSpearmanͷॱҐ૬ؔͷܭࢉ 27 จA จB ਓؒධՁ Model 1 Model 2 A man is playing a guitar. The man is playing the guitar. 4.909 0.985 0.978 A man is playing a guitar. A guy is playing an instrument. 3.800 0.646 0.895 A man is playing a guitar. A man is playing a guitar and singing. 3.200 0.874 0.977 A man is playing a guitar. The girl is playing the guitar. 2.250 0.747 0.831 A man is playing a guitar. A woman is cutting vegetable. 0.000 0.290 0.595 1 1 4 3 2 2 3 4 5 5 1 2 3 4 5 r1 = 1 − 6 5(52 − 1) {(1−1)2 + (2−4)2 + (3−2)2 + (4−5)2 + (5−5)2} = 1 − 6 120 (0 + 4 + 1 + 1 + 0) r1 = 0.7 r2 = 0.9 Model 2ͷ΄͏͕༏Ε͍ͯΔ
•“ranking”ͱݴ͍ͭͭͬͯΔͷਖ਼ྫ(ਖ਼͍͠༁จϖΞ)ͷྨࣅ࠷େԽ [26] Guo+: E ff ective Parallel Corpus Mining using Bilingual Sentence Embeddings, WMT ‘18 LaBSEͷߏཁૉ: Translation ranking task 40 զഐೣͰ͋Δɻ Ja I am a cat. En Nice to meet you. En ਖ਼ྫ ෛྫ
•“ranking”ͱݴ͍ͭͭͬͯΔͷਖ਼ྫ(ਖ਼͍͠༁จϖΞ)ͷྨࣅ࠷େԽ [26] Guo+: E ff ective Parallel Corpus Mining using Bilingual Sentence Embeddings, WMT ‘18 LaBSEͷߏཁૉ: Translation ranking task 41 զഐೣͰ͋Δɻ Ja I am a cat. En Nice to meet you. En ͚ۙͮΔ ԕ͚͟Δ ਖ਼ྫ ෛྫ
•“ranking”ͱݴ͍ͭͭͬͯΔͷਖ਼ྫ(ਖ਼͍͠༁จϖΞ)ͷྨࣅ࠷େԽ [26] Guo+: E ff ective Parallel Corpus Mining using Bilingual Sentence Embeddings, WMT ‘18 LaBSEͷߏཁૉ: Translation ranking task 42 զഐೣͰ͋Δɻ Ja I am a cat. En Nice to meet you. En ͚ۙͮΔ ԕ͚͟Δ ਖ਼ྫ ෛྫ ྨࣅ࠷େԽ ྨࣅ࠷খԽ
• ྨࣅͷߦྻ͕Ͱ͖Δ LaBSEͷߏཁૉ: Translation ranking task 43 ࢲϖϯͰ͢ɻ I am a pen. I’m a cat. Nice to m eet you. Sentence em bedding I’m a perfect hum an. ਖ਼ྫ զഐೣͰ͋Δɻ ͡Ί·ͯ͠ɻ จຒΊࠐΈ ࢲᘳͳਓؒͰ͢ɻ
• ʹྨࣅߦྻͷର֯ઢ͕ਖ਼ղ LaBSEͷߏཁૉ: Translation ranking task 44 ࢲϖϯͰ͢ɻ I am a pen. I’m a cat. Nice to m eet you. Sentence em bedding I’m a perfect hum an. ਖ਼ྫ զഐೣͰ͋Δɻ ͡Ί·ͯ͠ɻ จຒΊࠐΈ ࢲᘳͳਓؒͰ͢ɻ ྨࣅ0.98… 0.24…
• 1ରNΛNճ܁Γฦ͢Πϝʔδ LaBSEͷߏཁૉ: Translation ranking task 45 ࢲϖϯͰ͢ɻ I am a pen. I’m a cat. Nice to m eet you. Sentence em bedding I’m a perfect hum an. ਖ਼ྫ զഐೣͰ͋Δɻ ͡Ί·ͯ͠ɻ จຒΊࠐΈ ࢲᘳͳਓؒͰ͢ɻ 0.24… ྨࣅ0.98…
• ຒΊࠐΈͷੵ ϕ LaBSEͷߏཁૉ: Translation ranking task 46 ࢲϖϯͰ͢ɻ I am a pen. I’m a cat. Nice to m eet you. Sentence em bedding I’m a perfect hum an. ਖ਼ྫ զഐೣͰ͋Δɻ ͡Ί·ͯ͠ɻ จຒΊࠐΈ ࢲᘳͳਓؒͰ͢ɻ 0.24… ྨࣅ0.98…
• (ෛྫΛߋʹ૿͢͜ͱՄೳ) LaBSEͷߏཁૉ: Translation ranking task 47 ࢲϖϯͰ͢ɻ I am a pen. I’m a cat. Nice to m eet you. Sentence em bedding I’m a perfect hum an. ਖ਼ྫ զഐೣͰ͋Δɻ ͡Ί·ͯ͠ɻ จຒΊࠐΈ ࢲᘳͳਓؒͰ͢ɻ
• (ෛྫΛߋʹ૿͢͜ͱՄೳ) LaBSEͷߏཁૉ: Translation ranking task 48 ࢲϖϯͰ͢ɻ I am a pen. I’m a cat. Nice to m eet you. Sentence em bedding I’m a perfect hum an. batch_size ਖ਼ྫ batch_size զഐೣͰ͋Δɻ ͡Ί·ͯ͠ɻ จຒΊࠐΈ ࢲᘳͳਓؒͰ͢ɻ
•softmaxͰଛࣦ͕ඇରশੑʹ LaBSEͷߏཁૉ: Translation ranking task 49 ࢲϖϯͰ͢ɻ I am a pen. I’m a cat. Nice to m eet you. Sentence em bedding I’m a perfect hum an. batch_size ਖ਼ྫ batch_size զഐೣͰ͋Δɻ ͡Ί·ͯ͠ɻ จຒΊࠐΈ ࢲᘳͳਓؒͰ͢ɻ
•softmaxͰଛࣦ͕ඇରশੑʹ LaBSEͷߏཁૉ: Translation ranking task 50 ࢲϖϯͰ͢ɻ I am a pen. I’m a cat. Nice to m eet you. Sentence em bedding I’m a perfect hum an. batch_size ਖ਼ྫ batch_size զഐೣͰ͋Δɻ ͡Ί·ͯ͠ɻ จຒΊࠐΈ ࢲᘳͳਓؒͰ͢ɻ
• ղফ͢ΔͨΊɺ2ํ(→↓)ͷଛࣦΛ͠߹ΘͤΔ LaBSEͷߏཁૉ: Translation ranking task 51 ࢲϖϯͰ͢ɻ I am a pen. I’m a cat. Nice to m eet you. Sentence em bedding I’m a perfect hum an. batch_size ਖ਼ྫ batch_size զഐೣͰ͋Δɻ ͡Ί·ͯ͠ɻ จຒΊࠐΈ ࢲᘳͳਓؒͰ͢ɻ