Slide 1

Slide 1 text

จݙ঺հʢʣ Treat the Word As a Whole or Look Inside? Subword Embeddings Model Language Change and Typology Yang Xu, Jiasheng Zhang, David Reitter 1st International Workshop on Computational Approaches to Historical Language Change, ACL2019 ௕Ԭٕज़Պֶେֶ ࣗવݴޠॲཧݚڀࣨɹ ૬ాɹଠҰ

Slide 2

Slide 2 text

Abstract • ݴޠֶతͳԾઆΛௐ΂ΔͨΊʹ subword Λߟྀͨ͠୯ޠ෼ࢄදݱΛఏҊ • Indo-European ͷݴޠ͸৽͍͠୯ޠ΄Ͳ subword ʹର͢ΔॏΈ͕૿͑ɺɹ ٯʹதࠃޠ͸ subword ʹର͢ΔॏΈ͕ݮΓɺ୯ޠʹର͢ΔॏΈ͕૿͑ͨ !2

Slide 3

Slide 3 text

Motivation w ݴޠֶతͳ݁࿦ ʮதࠃޠʹ͓͍ͯɺ࣌ؒͱͱ΋ʹ༏Ґੑ͕୯ԻઅˠೋԻઅʹҠͬͨʯ w Ծઆ ʮݱ୅ͷதࠃޠʹ͓͍ͯɺ୯ޠʹؚ·ΕΔ׽ࣈจࣈʢTVCXPSEʣ ͸ҙຯతͳ໾ׂ͕গͳ͍ʯ !3

Slide 4

Slide 4 text

Related Work w $#08ʢDPOUFYU ͔ΒUBSHFU Λ༧ଌʣ w $IBSBDUFSFOIBODFEXPSEFNCFEEJOH $8&  w 4LJQHSBNʢUBSHFU ͔ΒDPOUFYU Λ༧ଌʣ w GBTU5FYU vc ui vc ui !4 ୯ޠͱจࣈΛಉ͡ॏཁ౓Ͱѻ͏

Slide 5

Slide 5 text

Method w %ZOBNJDTVCXPSEJODPSQPSBUFEFNCFEEJOHNPEFM %4&  w %4&$#08 w %4&4( w ୯ޠʹ͸୯ޠͷॏΈ ͰɺTVCXPSEʹ͸ ͰॏΈ෇͚͢Δ hw i 1 − hw i !5

Slide 6

Slide 6 text

Method !6

Slide 7

Slide 7 text

Experiment w %BUBTFUT w 5SBJOJOHXPSEFNCFEEJOH8JLJQFEJBEBUBCBTFEVNQT w $IJOFTF &OHMJTI 'SFODI (FSNBO *UBMJBO 4QBOJTI w .PEFM w %4&$#08 %4&4(ʢఏҊख๏ʣ w $8& GBTU5FYU !7

Slide 8

Slide 8 text

Experiment w ࣮ݧ߲໨   ͱ୯ޠͷൃੜ࣌ظͱͷ૬ؔ w ൃੜ࣌ظɿ͋Δޠ͕(PPHMF#PPLT/HSBNʹॳΊͯొ৔ͨ͠೥  ޠͷҙຯλεΫ w &NCFEEJOHͷੑೳΛଌΔ w 4JNJMBSJUZͱ"OBMPHZΛ࢖༻ hw i !8

Slide 9

Slide 9 text

Result ୯ޠͷॏΈ ͱൃੜ࣌ظͱͷ૬ؔɿ*OEP&VSPQFBOͱதࠃͰਖ਼൓ର w hw i !9

Slide 10

Slide 10 text

Result ୯ޠͷॏΈ ͱൃੜ࣌ظͱͷ૬ؔɿ*OEP&VSPQFBOͱதࠃͰਖ਼൓ର w hw i !10 ࣌୅͕ਐΉͱ ୯ޠʹର͢ΔॏΈ͕ݮগ ˣ ୯ޠΑΓ 4VCXPSEΛॏࢹ

Slide 11

Slide 11 text

Result ୯ޠͷॏΈ ͱൃੜ࣌ظͱͷ૬ؔɿ*OEP&VSPQFBOͱதࠃͰਖ਼൓ର w hw i !11 ࣌୅͕ਐΉͱ ୯ޠʹର͢ΔॏΈ͕૿Ճ ˣ 4VCXPSEΑΓ ୯ޠΛॏࢹ ʢԾઆ͕͔֬ΊΒΕͨʣ

Slide 12

Slide 12 text

Result w ͦΕͧΕͷάϧʔϓͰൺֱ w $#08ܥʢ%4&$#08 $8&ʣ w 4LJQHSBNܥʢ%4&4( GBTUUFYUʣ w %4&4(Ͱੑೳͷ޲্Λ֬ೝ !12

Slide 13

Slide 13 text

Conclusion w ԾઆΛݕূ͢ΔҝʹɺTVCXPSEΛߟྀ͢Δ୯ޠ෼ࢄදݱΛఏҊͨ͠ w *OEP&VSPQFBOͷݴޠͰ͸৽͘͠ੜ·ΕΔ୯ޠ΄ͲTVCXPSEʹҙຯͷ ॏΈ͕ॏࢹ͞ΕɺதࠃޠͰ͸ٯʹTVCXPSE΁ͷॏΈ͕ݮΓɺ୯ޠͦͷ΋ ͷʹରͯ͠ॏΈ͕ͭ͘Α͏ʹͳͬͨʢԾઆΛݕূͨ͠ʣ !13

Slide 14

Slide 14 text

No content

Slide 15

Slide 15 text

Discussion w ࣮ݧʹରͯ͠۩ମతͳൺֱΛߦͬͨ w தࠃɿ೥୅ͷۙ୅ԽͰٕज़΍Պֶ͕ൃలͨ͜͠ͱʹΑΓɺ৽͍͠୯ ޠ͕ೖ͖ͬͯͨʁ !15