Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
【輪講資料】Zero-shot Cross-lingual Semantic Parsing
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Yano
May 03, 2023
180
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
【輪講資料】Zero-shot Cross-lingual Semantic Parsing
研究室内の輪講で使った資料です。
Yano
May 03, 2023
More Decks by Yano
See All by Yano
【輪講資料】Decoding Dense Embeddings: Sparse Autoencoders for Interpreting and Discretizing Dense Retrieval
yano0
0
120
【輪講資料】Length-Induced Embedding Collapse in PLM-based Models
yano0
0
170
【輪講資料】How Do Large Language Models Acquire Factual Knowledge During Pretraining?
yano0
0
300
NLP2025参加報告
yano0
0
700
【輪講資料】ReAct: Synergizing Reasoning and Acting in Language Models / Tree of Thoughts: Deliberate Problem Solving with Large Language Models
yano0
0
270
【輪講資料】SimLM: Pre-training with Representation Bottleneck for Dense Passage Retrieval
yano0
2
390
【輪講資料】From Zero to Hero: On the Limitations of Zero-Shot Language Transfer with Multilingual Transformers
yano0
0
140
Featured
See All Featured
Leveraging LLMs for student feedback in introductory data science courses - posit::conf(2025)
minecr
1
280
Efficient Content Optimization with Google Search Console & Apps Script
katarinadahlin
PRO
1
600
4 Signs Your Business is Dying
shpigford
187
22k
Agile that works and the tools we love
rasmusluckow
331
21k
Product Roadmaps are Hard
iamctodd
PRO
55
12k
Why You Should Never Use an ORM
jnunemaker
PRO
61
9.9k
AI Search: Implications for SEO and How to Move Forward - #ShenzhenSEOConference
aleyda
1
1.3k
How to train your dragon (web standard)
notwaldorf
97
6.7k
10 Git Anti Patterns You Should be Aware of
lemiorhan
PRO
659
62k
What’s in a name? Adding method to the madness
productmarketing
PRO
24
4.1k
The Power of CSS Pseudo Elements
geoffreycrofte
82
6.3k
SERP Conf. Vienna - Web Accessibility: Optimizing for Inclusivity and SEO
sarafernandez
2
1.5k
Transcript
Zero-shot Cross-lingual Semantic Parsing Tom Sherborne, Mirella Lapata ACL 2022
֓ཁ 2 ✓ θϩγϣοτଟݴޠҙຯղੳϞσϧɺZX-parseͷఏҊ • ݴޠؒજࡏදݱͷΞϥΠϝϯτʹண͠సҠֶशͷޡࠩΛ࠷খԽ • ෳͷଛࣦؔΛಋೖ͢ΔϚϧνλεΫֶशʹΑΓҟͳΔݴޠͷ જࡏදݱ͕ྨࣅ͢ΔΑ͏ʹ •
ରݴޠͷϖΞσʔλΛඞཁͱͤͣɺӳޠϖΞͱରݴޠͷࣗવ จͷΈར༻ ✓ θϩγϣοτҙຯղੳλεΫʹ͓͍ͯෳͷݴޠͰߴ͍ੑೳ
બΜͩཧ༝ • ݴޠԣஅతͳݚڀʹڵຯ͕͋ΔͨΊ • ಛʹݴޠԣஅతͳજࡏۭؒΛ࡞ΔͰ໘നͦ͏ • ࣗͷݚڀΛؚΉ͞·͟·ͳసҠֶशλεΫͰར༻Ͱ͖ͦ͏ 3
ҙຯղੳ (Semantic Parsing) • ࣗવݴޠͷൃΛཧܗࣜ(Logical Form)ʹม • ͞·͟·ͳλεΫͰॏཁͳΠϯϑϥ • ࣭ԠɺରγεςϜɺػցͷࢦࣔ…
4 -JTU fl JHIUTGSPN4BO'SBODJTDPUP1JUUTCVSHI 4&-&$5%*45*/$5 fl JHIU@ fl JHIU@JE'30.ʜ
• ෳͷݴޠ͔ΒͳΔࣗવจΛಉ͡ཧܗࣜʹม[1] ଟݴޠҙຯղੳ (Cross-lingual Semantic Parsing) 5 <>.VMUJMJOHVBM4FNBOUJD1BSTJOH1BSTJOH.VMUJQMF-BOHVBHFTJOUP4FNBOUJD3FQSFTFOUBUJPOT 4&-&$5%*45*/$5 fl
JHIU@ fl JHIU@JE'30.ʜ -JTU fl JHIUTGSPN4BO 'SBODJTDPUP1JUUTCVSHI αϯϑϥϯγείൃϐοπ όʔάߦ͖ͷϑϥΠτΛ ϦετΞοϓ͍ͯͩ͘͠͞ɻ ʹ
ઌߦݚڀ • ӳޠͷࣗવจ-ཧܗࣜͷฒྻσʔλΛ༁͠ɺ֤ରݴޠͷࣗવ จ-ཧܗࣜϖΞσʔλΛར༻ • ػց༁Λ༻͍Δ߹[2] • ػց༁ᘳͰͳ͍ • ಛʹϦιʔεݴޠʹ͓͍ͯ
ߴ࣭ͳػց༁͍͠ • ਓख༁Λ༻͍Δ߹[3] • ߴίετ 6 <>#PPUTUSBQQJOHB$SPTTMJOHVBM4FNBOUJD1BSTFS <>/FVSBM"SDIJUFDUVSFTGPS.VMUJMJOHVBM4FNBOUJD1BSTJOH ཧܗࣜ ӳޠ ରݴޠ ༁
• ରݴޠͷฒྻσʔλΛΘͳ͍ θϩγϣοτͱ͢Δ ➡ ӳޠͷࣗવจ-ཧܗࣜϖΞσʔλͱ ରݴޠͷࣗવจͷΈར༻ ✓ Ϟσϧߏ • ӳޠ͔Βಘͨજࡏදݱ͔Β
ཧܗࣜΛੜ͢Δσίʔμ • ݴޠԣஅతͳʢӳޠͱྨࣅͨ͠ʣ જࡏදݱΛ֫ಘ͢ΔΤϯίʔμ ఏҊख๏ 7 αϯϑϥϯγείൃϐοπ όʔάߦ͖ͷϑϥΠτΛ ϦετΞοϓ͍ͯͩ͘͠͞ɻ 4&-&$54&-&$54&-&$5 8)&3& '30.4&-&$5 Τϯίʔμ σίʔμ ❌ 4&-&$5%*45*/$5 fl JHIU@ fl JHIU@JE'30.ʜ -JTU fl JHIUTGSPN4BO 'SBODJTDPUP1JUUTCVSHI ⭕ Τϯίʔμ σίʔμ ˛ΞϥΠϝϯτΛߦΘͳ͍߹ɺݴޠ͝ͱʹ ҟͳΔજࡏදݱͱͳΓग़ྗ͕ҟͳΔ જࡏදݱ
ϚϧνλεΫֶश • ̍ͭͷΤϯίʔμʹର͠ෳͷతؔΛ༻͍ɺಉ࣌ʹ࠷దԽ[4] • ଟݴޠλεΫʹ͓͍ͯɺʮιʔεݴޠ (ӳޠͳͲ) ͰͷඪλεΫʴ ιʔεݴޠͱରݴޠͷΞϥΠϝϯτʯΛಉ࣌ʹ࠷దԽ͢Δݚڀ ͕ଘࡏ •
ԻݴޠཧղɺςΩετ؆ུԽɺґଘੑߏจղੳɺػց༁ 8 <>.VMUJUBTL4FRVFODFUP4FRVFODF-FBSOJOH
ఏҊϞσϧɿZX-Parse • ϚϧνλεΫSeq2seqϞσϧ ✓ తͷλεΫʹՃ͑ิॿతͳλεΫΛಋೖ • DLF ɿཧܗࣜͷੜ • DNL
ɿࣗવݴޠͷੜ • LPɿݴޠ༧ଌ 9 Τϯίʔμ z DLF LP ࣗવจ DNL ֶश(EN) ਪ જࡏදݱ Transformer x 6 (mBARTͷΤϯίʔμ) Transformer x 6 ֶश(ଞݴޠ) ཧܗࣜͷੜ ӳޠͰͷΈֶश
DLF ɿ Generating Logical Forms 10 Τϯίʔμ z DLF LP
List fl ights from San Francisco to Pittsburgh? DNL SELECT DISTINCT fl ight_1. fl ight_id FROM … • ӳޠͷࣗવจ͔Βಉ͡ҙຯͷཧܗࣜΛੜ͢Δ ➡ཧܗࣜੜೳྗΛʹ͚ͭΔ ֶश࣌ ਪ࣌ જࡏදݱ
• ରݴޠͷࣗવจʹϊΠζΛՃ͑ɺ࠶ߏங͘͠ӳޠ༁ ➡֤ݴޠͷࣗવจʹΤϯίʔμʔΛదԠͤ͞Δ • ݴޠݻ༗ͷಛੑΛʹ͚Δ DNL ɿ Generating Natural Language
11 Τϯίʔμ z DLF LP αϯϑϥϯγείൃϐοπ όʔάߦ͖ͷϑϥΠτΛ ϦετΞοϓ͍ͯͩ͘͠͞ɻ DNL Н ϊΠζ Н αϯϑϥϯγείൃϐοπ όʔάߦ͖ͷϑϥΠτΛ ϦετΞοϓ͍ͯͩ͘͠͞ɻ List fl ights from San Francisco to Pittsburgh? ֶश࣌ ਪ࣌ જࡏදݱ
LPɿ Language Prediction 12 Τϯίʔμ z DLF LP List fl
ights from San Francisco to Pittsburgh? DNL English ֶश࣌ ਪ࣌ • ೖྗจͷݴޠΛྨثͰ༧ଌ͢Δ ➡ΤϯίʔμʹݴޠΛ۠ผͤ͞Δ જࡏදݱ
LPɿ Language Prediction 13 Τϯίʔμ z DLF LP List fl
ights from San Francisco to Pittsburgh? DNL English ֶश࣌ ਪ࣌ • ೖྗจͷݴޠΛྨثͰ༧ଌ͢Δ ➡ΤϯίʔμʹݴޠΛ۠ผͤ͞Δ ✦ ٯ࣌ʹޯΛసͤ͞Δ ➡ ݴޠΛ۠ผͤ͞ͳ͍ • ݴޠʹͱΒΘΕͳ͍දݱ જࡏදݱ
• ಉ࣌ʹ̏ͭͷతؔΛ࠷దԽ • ཧܗࣜͷੜʴݴޠ͝ͱͷಛΛֶशʴݴޠΛ۠ผ͠ͳ͍ • ରݴޠͷϖΞίʔύεΛඞཁͱ͠ͳ͍θϩγϣοτҙຯղੳ ZX-Parse 14 Τϯίʔμ z
DLF LP DNL ٯ࣌ ཧܗࣜͷੜ ࣗવݴޠͷੜ ʢ࠶ߏ/༁ʣ ݴޠ༧ଌ − ∂LLP ∂θ ∂LNL ∂θ ∂LLF ∂θ જࡏදݱ
σʔληοτ ✦ ҙຯղੳσʔληοτ • ATIS • ཱྀߦυϝΠϯͷଟݴޠࣗવจͱཧܗࣜͷϖΞίʔύε • ӳޠ, ϑϥϯεޠ,
ϙϧτΨϧޠ, εϖΠϯޠ, υΠπޠ, தࠃޠ, ώϯσΟʔޠ, τϧίޠ • Overnight • ̔υϝΠϯͷӳޠࣗવจͱཧܗࣜͷϖΞίʔύε • ਪ࣌ͷΈ༁Ͱ࡞͞ΕͨதࠃޠͱυΠπޠσʔλΛར༻ ✦ ࣗવݴޠσʔληοτ • MKQA • ࣭จͷର༁ίʔύε • ӳޠ, ϑϥϯεޠ, ϙϧτΨϧޠ, εϖΠϯޠ, υΠπޠ, தࠃޠ • ParaCrawl • Webର༁ίʔύε 15
σʔληοτ ✦ ҙຯղੳσʔληοτ • ATIS • ཱྀߦυϝΠϯͷଟݴޠࣗવจͱཧܗࣜͷϖΞίʔύε • ӳޠ, ϑϥϯεޠ,
ϙϧτΨϧޠ, εϖΠϯޠ, υΠπޠ, தࠃޠ, ώϯσΟʔޠ, τϧίޠ • Overnight • ̔υϝΠϯͷӳޠࣗવจͱཧܗࣜͷϖΞίʔύε • ਪ࣌ͷΈ༁Ͱ࡞͞ΕͨதࠃޠͱυΠπޠσʔλΛར༻ ✦ ࣗવݴޠσʔληοτ • MKQA • ࣭จͷର༁ίʔύε • ӳޠ, ϑϥϯεޠ, ϙϧτΨϧޠ, εϖΠϯޠ, υΠπޠ, தࠃޠ • ParaCrawl • Webର༁ίʔύε 16
༁ϕʔεϥΠϯ ✦ “࠷খݶͷྗ”ϕʔεϥΠϯ • ػց༁Λ༻͍ͯ࡞ͨ͠ϕʔεϥΠϯ • Translate-Test • ςετηοτΛӳޠʹ༁ •
Translate-Train • ֶशηοτΛରݴޠʹ༁ ✦ “࠷େݶͷྗ”ϕʔεϥΠϯ • Monolingual Training • ֶशηοτΛਓखͰରݴޠʹ༁ 17
࣮ݧ݁Ռɿ༁ϕʔεϥΠϯͱͷൺֱ • ӳޠҎ֎ͷݴޠͷθϩγϣοτҙຯղੳλεΫͰSOTA 18 • Monolingual Training: ֶशηοτΛਓखͰରݴޠʹ༁ • Translate-Train:
ֶशηοτΛରݴޠʹ༁ • Translate-Test: ςετηοτΛӳޠʹ༁ • ATIS: ࣭จͱཧܗࣜͷϖΞσʔλ • Overnight: ෳυϝΠϯͷจͱཧܗࣜͷϖΞσʔλ ※ώϯσΟʔޠ(HI)ɺτϧίޠ(TR)ର༁ίʔύε͕ ଘࡏ͠ͳ͍ͨΊิॿతͰͷֶशʹؚ·Ε͍ͯͳ͍
࣮ݧ݁Ռɿ࠷খݶͷྗϕʔεϥΠϯͱͷൺֱ • ิॿతͰͷֶशΛߦͳͬͨશͯͷݴޠ(HI,TRҎ֎)ʹ͓͍ͯੑೳ্ • OvernightATIS΄Ͳੑೳ্͍ͯ͠ͳ͍ • υϝΠϯ͕ଟ༷ͳͨΊɺ՝͕ෳ߹తʹͳ͍ͬͯΔ • ӳޠʹ͍ۙݴޠͰTranslate-TrainTranslate-TestΛԼճΔ͕ɺԕ͍ݴޠ Ͱੑೳ͕Լ͢Δ
• ػց༁ͷํͷӨڹʁ 19
࣮ݧ݁ՌɿαϒλεΫͷӨڹ 20 • LFͷΈͷ߹ɺTranslate-TestΑΓੑೳ͕͍ • NLͱLPͲͪΒ͔Ճ͢Δ࣌ɺNLΛՃͨ͠ํ͕ੑೳ͕ߴ͍ • ֤ݴޠͷࣗવݴޠจʹదԠ͢Δํ͕େࣄ
࣮ݧ݁ՌɿαϒλεΫͷӨڹ 21 • LFͷΈͷ߹ɺTranslate-TestΑΓੑೳ͕͍ • NLͱLPͲͪΒ͔Ճ͢Δ࣌ɺNLΛՃͨ͠ํ͕ੑೳ͕ߴ͍ • ֤ݴޠͷࣗવจʹదԠ͢Δ͜ͱ͕େࣄ
࣮ݧ݁ՌɿαϒλεΫͷӨڹ • αϒλεΫͰֶशʹؚ·Εͳ͍ݴޠ(HI,TR)Ͱੑೳ্ • ֶश͍ͯ͠ͳͯ͘ݴޠؒͷજࡏදݱ͕վળ͞Ε͍ͯΔ 22
࣮ݧ݁ՌɿαϒλεΫͷӨڹ • αϒλεΫͰֶशʹؚ·Εͳ͍ݴޠ(HI,TR)Ͱੑೳ্ • ֶश͍ͯ͠ͳͯ͘ݴޠؒͷજࡏදݱ͕վળ͞Ε͍ͯΔ 23 ˝ຒΊࠐΈͷՄࢹԽ
࣮ݧ݁ՌɿαϒλεΫͷӨڹ • ୯ݴޠͷΈར༻͢Δ߹(τ = 0.0)ΑΓ༁ߦ͏߹(τ = 0.5) ͷੑೳ͕ߴ͍ • దͳจΛར༻͢Δ߹(ParaCrawl)ΑΓਪ࣌ͷೖྗͱಉ࣭͡
จΛֶशʹར༻͢Δ߹(MKQA)ͷੑೳ͕ߴ͍ 24
·ͱΊ ✓ θϩγϣοτଟݴޠҙຯղੳϞσϧɺZX-parseͷఏҊ • જࡏදݱͷΞϥΠϝϯτʹண͠సҠֶशͷޡࠩΛ࠷খԽ • ෳͷଛࣦؔΛಋೖ͢ΔϚϧνλεΫֶशʹΑΓҟͳΔݴޠͷ જࡏදݱ͕ྨࣅ͢ΔΑ͏ʹ • ରݴޠͷϖΞσʔλΛඞཁͱͤͣɺӳޠϖΞͱରݴޠͷࣗવ
จͷΈར༻ ✓ θϩγϣοτҙຯղੳλεΫʹ͓͍ͯෳͷݴޠͰߴ͍ੑೳ 25