Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Dialogue Natural Language Inference
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Scatter Lab Inc.
April 03, 2020
Research
2.4k
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Dialogue Natural Language Inference
Scatter Lab Inc.
April 03, 2020
More Decks by Scatter Lab Inc.
See All by Scatter Lab Inc.
zeta introduction
scatterlab
0
1.9k
SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
scatterlab
0
4.4k
Adversarial Filters of Dataset Biases
scatterlab
0
2.3k
Sparse, Dense, and Attentional Representations for Text Retrieval
scatterlab
0
2.3k
Weight Poisoning Attacks on Pre-trained Models
scatterlab
0
2.2k
Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval
scatterlab
0
2.5k
Beyond Accuracy: Behavioral Testing of NLP Models with CheckList
scatterlab
0
2.3k
Open-Retrieval Conversational Question Answering
scatterlab
0
2.3k
What Can Neural Networks Reason About?
scatterlab
0
2.3k
Other Decks in Research
See All in Research
さくらインターネット研究所テックトーク2026春、研究開発Gr.25年度成果26年度方針
kikuzo
0
140
機械学習で作った ポケモン対戦bot で 遊ぼう!
fufufukakaka
0
250
2026年度 生成AI を活用した論文執筆ガイド/ワークショップ / 2026 Academic Year Guide to Writing Papers Using Generative AI - Workshop
ks91
PRO
0
170
LINEヤフー データサイエンス Meetup「三井物産コモディティ予測チャレンジ」の舞台裏-AlpacaTechパート
gamella
1
550
Ankylosing Spondylitis
ankh2054
0
170
YOLO26_ Key Architectural Enhancements and Performance Benchmarking for Real-Time Object Detection
satai
3
780
AY 2026 Guide to Academic Writing Using Generative AI - Workshop
ks91
PRO
0
120
多様なデータを許容し学習し続ける模倣学習 / Advanced Imitation Learning for VLA
prinlab
0
210
業界横断 副業コンプライアンス調査 三者(副業者・本業先・発注者)におけるトラブル認知ギャップの構造分析
fkske
0
1.3k
明日から使える!研究効率化ツール入門
matsui_528
13
7.2k
ScoreMatchingRiesz for Automatic Debiased Machine Learning and Policy Path Estimation with an Application to Japanese Monetary Policy Evaluation
masakat0
0
290
2026年3月1日(日)福島「除染土」の公共利用をかんがえる
atsukomasano2026
0
620
Featured
See All Featured
HU Berlin: Industrial-Strength Natural Language Processing with spaCy and Prodigy
inesmontani
PRO
0
400
Faster Mobile Websites
deanohume
310
31k
The Curious Case for Waylosing
cassininazir
1
380
How to audit for AI Accessibility on your Front & Back End
davetheseo
0
400
Performance Is Good for Brains [We Love Speed 2024]
tammyeverts
12
1.7k
Digital Projects Gone Horribly Wrong (And the UX Pros Who Still Save the Day) - Dean Schuster
uxyall
0
1.6k
Building Better People: How to give real-time feedback that sticks.
wjessup
370
20k
[RailsConf 2023] Rails as a piece of cake
palkan
59
6.7k
I Don’t Have Time: Getting Over the Fear to Launch Your Podcast
jcasabona
34
2.8k
The AI Revolution Will Not Be Monopolized: How open-source beats economies of scale, even for LLMs
inesmontani
PRO
3
3.5k
Building an army of robots
kneath
306
46k
Self-Hosted WebAssembly Runtime for Runtime-Neutral Checkpoint/Restore in Edge–Cloud Continuum
chikuwait
0
570
Transcript
Dialogue Natural Language Inference Sean Welleck et al., ACL’19 ࢿࠁ
(ML Research Scientist, Pingpong)
ݾର ݾର 1. Dialogue Consistency and NLI 2. Dialogue NLI
Dataset 1. Triple Generation 2. Triple Annotation 3. Re-ranking with NLI 4. Evaluation 1. On Dialogue NLI 2. On Consistency in Dialogue
Dialogue Consistency and NLI Dialogue Consistency and NLI
Dialogue Consistency and NLI Dialogue Consistency and NLI • ചীࢲ
࠺ੌҙࢿ • ࢚ਵ۽ ൞ӈೞա ೠߣ ߊࢤೞݶ ఋѺ ఀ • Semanticೠ ޙਸ ݅٘ח ֢۱݅ਵ۽ח ೧Ѿ ࠛо • Natural Language Inference (NLI) • NLU, sentence representation ١ NLP ߈ਸ ੜೞӝ ਤೠ ࣻױਵ۽ॄ જ • NLI ݽ؛ downstream task ࢿמ ೱ࢚ী ӝৈ
Dialogue Consistency and NLI Dialogue Consistency and NLI • ಕܰࣗա:
ޙ ഋక۽ അ. • ചীࢲ ੌҙࢿ • ӝࠄਵ۽ Persona consistency • ֤ܻਵ۽ ߓغח ݈ ইפۄب э ࢎۈ ݈ೡ Ѫ э ঋ ޙ • : ച ղীࢲ ೠ ࢎۈ ೠ ف ݈ ߓغח • : Ӓ ࢎۈ ಕܰࣗա৬ ߓغח P = {p1 , …, pm } (uA i , uA j ) (uA i , pA k )
Dialogue Consistency and NLI
Dialogue NLI Dataset Dialogue NLI Dataset
Dialogue NLI Dataset Dialogue NLI Dataset • ߊച-ಕܰࣗա , ಕܰࣗա-ಕܰࣗա
हਵ۽ ܖয • ߊച-ߊച हب ನೣغয ਵա प ೞ ঋ (ui , pj ) (pi , pj ) (ui , uj )
Triple Generation Dialogue NLI Dataset • Triple • PersonaChatীࢲ ಕܰࣗա
ޙҗ ߊച ੌࠗ۽ Triple ۨ࠶ • ՙܻ Entailment, Neutral, Contradiction కӦ • Tripleਸ ӝળਵ۽ E, N, Cܳ ݅ٚ! • Entailment: э Tripleী ࣘೞח ف ޙՙܻ • Neutral, Contradiction: 3о ߑߨ (e1 , r, e2 ) (u, p), (p, p)
Neutral Pairs Dialogue NLI Dataset • Miscellaneous utterance যו Tripleীب
ࣘೞ ঋח ߊച ৬ ಕܰࣗա ޙ ҙ҅ח Neutral • Persona pairing Ground truth ಕܰࣗաՙܻח ࠂغѢա ݽࣽغ ঋחח ઁ ೞী э Tripleਸ ҕਬೞ ঋ ח ಕܰࣗաՙܻ Ҋ, ೞਤ ޙٜՙܻب • Relation swap ࢲ۽ ة݀ੋ ࢎपਸ աఋղח ҙ҅ ী ࣘೞח ޙٜՙܻ u p (r, r′ )
Contradiction Pairs Dialogue NLI Dataset • Relation swap ࢲ۽ ݽࣽغח
ҙ҅ ী ࣘೞח ޙٜՙܻ • Entity swap Triple ীࢲ ೧ࢲ о عਸ ٸ ݽࣽغח ҃ ف Tripleী ࣘೞח ޙٜ ՙܻ • Numeric Tripleী ನೣػ ंܳ ܲ ं۽ ߄Լࢲ ٜ݅য ޙҗ ਗې Tripleী ؍ ޙٜਸ (r, r′ ) (e1 , r, e2 ) e2 → e′ 2 (e1 , r, e′ 2 )
Triple Annotation Dialogue NLI Dataset • ಕܰࣗա ޙ →
<category> <relation> <category> ex) <person> have_pet <animal> relation , entity ա, entityח schemaী হਵݶ ੑ۱ • ٜ݅য Triple۽ ࠙ܨ ف ઑѤ ೞաܳ ݅ೞݶ 1. о sub-string 2. (e1 , r, e2 ) ∈ ℛ ∈ ℰ u ∈ U u ∈ (e1 , r, e2 ) e2 u sim(u, p) ≥ τ
Statistics Dialogue NLI Dataset • Gold-standard test set: test set
ۨ࠶ ݏҊ ೠ ࢎۈ 3ݺ 2ݺ ࢚ੋ ࢠ݅ ݽ Ѫ
Dialogue NLI Dataset
Re-ranking with NLI Re-ranking with NLI
Consistent Dialogue Agents via NLI Re-ranking with NLI •
ߊച ஏী NLI ݽ؛ ஏ Ѿҗ ഝਊ NLI ݽ؛ Contradictionۄ ౸ױೠ റࠁח confidence݅ఀ ಕօ౭ܳ ષ ࢜۽ ࣻ۽ Re-ranking
Evaluation Evaluation
On Dialogue NLI Evaluation • InferSent, ESIM ف ݽ؛ ࢎਊ
On Consistency in Dialogue Evaluation • ݽ؛ • ച ݽ؛:
Key-value memory networkܳ PersonaChatਵ۽ ण • NLI ݽ؛: ESIMਸ Dialogue NLI۽ ण • ಣо ࣇ • PersonaChatীࢲ Triple ী ೧ೞח ߊച ܳ Ҋ agent ಕܰࣗաী ী ࣘೞח ޙ ਵݶ ܳ ਵ۽ р • Entailment ޙ 10ѐ, Contradiction ޙ 10ѐ, ޙ 10ѐܳ റࠁ۽ م • ݫܼ • Hits@k, Entail@k, Contradict@k (e1 , r, e2 ) u (e1 , r, e2 ) u
Evaluation
Result Evaluation
Human Evaluation Evaluation • ParlAIܳ ా೧ w/o re-rankingҗ w/ re-rankingਸ
࠺Ү • ಣо ୋب • ݽ؛ ݃ա ಕܰࣗաܳ ੜ ߸೮חо? (1~5) • ݽ؛ п ߊചо ಕܰࣗա৬ ੌҙغחо? (0, 1) • ݽ؛ п ߊചо ݽ؛ ߊച, ݽ؛ ಕܰࣗա৬ ݽࣽغחо? (0, 1)
хࢎפ✌ ୶о ޙ ژח ҾӘೠ ݶ ઁٚ ইې োۅ۽
োۅ ࣁਃ! ࢿࠁ (ML Research Scientist, Pingpong)
[email protected]