Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Open-Retrieval Conversational Question Answering
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Scatter Lab Inc.
July 24, 2020
Research
0
2.3k
Open-Retrieval Conversational Question Answering
Scatter Lab Inc.
July 24, 2020
Tweet
Share
More Decks by Scatter Lab Inc.
See All by Scatter Lab Inc.
zeta introduction
scatterlab
0
1.8k
SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
scatterlab
0
4.3k
Adversarial Filters of Dataset Biases
scatterlab
0
2.3k
Sparse, Dense, and Attentional Representations for Text Retrieval
scatterlab
0
2.3k
Weight Poisoning Attacks on Pre-trained Models
scatterlab
0
2.2k
Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval
scatterlab
0
2.5k
Beyond Accuracy: Behavioral Testing of NLP Models with CheckList
scatterlab
0
2.3k
What Can Neural Networks Reason About?
scatterlab
0
2.3k
Exploring the Limits of Transfer Learning with Unified Text-to-Text Transformer
scatterlab
0
2.2k
Other Decks in Research
See All in Research
大規模言語モデルにおけるData-Centric AIと合成データの活用 / Data-Centric AI and Synthetic Data in Large Language Models
tsurubee
1
480
LLM-Assisted Semantic Guidance for Sparsely Annotated Remote Sensing Object Detection
satai
3
440
第二言語習得研究における 明示的・暗示的知識の再検討:この分類は何に役に立つか,何に役に立たないか
tam07pb915
0
1k
空間音響処理における物理法則に基づく機械学習
skoyamalab
0
190
OWASP KansaiDAY 2025.09_文系OSINTハンズオン
owaspkansai
0
110
学習型データ構造:機械学習を内包する新しいデータ構造の設計と解析
matsui_528
6
2.9k
その推薦システムの評価指標、ユーザーの感覚とズレてるかも
kuri8ive
1
310
Akamaiのキャッシュ効率を支えるAdaptSizeについての論文を読んでみた
bootjp
1
430
Multi-Agent Large Language Models for Code Intelligence: Opportunities, Challenges, and Research Directions
fatemeh_fard
0
120
LLMアプリケーションの透明性について
fufufukakaka
0
110
Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification
satai
3
570
生成AI による論文執筆サポート・ワークショップ 論文執筆・推敲編 / Generative AI-Assisted Paper Writing Support Workshop: Drafting and Revision Edition
ks91
PRO
0
120
Featured
See All Featured
AI Search: Implications for SEO and How to Move Forward - #ShenzhenSEOConference
aleyda
1
1.1k
Measuring Dark Social's Impact On Conversion and Attribution
stephenakadiri
1
110
Noah Learner - AI + Me: how we built a GSC Bulk Export data pipeline
techseoconnect
PRO
0
100
The Impact of AI in SEO - AI Overviews June 2024 Edition
aleyda
5
720
ラッコキーワード サービス紹介資料
rakko
1
2.2M
Designing Dashboards & Data Visualisations in Web Apps
destraynor
231
54k
Distributed Sagas: A Protocol for Coordinating Microservices
caitiem20
333
22k
16th Malabo Montpellier Forum Presentation
akademiya2063
PRO
0
46
Agile Leadership in an Agile Organization
kimpetersen
PRO
0
77
DevOps and Value Stream Thinking: Enabling flow, efficiency and business value
helenjbeal
1
89
Highjacked: Video Game Concept Design
rkendrick25
PRO
1
280
Exploring anti-patterns in Rails
aemeredith
2
240
Transcript
Open-Retrieval Conversational Question Answering ࢲ࢚ (ܻࢲ ࢎ౭झ, ೝಯ)
ѐਃ Open-Retrieval Conversational Question Answering
ѐਃ ѐਃ • SIGIR 20 • Chen Qu, Liu Yang,
Cen Chen, Minghui Qiu, W. Bruce Croft, Mohit Iyyer • University of Massachusetts Amherst, Ant Financial, Alibaba Group • Conversational searchਸ ਤ೧ ConvQAܳ open retrieval settingਵ۽ ഛೞח Ѫ ਃ োҳ ਃ
ѐਃ ѐਃ • Conversational search information retrieval Ҿӓੋ ݾী ೞա
• ୭Ӕ োҳٜ conversational searchܳ response rankingҗ conversational question answering۽ ೧Ѿ • ױࣽ ߸ਸ য candidate setীࢲ ҊܰѢա য passageীࢲ spanਸ ࢶఖ • ח conversational searchীࢲ retrieval ӝୡੋ ഝਸ ޖदೞח ߑध • ࠄ ֤ޙ open-retrieval conversational question answering(ORConvQA) settingਸ ઁউೞৈ ޙઁܳ ೧Ѿ
ѐਃ ѐਃ • ORConvQAী ೠ োҳܳ ਤ೧ OR-QuAC ؘఠ ࣇਸ
ٜ݅ਵݴ ORConvQAܳ ਤೠ end-to-end दझమਸ ҳ୷ೞݴ ےझನݠ ӝ߈ retriever, reranker ৬ reader ١ਸ ನೣ • OR-QuACܳ ࢚ਵ۽ ೠ ֤ޙ प learnable retriever ਃࢿਸ ૐݺ • ژೠ ݽٚ दझమ ҳࢿ ਃࣗ(retriever, reranker ৬ reader)ীࢲ history modelingਸ ࢎਊೞݶ दझమ ѱ ѐࢶ ؼ ࣻ ਸ ࠁ
Dataset Open-Retrieval Conversational Question Answering
ORConvQA? Dataset • conversational search systemsਸ ҳ୷ೞӝ ਤೠ ୶о ױ҅۽ࢲ
߸ਸ Ҋܰ ӝ ী retrieve evidenceܳ large collection۽ ࠗఠ Ѩ࢝ 1. ࠁܳ ҳೞח ചܳ ઁҕ(information seeker৬ information provider)৬ ೞח QuAC dataset 2. QuAC ޙਸ context-independentೞѱ द ࢿೠ CANARD dataset 3. Wikipedia passage
Dataset
CANARD? Dataset • QuAC dialogsח self-containedೞ ঋח ড חؘ ח
ࠛ৮ೠ ୡӝ ޙਵ۽ ੋ೧ ߊࢤ • ܳ ٜয seekerীѱ a Chinese polymathic scientistੋ Zhang Hengী ೧ ߓۄҊ ೮חؘ ޙ "җҗ ӝࣿҗ যڃ ҙ ҅о णפө?” • ۞ೠ ࠛౠೞҊ ݽഐೠ ୡӝ ޙ ചܳ ೧ࢳೞӝ য۵ѱ ೞӝ ٸޙী ҕѐ Ѩ࢝ ജ҃ীࢲ ޙઁܳ ঠӝ • CANARD ؘఠ ࣁীࢲ ઁҕೞח context-independent rewritesਵ۽ ೞৈ ޙઁܳ ೧Ѿ, Ӓۢ "Zhang Heng җ ӝ ࣿҗ যڃ ҙ҅о णפө?"۽ ޙ
CANARD? Dataset • ߣ૩ ޙী ೧ࢲ݅ Үܳ ࣻ೯ೞݶ ച
ղীࢲ history dependenciesਸ Ӓ۽ ਬೞݶࢲ ചо self-contained • QuAC test set ҕѐغয ঋӝ ٸޙী QuAC dev setਸ ਊೞৈ CANARD test setਸ ݅ٞ • ژೠ QuAC train set 10%ܳ dev۽ ഝਊ. • CANARDী হח QuAC ޙ ತӝ೮ਵݴ ܳ ਊೠ ࢤ ؘఠ ੋ OR-QuAC ؘఠ ా҅ח җ э.
Model Open-Retrieval Conversational Question Answering
ݽ؛ Retriever, Reranker, Reader۽ ա Model
ݽ؛ Retriever, Reranker, Reader۽ ա Model
Passage Retriever Dataset • Passage Encoder • Question Encoder •
Retrieval Score
Retrieval score ӝળਵ۽ ࢚ਤ top-Kѐ ޙࢲܳ rerank৬ reader۽ ׳ Model
ݽ؛ Retriever, Reranker, Reader۽ ա Model
Reranker& Reader Encoding Dataset • Input • Contextualized Representations •
sequence representation
Reranker& Reader Dataset • Sequence Representation • Reranker (W_rr is
vector) • Reader (span prediction)
Training Open-Retrieval Conversational Question Answering
Retriever pretraining Training • retrieval scores for the batch •
to maximize the probability of the gold passage for each question • Pretraining loss Pretraning റী passage encoderח offlineਵ۽ ك. Faissܳ ࢎਊ೧ࢲ Ѿҗܳ ࡳই১.
Concurrent Learning Training • Retriever loss • Reranker loss •
Reader loss
Inference Training • Retrieval Ѿҗ Top-K ޙࢲܳ ݽف ੋಌ۠झ ೞৈ
п ޙࢲ߹ spanਸ ஏ • Retriever loss + Reranker loss + Reader lossо ઁੌ ޙࢲ spanਸ ୭ઙ ਵ۽ ஏ
RESULTS Open-Retrieval Conversational Question Answering
Competing Method RESULTS • DrQA : TF-IDF + RNN based
reader • BERTserini : BM25 + BERT reader • ORConvQA without history : our method + window size 0 • ORConvQA : our method • Evaluation Metric : word level F1, human equivalence score (HEQ), Mean Reciprocal Rank(MRR), Recall
DrQA < BERTserini < Ours w/o hist < Ours RESULTS
Ablation study RESULTS
History windows size ઑ RESULTS
хࢎפ✌ ୶о ޙ ژח ҾӘೠ ݶ ઁٚ ইې োۅ۽
োۅ ࣁਃ! ࢲ࢚ (ܻࢲ ࢎ౭झ, ೝಯ)
[email protected]
Linked in. @pingpong