question q, answer a, • train: (q ~ a) • KB: (a ~ a’) <- not guarantee (previous methods) • goal: (q ~ a’) • OOV when test • For words which are not contained in train dataset • Incorporate KB (KB + train embedding) • Alleviated OOV problem !5 KB-QA System Motivation: Problems
• type: • context (1-hop entity) • • Ek ae to ee ar to er at to et ac = (c1 , c2 , . . . , cm ) to (ec1 , ec2 , . . . , ecm ) ec = 1 m m ∑ i=1 eei !14 Cross-Attention Model Answer Aspect Representation answer entity entity embedding answer relation relaltion embedding answer type type embedding answer context context embedding
2. Reread the question 3. Find which pard is focused (handling attention) 4. Go to next aspect 5. Reread the question 6. While all aspect is utilized • Scoring • Get “question - answer” score for all answer entities • Final score = weighted sum of them !15 Cross-Attention Model Process of Cross-Attention Question entity relation type context score score score Score
with unique answer (O) • Inference stage • • Smax = arg max a∈Cq {S(q, a)} A = { ̂ a|Smax − S(q, ̂ a) < γ} !20 Other Techniques Answer set for inference
Consider relation as translation in embedding space • Training loss • Set of KB facts • (randomly sampled) Set of corrupted fact • distance • • Train KB-QA and TransE in turns (s, p, o) ∈ S (s′, p, o′) ∈ S′ d(s + p, o) = ||s + p − o||2 2 Lk = ∑ (s,p,o)∈S ∑ (s′,p,o′)∈S′ [γk + d(s + p, o) − d(s′+ p, o′)]+ !21 Other Techniques Combining Global Knowledge
• Training: 3778 q-a pairs • Testing: 2032 q-a pairs • Collected from Google Suggest API • Answers are manually labeled by Amazon MTurk • All answers are from Freebase !24 Experiment Settings Main Task
: Michael S. Dell • who was darth vader in episode 3? : Hayden Christensen • where is the time zone in florida? : North American Eastern Time Zone • what does donald trump own? : Trump Tower • what year did michael jordan get drafted? : 1984 NBA Draft • who plays saruman in lord of the rings? : Christopher Lee • which team does ronaldinho play for 2013? : Brazil national football team • what undergraduate school did martin luther king jr. attend? : Morehouse College • where did will smith go to high school? : Overbrook High School • what is south korea's capital city? : Seoul • who is ruling north korea now? : Kim Jong-un • where do samsung lions play? : Daegu Baseball Stadium • Single answer, W5 question (who, when, where, what, which) !25 Experiment Settings Samples of WebQuestion
Methods • Bordes et al., 2014b • BOW to obtain single vector for question and answer • Bordes et al., 2014a • Subgraph embedding + BOW • Yang et al., 2014 • SP-based + map entities with relation from KB • Dong et al., 2015 • Use three CNNs to three aspects • Bordes et al., 2015 (+ improved by Sukhaatar et al., 2015) • Put KB-QA into Memory Networks framework !28 Results and Analysis Comparison with other approaches
• C-Attention (Cross-) • Global Knowledge Information • No GKI • Apply GKI • Improvement by each component • uni-LSTM to Bi-LSTM: 0.9 • No ATT to A-Q-ATT: 1.5 ~ 2.2 • A-Q-ATT to C-ATT : 0.2 ~ 0.3 • No GKI to GKI: 1 ~ 1.3 !29 Results and Analysis Modal Analysis
that Justin Bieber wrote? • answer type: /music/composition -> strong attention in “What” rather than “songs” • Probably due to bias of training data • Complex questions (35%) • Q: When was the last time Knicks won the championship? • predicted: all championships • Cannot learn what “last” mean • Labelling Error (3%) • Q: What college did John Nash teach at? • labeled answer: Princeton University • real answer: Massachusetts Institute of Technology !31 Results and Analysis Error Analysis
in question • Attention weights toward answer aspects • Dynamic representation is more precise and flexible • Leverage Global KB • Take full advantage of complete KB • Alleviate OOV problem • Results • Get state-of-the-art among end-to-end methods !33 Conclusion Summary