The Computational Linguistics Summarization Pilot Task

Slide 1

Slide 1 text

The Computational Linguistics Summarization Pilot Task @ TAC 2014 Kokil Jaidka*, Muthu Kumar Chandrasekaran, Rahul Jha, Christopher Jones, Min-Yen Kan, Ankur Khanna, Diego Molla- Aliod, Dragomir R. Radev, Francesco Ronzano, Horacio Saggion * Nanyang Technological University, Singapore koki0001@e.ntu.edu.sg

Slide 2

Slide 2 text

Photo Credits Dennis Jarvis @flickr Scientific Document Summarization I have an abstract 2 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 18 November 2014 I am done!

Slide 3

Slide 3 text

3 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 18 November 2014 Bornmann & Mutz, 2012 Exponential growth!

Slide 4

Slide 4 text

• CL-Summ so far • Citation based extractive summaries • Faceted summaries • Automatic literature review • ACL corpus • The CL-Summ Shared Task • TAC 2015: CL-Summ track • Acknowledgements Outline 18 November 2014 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 4

Slide 5

Slide 5 text

Scientific Document Summarization • Abstract – Authors’ own summary • Citation summary – Community creates a summary when citing • Faceted summary – Capture all aspects of a paper 5 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 18 November 2014

Slide 6

Slide 6 text

18 November 2014 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 6 Framework for Scientific document summarization

Slide 7

Slide 7 text

7 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 18 November 2014 Citations 1) select papers and 2) identify salient parts of the cited paper.

Slide 8

Slide 8 text

Scientific Document Summarization Citation based extractive summaries Scope of Citation • Qazvinian, V., and Radev, D. R. “Identifying non-explicit citing sentences for citation-based summarization” (ACL, 2010) • Abu-Jbara, Amjad, and Dragomir Radev. “Reference scope identification in citing sentences.” (ACL, 2012) 8 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 18 November 2014

Slide 9

Slide 9 text

18 November 2014 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 9 Citation summary Image credits Ken Ammi @flickr Citations are like celebrity quotes taken out of context

Slide 10

Slide 10 text

10 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 18 November 2014 Scientific summaries are written to fulfill information & argumentative functions

Slide 11

Slide 11 text

Scientific Document Summarization Citation-based extractive summaries Scope of Citation • Qazvinian, V., and Radev, D. R. “Identifying non-explicit citing sentences for citation-based summarization” (ACL, 2010) • Abu-Jbara, Amjad, and Dragomir Radev. “Reference scope identification in citing sentences.” (ACL, 2012) Coherence • Abu-Jbara, Amjad, and Dragomir Radev. “Coherent citation-based summarization of scientific papers.” (ACL 2011) 11 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 18 November 2014

Slide 12

Slide 12 text

TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 18 November 2014 12 Faceted summaries are structured abstracts. Common in domains such as Medicine, Biomedical, Bioinformatics

Slide 13

Slide 13 text

18 November 2014 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 13 Argumentative zones as facets demarcate new contributions of a paper from background work.

Slide 14

Slide 14 text

• Community concurs that a citation based summary of a scientific document is important to create • Citing papers cite different points of the same reference paper • Assigning facets to these citances may help create coherent summaries In summary, 18 November 2014 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 14

Slide 15

Slide 15 text

Outline • CL-Summ ground work done so far • The CL-Summ corpus, task evaluation • Highlights • Annotation • Evaluation results • TAC 2015: CL-Summ track • Acknowledgements 15 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 18 November 2014

Slide 16

Slide 16 text

CL-Summ Pilot: highlights First corpus in the computational linguistics community incorporating prior research on citation based summaries •10 teams registered • 3 teams participated in the evaluation • 2 teams submitted their systems’ performance • 1 more proposed algorithms to solve the tasks 16 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 18 November 2014

Slide 17

Slide 17 text

17 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 18 November 2014

Slide 18

Slide 18 text

CL training Corpus • 10 reference papers or topics randomly sampled from the ACL live anthology • Up to 10 citing papers per reference paper including those outside ACL live anthology • Annotated corpus publicly available https://github.com/WING-NUS/scisumm-corpus/ 18 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 18 November 2014

Slide 19

Slide 19 text

Annotation Pipeline 18 November 2014 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 19 Annotation! Post Processing to BiomedSumm format OCR & Section Parse ParsCit‘s SectLabel module

Slide 20

Slide 20 text

• 3 annotators • Released data has one gold standard annotation per topic or reference paper • Discourse facet has a minor change from Biomedsumm’s categories Annotating the SciSumm corpus 20 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 18 November 2014 ………….. ………..

Slide 21

Slide 21 text

Task 1A: Identify the text span in the RP which corresponds to the citances from the CP. Tasks 21 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 18 November 2014 Citing papers Citing text is called citance Reference Paper (RP) Citing paper (CP) Match the citing text in the CP to text in the RP

Slide 22

Slide 22 text

Tasks Task 1B: Identify the discourse facet for every cited text span from a predefined set of facets. 22 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 18 November 2014 Classify the cited text in RP into one of several facets Reference Paper (RP) Citing paper (CP) CPs

Slide 23

Slide 23 text

Tasks Task 2: Generate a faceted summary of up to 250 words, of the reference paper, using itself and the citing papers. 23 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 18 November 2014 Summary of RP Reference Paper (RP) Use citances and the RP to create a summary Task 2 Citing paper (CP)

Slide 24

Slide 24 text

Evaluation Small corpus: 10 fold cross validated evaluation over the 10 documents • Task 1A scored by ROUGE-L metric • Task 1B scored by classification metrics: Precision, Recall and F1 • Task 2 also scored by ROUGE-L metric TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 24 18 November 2014

Slide 25

Slide 25 text

Results – Task 1A TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 25 18 November 2014 MQ Clair_UMich Precision Recall F 1 Precision Recall F 1 0.212 0.335 0.223 0.444 0.574 0.487 • MQ was unsupervised while Clair_Umich was supervised • Challenging classification problem: Task seeks to map each citation sentence with a few out of 100s of potential matches in the Reference paper (RP) • Lexical, semantic and structural similarities between citances and RP sentences somewhat help

Slide 26

Slide 26 text

Results – Task 1A TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 26 18 November 2014 Paper ID MQ Clair_UMich C90_2039 0.235 0.635 C94_2154 0.288 0.536 E03_1020 0.239 0.478 H05_1115 0.350 0.375 H89_2014 0.332 0.546 J00_3003 0.196 0.559 J98_2005 0.101 0.344 N01_1011 0.221 0.498 P98_1081 0.200 0.367 X96_1048 0.248 0.535 Large deviation in scores, across topics, from both systems

Slide 27

Slide 27 text

Results – Task 2 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 27 18 November 2014 Paper ID MQ (using Task 1A MMR) C90_2039 0.293 C94_2154 0.120 E03_1020 0.196 H05_1115 0.321 H89_2014 0.320 J00_3003 0.367 J98_2005 0.233 N01_1011 0.284 P98_1081 0.206 Average 0.260 ROUGE-L scores here measure overlap over the abstract since we did not have human summaries Low scores could be due to deviation between summary of citances and the abstract of the paper

Slide 28

Slide 28 text

Citing text: “The line of our argument below follows a proof provided in… for the maximum likelihood estimator based on nite tree distributions.” False negative: “We will show that in both cases the estimated probability is tight.” Errors – Task 1A 18 November 2014 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 28 Clair_UMich MQ Target text from RP: “The work described here also makes use of hidden Markov model.” False positive: “The statistical methods can be described in terms of Markov models.”

Slide 29

Slide 29 text

Learning from the Pilot Task • Offset mismatch between the text file and the XML that annotators used – Corpus sentence segmented and sentences assigned a sentence ID • Problems in post-processing non-contiguous annotated reference spans. • Character offsets can be miscounted by different parsers • Handling non-UTF8 characters 18 November 2014 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 29

Slide 30

Slide 30 text

Limitations of this corpus • No gold standard citation based summaries • OCR errors: • The use of “...” where text spans are snippets • Errors in citation/reference offset numbers • Different text encodings • Errors in file construction • Small size of corpus! 18 November 2014 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 30

Slide 31

Slide 31 text

Acknowledgements • NIST and Hoa Dang • Lucy Vanderwende, MSR, • Anita de Ward, Elsevier Data Services • Kevin B. Cohen, Prabha Yadav (U. Colorado, Boulder) • Horacio Saggion for detailed bug report on the corpus • Rahul Jha (U. Mich, Ann Arbor) • All BiomedSumm track participants 31 TAC BiomedSumm: The Computational Linguistics Summarization Pilot Task 18 November 2014 Questions? Thank you!