Real-world NLP Tasks
Text summarization (E)
● pointer-generator network with attention
● Science Dailyからクロールしたデータ
○ extracted 60,900 Web pages
○ (i) s2s, story to summary
○ (ii) sh2s, shuffled story to summary
○ (iii) s2t, story to title
○ (iv) oods2s, outof-domain testing for s2s
● CNN/ Daily Mail corpus
○ train/ dev/test : 287,226/13,368/11,490
text–summary pairs
17