LLMによる日本語ニュース記事の平易化 / Japanese News Articles Simplification via Large Language Models

Slide 1

Slide 1 text

Simplifying Japanese News Articles with LLMs
Toru Urakawa (浦川通), Media R&D Center, Media Business Division, The Asahi Shimbun Company

Slide 8

Slide 8 text

Text simplification with LLMs

Sentence Simplification via Large Language Models
Yutao Feng, Jipeng Qiang, Yun Li, Yunhao Yuan, and Yi Zhu
College of Information Engineering, Yangzhou University
[email protected], {jpqiang, liyun, yhyuan, zhuyi}@yzu.edu.cn
arXiv:2302.11957v1 [cs.CL] 23 Feb 2023
https://arxiv.org/abs/2302.11957

Abstract
Sentence Simplification aims to rephrase complex sentences into simpler sentences while retaining original meaning. Large Language models (LLMs) have demonstrated the ability to perform a variety of natural language processing tasks. However, it is not yet known whether LLMs can be served as a high-quality sentence simplification system. In this work, we empirically analyze the zero-/few-shot learning ability of LLMs by evaluating them on a number of benchmark test sets. Experimental results show LLMs outperform state-of-the-art sentence simplification methods, and are judged to be on a par with human annotators.

1 Introduction
Sentence Simplification (SS) is a task of rephrasing a sentence into a new form that is easier to read and understand while retaining its meaning, which can be used for increasing accessibility for people with dyslexia (Rello et al., 2013), autism (Evans et al., 2014) [...] et al., 2020; Thoppilan et al., 2022; Chowdhery et al., 2022). Nevertheless, it remains unclear how LLMs perform in SS task compared to current SS methods.

To address this gap in research, we undertake a systematic evaluation of the Zero-/Few-Shot learning capability of LLMs, by assessing their performance on existing SS benchmarks. We carry out an empirical comparison of the performance of ChatGPT and the most advanced GPT3.5 model (text-davinci-003). To the best of our knowledge, this is the first study of LLMs’s capabilities on SS task, aiming to provide a preliminary evaluation, including simplification prompt, multilingual simplification, and simplification robustness.

The key findings and insights are summarized as follows:
(1) GPT3.5 or ChatGPT based on one-shot learning outperform the state-of-the-art SS methods. We found that these models excel at deleting non-essential information and adding new information, while existing supervised SS methods tend to preserve the content without change.
(2) ChatGPT is a monolithic model capable of supporting multiple languages, which makes it a comprehensive multilingual text simplification technique. After evaluating the performance of ChatGPT on the task [...]

Slide 9

Slide 9 text

Text simplification with LLMs

Sentence Simplification via Large Language Models (Feng et al., arXiv:2302.11957v1 [cs.CL], 23 Feb 2023)

Article simplification with LLMs (GPT, ChatGPT)
Prompt: "Simplify it without changing the meaning"
High accuracy in both automatic and human evaluation
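
The zero-shot and one-shot prompting setup described on this slide can be sketched roughly as follows. This is a minimal illustration, not the prompt used in the paper or the talk: the instruction wording is a paraphrase of the slide, and `INSTRUCTION`, `build_prompt`, and the "Complex:/Simple:" labels are hypothetical names chosen for this example. No model is called; the function only assembles the text that would be sent to an LLM.

```python
from typing import Optional, Tuple

# Assumption: paraphrase of the slide's prompt, not the exact wording used in the paper.
INSTRUCTION = "Simplify the following sentence without changing its meaning."


def build_prompt(sentence: str, example: Optional[Tuple[str, str]] = None) -> str:
    """Build a zero-shot prompt, or a one-shot prompt when `example` is given.

    `example` is a (complex sentence, simplified sentence) pair shown to the
    model as a demonstration before the sentence to be simplified.
    """
    parts = [INSTRUCTION]
    if example is not None:
        complex_text, simple_text = example
        parts.append(f"Complex: {complex_text}\nSimple: {simple_text}")
    # The final block is left open so the model completes the simplified sentence.
    parts.append(f"Complex: {sentence}\nSimple:")
    return "\n\n".join(parts)


if __name__ == "__main__":
    demo = (
        "The committee postponed the deliberations indefinitely.",
        "The committee delayed the talks with no new date.",
    )
    print(build_prompt("The legislation was enacted despite vociferous opposition.", demo))
```

Passing `example=None` gives the zero-shot variant; supplying one demonstration pair gives the one-shot variant that the paper reports as the strongest setting.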
