Towards Diverse and Fair Language Generation -- Teaching ChatGPT to be “nice” (Inaugural Professorial Lecture -- 07 Nov 2024)

Slide 1

Slide 1 text

Towards Diverse and Fair Language Generation -- Teaching ChatGPT to be nice Danushka Bollegala Inaugural Lecture

Slide 90

Slide 90 text

Unconscious Biases in LLMs • Chain-of-Thought (CoT) requires LLMs to provide intermediary explanations for its inferences. • Can CoT make LLMs aware of their unconscious social biases? 24 der Bias in Large Language Models MNLP submission Figure 1: Example of multi-step gender bias reasoning task. Kojima et al., 2022). 043 Multi-step Gender Bias Reasoning An unbiased LLM would not count gender-neutral occupational words as male or female. CoT instruction: Lets think Step-by-Step opt-125m 16.2 / 14.0 5.2 / 3.0 16.2 / 14.0 5.2 / 3.0 2.0 / 8.0 0.0 / 1.6 opt-350m 9.0 / 15.2 0.6 / 6.8 9.0 / 15.2 0.6 / 6.8 1.1 / 0.6 -0.9 / 1.2 opt-1.3b 2.6 / 0.6 2.6 / 1.0 2.6 / 0.6 2.6 / 1.0 -0.4 / -0.2 -0.6 / -0.4 opt-2.7b 14.8 / 17.0 3.4 / 2.8 14.8 / 17.0 3.4 / 2.8 0.0 / 0.2 1.8 / 0.0 opt-6.7b 7.6 / 2.6 5.8 / 1.7 7.6 / 2.6 5.8 / 1.7 0.4 / 0.2 0.0 / 0.5 opt-13b 17.0 / 23.6 4.8 / 0.4 17.0 / 23.5 4.8 / 0.4 0.0 / 0.0 2.0 / 0.4 opt-30b 23.2 / 25.4 6.2 / 6.6 23.0 / 25.2 6.1 / 6.4 0.0 / 0.0 0.0 / 0.0 opt-66b 25.6 / 31.2 17.6 / 25.0 25.3 / 30.9 17.4 / 25.0 0.0 / 0.0 0.0 / 0.0 gpt-j-6B 5.8 / 6.4 3.2 / 0.6 5.8 / 6.4 3.2 / 0.6 0.6 / 0.2 0.0 / 0.0 mpt-7b 1.8 / 1.8 0.8 / 5.0 1.8 / 1.8 0.8 / 5.0 0.4 / 0.6 17.0 / 15.2 mpt-7b-inst. 5.4 / 4.8 6.0 / 3.6 5.4 / 4.8 6.0 / 3.6 5.8 / 6.6 12.6 / 11.0 falcon-7b 2.8 / 4.0 0.2 / 0.4 2.8 / 4.0 0.2 / 0.4 0.0 / 8.6 0.0 / 0.0 falcon-7b-inst. 2.2 / 3.2 5.0 / 3.8 2.2 / 3.2 5.0 / 3.8 0.0 / 0.0 0.0 / 0.0 gpt-neox-20b 33.2 / 33.8 -0.1 / 3.0 33.0 / 33.6 0.0 / 2.9 0.0 / 0.0 7.4 / 3.0 falcon-40b 34.0 / 29.0 2.0 / 3.0 34.0 / 29.0 1.9 / 3.0 7.6 / 3.0 -0.2 / 0.0 falcon-40b-inst. 5.2 / 3.6 3.4 / 3.7 4.9 / 3.4 3.3 / 3.5 2.2 / 3.4 1.7 / 2.5 bloom 40.2 / 28.0 12.0 / 11.0 40.0 / 27.7 11.9 / 11.0 7.4 / 4.2 5.4 / 2.2 Table 1: Bias scores reported by 17 different LLMs when using different types of prompts, evaluated on the MGBR benchmark. Female vs. Male bias scores are separated by ‘/’ in the Table. and is used as a pro-stereotypical text. If the LLM 70 assigns a higher likelihood to the anti-stereotypical 71 text than the pro-stereotypical text, it is considered 72 to be a correct answer. Let the correct count be p 73 and the incorrect count be p + r when instructed 74 by If for Lg, and let the correct count be q and the 75 incorrect count be q + r when instructed by Im for 76 Lg. Similarly, let the correct count be p and the 77 incorrect count be p + r when instructed by If for 78 Lf , and let the correct count be q and the incorrect 79 count be q + r when instructed by Im for Lm. 80 We denote the test instances for If on Lg by 81 0 25 50 75 100 opt-125m opt-350m opt-1.3b opt-2.7b opt-6.7b opt-13b opt-30b opt-66b Few-shot Few-shot+Debiased Few-shot+CoT Figure 2: Accuracy of the Few-shot, Few-shot+CoT, accuracy

Slide 94

Slide 94 text

GenAI and Diversity • We have 8B unique humans in the world, talking to a handful of LLMs • Given the cultural background, socio-economic, ethnic factors and the mood of the opponent, LLMs need to generate diverse responses even when the same questions are being asked from di ff erent humans. 25 Candle: Extracting Cultural Commonsense Knowledge at Scale [Nguyen+ 23] Fish and chips is a popular dish in the UK. 0.71 The majority of sentences are about meat, speci fi cally British meat. 0.68 Mince pies are a traditional British Christmas dessert made with fruit and spices. 0.67 Sticky to ff ee pudding is a classic British dessert made with dates and molasses. 0.66 Christmas crackers are a British tradition that is enjoyed by many during the Christmas season. 0.65 FareShare is a UK-based charity fi ghting hunger and food waste. 0.65 The most popular dish in Britain is chicken tikka masala. 0.64 Cottage pie is a British savory pie, typically made with ground beef and a mashed potato crust. 0.64 Puddings are a typical British dish which has been around for centuries. 0.64 The UK has a food waste problem, with seven million tonnes of food waste generated annually. 0.64 Okonomiyaki is a savory Japanese pancake or omelette, made with rice fl our and vegetables. 0.79 Miso soup is a popular and staple dish in Japanese cuisine. 0.78 Miso soup is a popular dish in Japan that is often eaten with meals. 0.73 Natto is a traditional Japanese dish made from fermented soybeans. 0.73 Udon noodles are thick Japanese noodles made of wheat fl our. 0.71 Soba noodles are a Japanese noodle made from buckwheat. 0.7 Shabu shabu is a Japanese hot pot dish. 0.7 Tempura is a Japanese dish of deep- fried fi sh or vegetables. 0.7 Sushi is a popular food in Japan that is often seen as a symbol of Japanese culture. 0.69 Persimmons are a popular fruit in Japan that have many di ff erent uses. 0.69

Slide 95

Slide 95 text

GenAI and Diversity • We have 8B unique humans in the world, talking to a handful of LLMs • Given the cultural background, socio-economic, ethnic factors and the mood of the opponent, LLMs need to generate diverse responses even when the same questions are being asked from di ff erent humans. 25 Candle: Extracting Cultural Commonsense Knowledge at Scale [Nguyen+ 23] (PPEOJHIUBUQN <4IXBU[`> Fish and chips is a popular dish in the UK. 0.71 The majority of sentences are about meat, speci fi cally British meat. 0.68 Mince pies are a traditional British Christmas dessert made with fruit and spices. 0.67 Sticky to ff ee pudding is a classic British dessert made with dates and molasses. 0.66 Christmas crackers are a British tradition that is enjoyed by many during the Christmas season. 0.65 FareShare is a UK-based charity fi ghting hunger and food waste. 0.65 The most popular dish in Britain is chicken tikka masala. 0.64 Cottage pie is a British savory pie, typically made with ground beef and a mashed potato crust. 0.64 Puddings are a typical British dish which has been around for centuries. 0.64 The UK has a food waste problem, with seven million tonnes of food waste generated annually. 0.64 Okonomiyaki is a savory Japanese pancake or omelette, made with rice fl our and vegetables. 0.79 Miso soup is a popular and staple dish in Japanese cuisine. 0.78 Miso soup is a popular dish in Japan that is often eaten with meals. 0.73 Natto is a traditional Japanese dish made from fermented soybeans. 0.73 Udon noodles are thick Japanese noodles made of wheat fl our. 0.71 Soba noodles are a Japanese noodle made from buckwheat. 0.7 Shabu shabu is a Japanese hot pot dish. 0.7 Tempura is a Japanese dish of deep- fried fi sh or vegetables. 0.7 Sushi is a popular food in Japan that is often seen as a symbol of Japanese culture. 0.69 Persimmons are a popular fruit in Japan that have many di ff erent uses. 0.69

Slide 98

Slide 98 text

Example Generations 28 efault+MoE 91.2 84.6 9.7 60.3 66.5 60.0 51.2 40.6 34.8 72.9 51.6 62.3 versiﬁed+MoE 86.7 80.4 9.8 63.3 59.2 53.5 50.7 40.6 34.0 71.3 56.3 55.0 CD+MoE 91.1 82.6 9.8 64.8 59.0 51.1 52.4 42.2 34.5 73.5 58.7 62.3 Table 4: Downstream evaluation of the LLM-generated sentences. Top block methods use human-generated esources for training, while the ones in the bottom block are trained on LLM-generated sentences. MoE approaches re shown in the middle block and bottom block. BART-large is used as the generator for MoE-based methods. Best results for each metric are shown in bold, while the best performing MoE for quality is shown in underline. Human: • The group will use the tool to make a piece of art out of metal. • I use a tool to cut a piece of metal out of the car. • The man used a piece of metal and the tools. Default: • A piece of metal is being used as a tool. • A metal tool is being used to shape a piece. • A metal tool is being used to work on a piece. ICD: • A tool is being utilized to manipulate a piece of metal. • Metal is being shaped using a specific tool. • The use of a tool is necessary to work with a piece of metal. CommonGen: Input: (piece, use, tool, metal) Human: • A pizza parlor wouldn't have workout equipment, and sells fattening food. • A pizza parlor is not a good place to exercise. • Pizza parlors do not have exercise equipment. Default: • Pizza parlors are not typically associated with exercise or physical activity. • Pizza parlors are not typically associated with exercise or physical activity. • Pizza parlors are not places for exercise, they are places to eat pizza. ICD: • People usually go to a gym, park or fitness center to exercise, not a pizza parlor. • Pizza parlors are not typically associated with exercise. • Exercise is not typically done at a pizza parlor. ComVE: Input: If a person wants to exercise, they go to a pizza parlor. Figure 4: Sentences generated by default prompt and ICD against those by humans on CommonGen and ComVE est instances. ICD generates more diverse and high quality sentences than default. .3 Diversity-Awareness of LLMs Given that we use LLMs to produce diverse genera- ions via ICL, it remains an open question whether n LLM would agree with humans on the diversity diagonal quadrants and a Cohen’s Kappa of 0.409 indicating a moderate level of agreement between GPT and human ratings for diversity. The generated sentences using the de- Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning, Zhang, Peng, and Bollegala. Empirical Methods in Natural Language Processing (EMNLP), 2024.

Slide 1

Slide 1 text

Slide 2

Slide 2 text

Slide 3

Slide 3 text

Slide 4

Slide 4 text

Slide 5

Slide 5 text

Slide 6

Slide 6 text

Slide 7

Slide 7 text

Slide 8

Slide 8 text

Slide 9

Slide 9 text

Slide 10

Slide 10 text

Slide 11

Slide 11 text

Slide 12

Slide 12 text

Slide 13

Slide 13 text

Slide 14

Slide 14 text

Slide 15

Slide 15 text

Slide 16

Slide 16 text

Slide 17

Slide 17 text

Slide 18

Slide 18 text

Slide 19

Slide 19 text

Slide 20

Slide 20 text

Slide 21

Slide 21 text

Slide 22

Slide 22 text

Slide 23

Slide 23 text

Slide 24

Slide 24 text

Slide 25

Slide 25 text

Slide 26

Slide 26 text

Slide 27

Slide 27 text

Slide 28

Slide 28 text

Slide 29

Slide 29 text

Slide 30

Slide 30 text

Slide 31

Slide 31 text

Slide 32

Slide 32 text

Slide 33

Slide 33 text

Slide 34

Slide 34 text

Slide 35

Slide 35 text

Slide 36

Slide 36 text

Slide 37

Slide 37 text

Slide 38

Slide 38 text

Slide 39

Slide 39 text

Slide 40

Slide 40 text