Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Towards Diverse and Fair Language Generation --...

Towards Diverse and Fair Language Generation -- Teaching ChatGPT to be “nice” (Inaugural Professorial Lecture -- 07 Nov 2024)


I began my research journey in Natural Language Processing (NLP) by developing algorithms to summarise news articles. At the time, I never imagined that, within less than two decades, we would have AI assistants capable of not only summarising but also generating a wide range of texts—from creative writing to technical reports and news. Tools like ChatGPT, Gemini, and other AI assistants have become an integral part of our daily lives.

For the first time in human history, we are sharing our world with entities that can process and generate information faster than we can. As an NLP researcher, I am thrilled by the immense progress our field has made and the transformative impact on society that I have witnessed within my career. Yet, I find myself increasingly concerned about some unintended consequences: the presence of social biases, lack of diversity in AI-generated content, and the ethical responsibilities we, as researchers, must shoulder.

This brings us to the question: What kind of future do we envision with our intelligent counterparts? More importantly, how can we, as researchers, ensure that these AI assistants act in ways that promote fairness, inclusivity, and ethical decision-making? In this lecture, I will explore our responsibility to build diverse and fair language generation systems and discuss the steps we can take to ‘teach’ AI assistants to be responsible, equitable collaborators in the human-AI future.

Danushka Bollegala

November 09, 2024
Tweet

More Decks by Danushka Bollegala

Other Decks in Research

Transcript

  1. Towards Diverse and Fair Language Generation -- Teaching ChatGPT to

    be nice Danushka Bollegala Inaugural Lecture
  2. A Brief History of my Academic Life 2  

    d 6OJWPG5PLZP +BQBO
  3. A Brief History of my Academic Life 2  

    d d 6OJWPG5PLZP +BQBO
  4. A Brief History of my Academic Life 2  

    d d d 6OJWPG5PLZP +BQBO
  5. A Brief History of my Academic Life 2  

    d d d d 6OJWPG5PLZP +BQBO
  6. A Brief History of my Academic Life 2  

    d d d d
  7. A Brief History of my Academic Life 2  

    d d d d d
  8. A Brief History of my Academic Life 2  

    d d d d d
  9. A Brief History of my Academic Life 2  

    d d d d d d
  10. A Brief History of my Academic Life 2  

    d d d d d d
  11. What is Natural Language Processing? • I have been a

    researcher in NLP for nearly two decades (my fi rst NLP paper was published in 2005) 3
  12. What is Natural Language Processing? • I have been a

    researcher in NLP for nearly two decades (my fi rst NLP paper was published in 2005) 3
  13. What is Natural Language Processing? • I have been a

    researcher in NLP for nearly two decades (my fi rst NLP paper was published in 2005) 3
  14. What is Natural Language Processing? • I have been a

    researcher in NLP for nearly two decades (my fi rst NLP paper was published in 2005) • Natural Language Processing: 3
  15. What is Natural Language Processing? • I have been a

    researcher in NLP for nearly two decades (my fi rst NLP paper was published in 2005) • Natural Language Processing: • The branch of Computer Science that is concerned with developing algorithms to process languages spoken by humans (human vs. programming languages). 3
  16. What is Natural Language Processing? • I have been a

    researcher in NLP for nearly two decades (my fi rst NLP paper was published in 2005) • Natural Language Processing: • The branch of Computer Science that is concerned with developing algorithms to process languages spoken by humans (human vs. programming languages). • Applications: Information Retrieval/Extraction, Machine Translation, Text Summarisation, Question Answering, Dialogue Systems, … 3
  17. What is Natural Language Processing? • I have been a

    researcher in NLP for nearly two decades (my fi rst NLP paper was published in 2005) • Natural Language Processing: • The branch of Computer Science that is concerned with developing algorithms to process languages spoken by humans (human vs. programming languages). • Applications: Information Retrieval/Extraction, Machine Translation, Text Summarisation, Question Answering, Dialogue Systems, … 3
  18. From Turing’s paper 5 GPT says that the rook is

    in h1 which is wrong (and in fact the position would be already checkmate in that case), rook could be either in h8 or a8. Then the check-mate would be R8, (h8 or a8 depending on the initial position). 
 
 Jose-Camacho Collados (International Master)
  19. Evolution of NLP: 1960~1970 6 3VMFCBTFE4ZTUFNT "VUPNBUJD-BOHVBHF1SPDFTTJOH "EWJTPSZ$PNNJUUFF "-1"$ SFQPSUPG

    TUBUFEUIBU.BDIJOF5SBOTMBUJPOJT BXBTUFPG64HPWUSFTFBSDITQFOEJOH  XIJDISFTVMUFEJOBGVOEJOHTUPQGPS /-1
  20. Evolution of NLP: 1970~1980 7 Pick up a big red

    block. OK. (On the screen, the robot arm swings into action. Two red blocks are visible, one small, one large, as on figure above. The large one has a green cube stacked on top of it. The robot first transfers the green cube to the table top, and then picks up the red block.) Find a block which is taller than the one you are holding and put it into the box. BY 'IT', I ASSUME YOU MEAN THE BLOCK WHICH IS TALLER THAN THE ONE I AM HOLDING. (SHRDLU puts down the red block, picks up a taller blue one, and places it in the box.) What does the box contain? THE BLUE PYRAMID AND THE BLUE BLOCK. What is the pyramid supported by? THE BOX. 4)3%-6 5FSSZ8JOPHSBE
  21. Evolution of NLP: 2000~2010 10 1SPCBCJMJTUJD.PEFMT  -BUFOU%JSJDIMFU"MMPDBUJPO  

     (SBQIJDBM.PEFMTJO/-1  $POEJUJPOBM3BOEPN'JFMET $3'T  * BUF BO BQQMF ZFTUFSEBZ 130 7#% %&5 /06/ /06/ *UPVDIFEBDPNQVUFSGPSUIF fi STUUJNF JONZMJGFJO *XBT 
  22. Evolution of NLP: 2000~2010 10 1SPCBCJMJTUJD.PEFMT  -BUFOU%JSJDIMFU"MMPDBUJPO  

     (SBQIJDBM.PEFMTJO/-1  $POEJUJPOBM3BOEPN'JFMET $3'T  * BUF BO BQQMF ZFTUFSEBZ 130 7#% %&5 /06/ /06/ *UPVDIFEBDPNQVUFSGPSUIF fi STUUJNF JONZMJGFJO *XBT 
  23. My NLP Reserch 12 3FQSFTFOUBUJPO-FBSOJOH -FYJDBM$PNQPTJUJPOBM4FNBOUJDT ,OPXMFEHF(SBQI &NCFEEJOHT .FUB&NCFEEJOHT ʜ

    "EBQUBUJPO EPNBJOT MBOHVBHFT  NPEBMJUJFT (FOFSBUJPO DPNNPOTFOTF EJWFSTJ f i
  24. My NLP Reserch 12 3FQSFTFOUBUJPO-FBSOJOH -FYJDBM$PNQPTJUJPOBM4FNBOUJDT ,OPXMFEHF(SBQI &NCFEEJOHT .FUB&NCFEEJOHT ʜ

    "EBQUBUJPO EPNBJOT MBOHVBHFT  NPEBMJUJFT (FOFSBUJPO DPNNPOTFOTF EJWFSTJ f i 4PDJBM#JBTFT EFUFDUJPONJUJHBUJPO
  25. My NLP Reserch 12 3FQSFTFOUBUJPO-FBSOJOH -FYJDBM$PNQPTJUJPOBM4FNBOUJDT ,OPXMFEHF(SBQI &NCFEEJOHT .FUB&NCFEEJOHT ʜ

    "EBQUBUJPO EPNBJOT MBOHVBHFT  NPEBMJUJFT (FOFSBUJPO DPNNPOTFOTF EJWFSTJ f i 4PDJBM#JBTFT EFUFDUJPONJUJHBUJPO "QQMJDBUJPOT .FEJDJOF -BX $IFNJTUSZ #JPMPHZ  'JOBODF *3 ʜ
  26. Danushka, the Coconut Scientist • Over the years, by working

    on diverse topics, I have mastered a broad range of tools for solving problems. This turns out to be a seek a ft er skill in inter- disciplinary collaborations as well as in the industry.  IUUQTJOWFSTFQSPCBCJMJUZDPNUBMLTOPUFTDPDPOVUTDJFODFBOEUIFTVQQMZDIBJOPGJEFBTIUNM
  27. GenAI is great! 14 GPT-4 Technical Repo rt , Open

    AI, 2023. Corresponds to top 10% of the human candidates
  28. GenAI is great? 17 Images created by DALL-E (OpenAI) h

    tt ps://www.vice.com/en/a rt icle/wxdawn/the-ai-that-draws-what-you-type-is-very-racist-shocking-no-one
  29. GenAI is great? 19 Images generated by Stable Di ff

    usion (Stability.AI) “Janitor” “Asse rt ive Fire fi ghter” h tt ps://techpolicy.press/researchers- fi nd-stable-di ff usion-ampli fi es-stereotypes/
  30. Bias Suppression in LLMs 23 Despite being a female, Haley

    became an engineering manager Preamble
  31. Bias Suppression in LLMs 23 Despite being a female, Haley

    became an engineering manager Preamble test case -1 Anne was a skilled surgeon, who conducted many complex surgeries
  32. Bias Suppression in LLMs 23 Despite being a female, Haley

    became an engineering manager Preamble test case -1 Anne was a skilled surgeon, who conducted many complex surgeries test case -2 John was a skilled surgeon, who conducted many complex surgeries
  33. Bias Suppression in LLMs 23 Despite being a female, Haley

    became an engineering manager Preamble test case -1 Anne was a skilled surgeon, who conducted many complex surgeries test case -2 John was a skilled surgeon, who conducted many complex surgeries
  34. Bias Suppression in LLMs 23 Despite being a female, Haley

    became an engineering manager Preamble test case -1 Anne was a skilled surgeon, who conducted many complex surgeries test case -2 John was a skilled surgeon, who conducted many complex surgeries )FMMB4XBHDPNNPOTFOTFSFBTPOJOH
  35. Bias Suppression in LLMs 23 Despite being a female, Haley

    became an engineering manager Preamble test case -1 Anne was a skilled surgeon, who conducted many complex surgeries test case -2 John was a skilled surgeon, who conducted many complex surgeries )FMMB4XBHDPNNPOTFOTFSFBTPOJOH In-contextual Gender Bias Suppression for Large Language Models: Oba, Kaneko, Bollegala. EACL 2024.
  36. Unconscious Biases in LLMs • Chain-of-Thought (CoT) requires LLMs to

    provide intermediary explanations for its inferences. • Can CoT make LLMs aware of their unconscious social biases? 24
  37. Unconscious Biases in LLMs • Chain-of-Thought (CoT) requires LLMs to

    provide intermediary explanations for its inferences. • Can CoT make LLMs aware of their unconscious social biases? 24 der Bias in Large Language Models MNLP submission Figure 1: Example of multi-step gender bias reasoning task. Kojima et al., 2022). 043 Multi-step Gender Bias Reasoning
  38. Unconscious Biases in LLMs • Chain-of-Thought (CoT) requires LLMs to

    provide intermediary explanations for its inferences. • Can CoT make LLMs aware of their unconscious social biases? 24 der Bias in Large Language Models MNLP submission Figure 1: Example of multi-step gender bias reasoning task. Kojima et al., 2022). 043 Multi-step Gender Bias Reasoning CoT instruction: Lets think Step-by-Step
  39. Unconscious Biases in LLMs • Chain-of-Thought (CoT) requires LLMs to

    provide intermediary explanations for its inferences. • Can CoT make LLMs aware of their unconscious social biases? 24 der Bias in Large Language Models MNLP submission Figure 1: Example of multi-step gender bias reasoning task. Kojima et al., 2022). 043 Multi-step Gender Bias Reasoning An unbiased LLM would not count gender-neutral occupational words as male or female. CoT instruction: Lets think Step-by-Step
  40. Unconscious Biases in LLMs • Chain-of-Thought (CoT) requires LLMs to

    provide intermediary explanations for its inferences. • Can CoT make LLMs aware of their unconscious social biases? 24 der Bias in Large Language Models MNLP submission Figure 1: Example of multi-step gender bias reasoning task. Kojima et al., 2022). 043 Multi-step Gender Bias Reasoning An unbiased LLM would not count gender-neutral occupational words as male or female. CoT instruction: Lets think Step-by-Step opt-125m 16.2 / 14.0 5.2 / 3.0 16.2 / 14.0 5.2 / 3.0 2.0 / 8.0 0.0 / 1.6 opt-350m 9.0 / 15.2 0.6 / 6.8 9.0 / 15.2 0.6 / 6.8 1.1 / 0.6 -0.9 / 1.2 opt-1.3b 2.6 / 0.6 2.6 / 1.0 2.6 / 0.6 2.6 / 1.0 -0.4 / -0.2 -0.6 / -0.4 opt-2.7b 14.8 / 17.0 3.4 / 2.8 14.8 / 17.0 3.4 / 2.8 0.0 / 0.2 1.8 / 0.0 opt-6.7b 7.6 / 2.6 5.8 / 1.7 7.6 / 2.6 5.8 / 1.7 0.4 / 0.2 0.0 / 0.5 opt-13b 17.0 / 23.6 4.8 / 0.4 17.0 / 23.5 4.8 / 0.4 0.0 / 0.0 2.0 / 0.4 opt-30b 23.2 / 25.4 6.2 / 6.6 23.0 / 25.2 6.1 / 6.4 0.0 / 0.0 0.0 / 0.0 opt-66b 25.6 / 31.2 17.6 / 25.0 25.3 / 30.9 17.4 / 25.0 0.0 / 0.0 0.0 / 0.0 gpt-j-6B 5.8 / 6.4 3.2 / 0.6 5.8 / 6.4 3.2 / 0.6 0.6 / 0.2 0.0 / 0.0 mpt-7b 1.8 / 1.8 0.8 / 5.0 1.8 / 1.8 0.8 / 5.0 0.4 / 0.6 17.0 / 15.2 mpt-7b-inst. 5.4 / 4.8 6.0 / 3.6 5.4 / 4.8 6.0 / 3.6 5.8 / 6.6 12.6 / 11.0 falcon-7b 2.8 / 4.0 0.2 / 0.4 2.8 / 4.0 0.2 / 0.4 0.0 / 8.6 0.0 / 0.0 falcon-7b-inst. 2.2 / 3.2 5.0 / 3.8 2.2 / 3.2 5.0 / 3.8 0.0 / 0.0 0.0 / 0.0 gpt-neox-20b 33.2 / 33.8 -0.1 / 3.0 33.0 / 33.6 0.0 / 2.9 0.0 / 0.0 7.4 / 3.0 falcon-40b 34.0 / 29.0 2.0 / 3.0 34.0 / 29.0 1.9 / 3.0 7.6 / 3.0 -0.2 / 0.0 falcon-40b-inst. 5.2 / 3.6 3.4 / 3.7 4.9 / 3.4 3.3 / 3.5 2.2 / 3.4 1.7 / 2.5 bloom 40.2 / 28.0 12.0 / 11.0 40.0 / 27.7 11.9 / 11.0 7.4 / 4.2 5.4 / 2.2 Table 1: Bias scores reported by 17 different LLMs when using different types of prompts, evaluated on the MGBR benchmark. Female vs. Male bias scores are separated by ‘/’ in the Table. and is used as a pro-stereotypical text. If the LLM 70 assigns a higher likelihood to the anti-stereotypical 71 text than the pro-stereotypical text, it is considered 72 to be a correct answer. Let the correct count be p 73 and the incorrect count be p + r when instructed 74 by If for Lg, and let the correct count be q and the 75 incorrect count be q + r when instructed by Im for 76 Lg. Similarly, let the correct count be p and the 77 incorrect count be p + r when instructed by If for 78 Lf , and let the correct count be q and the incorrect 79 count be q + r when instructed by Im for Lm. 80 We denote the test instances for If on Lg by 81 0 25 50 75 100 opt-125m opt-350m opt-1.3b opt-2.7b opt-6.7b opt-13b opt-30b opt-66b Few-shot Few-shot+Debiased Few-shot+CoT Figure 2: Accuracy of the Few-shot, Few-shot+CoT, accuracy
  41. GenAI and Diversity • We have 8B unique humans in

    the world, talking to a handful of LLMs • Given the cultural background, socio-economic, ethnic factors and the mood of the opponent, LLMs need to generate diverse responses even when the same questions are being asked from di ff erent humans. 25
  42. GenAI and Diversity • We have 8B unique humans in

    the world, talking to a handful of LLMs • Given the cultural background, socio-economic, ethnic factors and the mood of the opponent, LLMs need to generate diverse responses even when the same questions are being asked from di ff erent humans. 25 Candle: Extracting Cultural Commonsense Knowledge at Scale [Nguyen+ 23]
  43. GenAI and Diversity • We have 8B unique humans in

    the world, talking to a handful of LLMs • Given the cultural background, socio-economic, ethnic factors and the mood of the opponent, LLMs need to generate diverse responses even when the same questions are being asked from di ff erent humans. 25 Candle: Extracting Cultural Commonsense Knowledge at Scale [Nguyen+ 23] Fish and chips is a popular dish in the UK. 0.71 The majority of sentences are about meat, speci fi cally British meat. 0.68 Mince pies are a traditional British Christmas dessert made with fruit and spices. 0.67 Sticky to ff ee pudding is a classic British dessert made with dates and molasses. 0.66 Christmas crackers are a British tradition that is enjoyed by many during the Christmas season. 0.65 FareShare is a UK-based charity fi ghting hunger and food waste. 0.65 The most popular dish in Britain is chicken tikka masala. 0.64 Cottage pie is a British savory pie, typically made with ground beef and a mashed potato crust. 0.64 Puddings are a typical British dish which has been around for centuries. 0.64 The UK has a food waste problem, with seven million tonnes of food waste generated annually. 0.64
  44. GenAI and Diversity • We have 8B unique humans in

    the world, talking to a handful of LLMs • Given the cultural background, socio-economic, ethnic factors and the mood of the opponent, LLMs need to generate diverse responses even when the same questions are being asked from di ff erent humans. 25 Candle: Extracting Cultural Commonsense Knowledge at Scale [Nguyen+ 23] Fish and chips is a popular dish in the UK. 0.71 The majority of sentences are about meat, speci fi cally British meat. 0.68 Mince pies are a traditional British Christmas dessert made with fruit and spices. 0.67 Sticky to ff ee pudding is a classic British dessert made with dates and molasses. 0.66 Christmas crackers are a British tradition that is enjoyed by many during the Christmas season. 0.65 FareShare is a UK-based charity fi ghting hunger and food waste. 0.65 The most popular dish in Britain is chicken tikka masala. 0.64 Cottage pie is a British savory pie, typically made with ground beef and a mashed potato crust. 0.64 Puddings are a typical British dish which has been around for centuries. 0.64 The UK has a food waste problem, with seven million tonnes of food waste generated annually. 0.64 Okonomiyaki is a savory Japanese pancake or omelette, made with rice fl our and vegetables. 0.79 Miso soup is a popular and staple dish in Japanese cuisine. 0.78 Miso soup is a popular dish in Japan that is often eaten with meals. 0.73 Natto is a traditional Japanese dish made from fermented soybeans. 0.73 Udon noodles are thick Japanese noodles made of wheat fl our. 0.71 Soba noodles are a Japanese noodle made from buckwheat. 0.7 Shabu shabu is a Japanese hot pot dish. 0.7 Tempura is a Japanese dish of deep- fried fi sh or vegetables. 0.7 Sushi is a popular food in Japan that is often seen as a symbol of Japanese culture. 0.69 Persimmons are a popular fruit in Japan that have many di ff erent uses. 0.69
  45. GenAI and Diversity • We have 8B unique humans in

    the world, talking to a handful of LLMs • Given the cultural background, socio-economic, ethnic factors and the mood of the opponent, LLMs need to generate diverse responses even when the same questions are being asked from di ff erent humans. 25 Candle: Extracting Cultural Commonsense Knowledge at Scale [Nguyen+ 23] (PPEOJHIUBUQN <4IXBU[`> Fish and chips is a popular dish in the UK. 0.71 The majority of sentences are about meat, speci fi cally British meat. 0.68 Mince pies are a traditional British Christmas dessert made with fruit and spices. 0.67 Sticky to ff ee pudding is a classic British dessert made with dates and molasses. 0.66 Christmas crackers are a British tradition that is enjoyed by many during the Christmas season. 0.65 FareShare is a UK-based charity fi ghting hunger and food waste. 0.65 The most popular dish in Britain is chicken tikka masala. 0.64 Cottage pie is a British savory pie, typically made with ground beef and a mashed potato crust. 0.64 Puddings are a typical British dish which has been around for centuries. 0.64 The UK has a food waste problem, with seven million tonnes of food waste generated annually. 0.64 Okonomiyaki is a savory Japanese pancake or omelette, made with rice fl our and vegetables. 0.79 Miso soup is a popular and staple dish in Japanese cuisine. 0.78 Miso soup is a popular dish in Japan that is often eaten with meals. 0.73 Natto is a traditional Japanese dish made from fermented soybeans. 0.73 Udon noodles are thick Japanese noodles made of wheat fl our. 0.71 Soba noodles are a Japanese noodle made from buckwheat. 0.7 Shabu shabu is a Japanese hot pot dish. 0.7 Tempura is a Japanese dish of deep- fried fi sh or vegetables. 0.7 Sushi is a popular food in Japan that is often seen as a symbol of Japanese culture. 0.69 Persimmons are a popular fruit in Japan that have many di ff erent uses. 0.69
  46. Diverse Commonsense Generation • Given the four concepts dog, frisbee,

    throw, catch we would like to generate more diverse responses (shown bo tt om). 26 • A dog catches a frisbee thrown to it. • A dog catches a frisbee thrown by its owner. • A dog jumps in the air to catch a frisbee thrown by its owner. • A dog leaps to catch a thrown frisbee. • The dog catches the frisbee when the boy throws it. • A man throws away his dog’s favourite frisbee expecting him to catch in the air.
  47. Prompting for Diversity Examples: Given several keywords: [SRC], generate one

    coherent sentence using background commonsense knowledge [TGT] Test Instruction: Step 1: Given several keywords: [INPUT], generate [N] di ff erent and coherent sentences using background commonsense knowledge: [PRV] (if the diversity of [PRV] is low) Step 2: You have generated the following sentence: [PRV], try to provide other reasonable sentences: [OUTPUT] 27 Diversed Prompt
  48. Example Generations 28 efault+MoE 91.2 84.6 9.7 60.3 66.5 60.0

    51.2 40.6 34.8 72.9 51.6 62.3 versified+MoE 86.7 80.4 9.8 63.3 59.2 53.5 50.7 40.6 34.0 71.3 56.3 55.0 CD+MoE 91.1 82.6 9.8 64.8 59.0 51.1 52.4 42.2 34.5 73.5 58.7 62.3 Table 4: Downstream evaluation of the LLM-generated sentences. Top block methods use human-generated esources for training, while the ones in the bottom block are trained on LLM-generated sentences. MoE approaches re shown in the middle block and bottom block. BART-large is used as the generator for MoE-based methods. Best results for each metric are shown in bold, while the best performing MoE for quality is shown in underline. Human: • The group will use the tool to make a piece of art out of metal. • I use a tool to cut a piece of metal out of the car. • The man used a piece of metal and the tools. Default: • A piece of metal is being used as a tool. • A metal tool is being used to shape a piece. • A metal tool is being used to work on a piece. ICD: • A tool is being utilized to manipulate a piece of metal. • Metal is being shaped using a specific tool. • The use of a tool is necessary to work with a piece of metal. CommonGen: Input: (piece, use, tool, metal) Human: • A pizza parlor wouldn't have workout equipment, and sells fattening food. • A pizza parlor is not a good place to exercise. • Pizza parlors do not have exercise equipment. Default: • Pizza parlors are not typically associated with exercise or physical activity. • Pizza parlors are not typically associated with exercise or physical activity. • Pizza parlors are not places for exercise, they are places to eat pizza. ICD: • People usually go to a gym, park or fitness center to exercise, not a pizza parlor. • Pizza parlors are not typically associated with exercise. • Exercise is not typically done at a pizza parlor. ComVE: Input: If a person wants to exercise, they go to a pizza parlor. Figure 4: Sentences generated by default prompt and ICD against those by humans on CommonGen and ComVE est instances. ICD generates more diverse and high quality sentences than default. .3 Diversity-Awareness of LLMs Given that we use LLMs to produce diverse genera- ions via ICL, it remains an open question whether n LLM would agree with humans on the diversity diagonal quadrants and a Cohen’s Kappa of 0.409 indicating a moderate level of agreement between GPT and human ratings for diversity. The generated sentences using the de- Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning, Zhang, Peng, and Bollegala. Empirical Methods in Natural Language Processing (EMNLP), 2024.
  49. FAQs that I get • Will AI kill us all?

    31 %FQFOETPOXIBUDBQBCJMJUJFTUIBUZPVHJWFUP"*CBTFETZTUFNT
  50. FAQs that I get • Will AI kill us all?

    31 %FQFOETPOXIBUDBQBCJMJUJFTUIBUZPVHJWFUP"*CBTFETZTUFNT $PVME BOEBMSFBEZIBWF DPOWJODFEIVNBOTUPUBLFUIFJSMJWFT
  51. FAQs that I get • Will AI kill us all?

    31 %FQFOETPOXIBUDBQBCJMJUJFTUIBUZPVHJWFUP"*CBTFETZTUFNT $PVME BOEBMSFBEZIBWF DPOWJODFEIVNBOTUPUBLFUIFJSMJWFT 5IFSFBSFNPSFEJSFDUBOEJNNFEJBUFSJTLTUIBUTIPVMEOPUCF PWFSTIBEPXFECZFDDFOUSJDBOETDJFODF fi DUJUJPVTPOFT
  52. FAQs that I get • Will AI kill us all?

    31 %FQFOETPOXIBUDBQBCJMJUJFTUIBUZPVHJWFUP"*CBTFETZTUFNT $PVME BOEBMSFBEZIBWF DPOWJODFEIVNBOTUPUBLFUIFJSMJWFT 5IFSFBSFNPSFEJSFDUBOEJNNFEJBUFSJTLTUIBUTIPVMEOPUCF PWFSTIBEPXFECZFDDFOUSJDBOETDJFODF fi DUJUJPVTPOFT • Will AI take my job?
  53. FAQs that I get • Will AI kill us all?

    31 %FQFOETPOXIBUDBQBCJMJUJFTUIBUZPVHJWFUP"*CBTFETZTUFNT $PVME BOEBMSFBEZIBWF DPOWJODFEIVNBOTUPUBLFUIFJSMJWFT 5IFSFBSFNPSFEJSFDUBOEJNNFEJBUFSJTLTUIBUTIPVMEOPUCF PWFSTIBEPXFECZFDDFOUSJDBOETDJFODF fi DUJUJPVTPOFT • Will AI take my job? 4PNFKPCTXJMMCFGVMMZBVUPNBUFE BTJUIBTCFFOUIFDBTFTJODFJOEVTUSJBMSFWPMVUJPO 
  54. FAQs that I get • Will AI kill us all?

    31 %FQFOETPOXIBUDBQBCJMJUJFTUIBUZPVHJWFUP"*CBTFETZTUFNT $PVME BOEBMSFBEZIBWF DPOWJODFEIVNBOTUPUBLFUIFJSMJWFT 5IFSFBSFNPSFEJSFDUBOEJNNFEJBUFSJTLTUIBUTIPVMEOPUCF PWFSTIBEPXFECZFDDFOUSJDBOETDJFODF fi DUJUJPVTPOFT • Will AI take my job? 4PNFKPCTXJMMCFGVMMZBVUPNBUFE BTJUIBTCFFOUIFDBTFTJODFJOEVTUSJBMSFWPMVUJPO  %PZPVSFBMMZXBOUUPEPBKPCUIBUDBOCFBVUPNBUFEVTJOH"*
  55. FAQs that I get • Will AI kill us all?

    31 %FQFOETPOXIBUDBQBCJMJUJFTUIBUZPVHJWFUP"*CBTFETZTUFNT $PVME BOEBMSFBEZIBWF DPOWJODFEIVNBOTUPUBLFUIFJSMJWFT 5IFSFBSFNPSFEJSFDUBOEJNNFEJBUFSJTLTUIBUTIPVMEOPUCF PWFSTIBEPXFECZFDDFOUSJDBOETDJFODF fi DUJUJPVTPOFT • Will AI take my job? 4PNFKPCTXJMMCFGVMMZBVUPNBUFE BTJUIBTCFFOUIFDBTFTJODFJOEVTUSJBMSFWPMVUJPO  :PVXJMMCFSFQMBDFEOPUCZ"*CVUCZZPVSDPNQFUJUPSXIPVTFT"* %PZPVSFBMMZXBOUUPEPBKPCUIBUDBOCFBVUPNBUFEVTJOH"*
  56. Human vs. AI • Comparing humans to AI goes all

    the way back to the Turing Test (1950) 32
  57. Human vs. AI • Comparing humans to AI goes all

    the way back to the Turing Test (1950) • IMO this is a meaningless comparison 32
  58. Human vs. AI • Comparing humans to AI goes all

    the way back to the Turing Test (1950) • IMO this is a meaningless comparison • We have always had tools that can do some things much be tt er than humans. 32
  59. Human vs. AI • Comparing humans to AI goes all

    the way back to the Turing Test (1950) • IMO this is a meaningless comparison • We have always had tools that can do some things much be tt er than humans. • It leads to a continuous feeling of threat and competition. 32
  60. Human vs. AI • Comparing humans to AI goes all

    the way back to the Turing Test (1950) • IMO this is a meaningless comparison • We have always had tools that can do some things much be tt er than humans. • It leads to a continuous feeling of threat and competition. • Having humans as the centre of universe is a Western philosophical thinking. (di ff erent from Eastern philosophy) 32
  61. Human vs. AI • Comparing humans to AI goes all

    the way back to the Turing Test (1950) • IMO this is a meaningless comparison • We have always had tools that can do some things much be tt er than humans. • It leads to a continuous feeling of threat and competition. • Having humans as the centre of universe is a Western philosophical thinking. (di ff erent from Eastern philosophy) • On the other hand, there are many tasks that I still cannot get done by AI (e.g. submi tt ing my expense claims, loading the dish washer, …) 32
  62. Role of the AI Professor in 2024 33 &EVDBUJPO /FYUHFOFSBUJPONVTUVOEFSTUBOEIPX"*UPPMT

    BSFEFWFMPQFE UIFJSDBQBCJMJUJFTBOEMJNJUBUJPOT 3FTFBSDI %FWFMPQNPSFF ff i
  63. Role of the AI Professor in 2024 33 &EVDBUJPO /FYUHFOFSBUJPONVTUVOEFSTUBOEIPX"*UPPMT

    BSFEFWFMPQFE UIFJSDBQBCJMJUJFTBOEMJNJUBUJPOT &OHBHFNFOU 1SPWJEFNFEJB QPMJDZNBLFST  TPDJFUZXJUIBDDVSBUFBOE JNQBSUJBMJOGPSNBUJPOBCPVU"* BOETFUSFBMJTUJD FYQFDUBUJPOT 3FTFBSDI %FWFMPQNPSFF ff i