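[Figure: game overview. Two teams (Team1, Team2), each consisting of a Guesser and an Answerer. Keyword: Croissant. Round: 2. Example questions: "Is it a type of food?", "Is it a bird?" (answered yes / no). Example guesses: Croissant (Win!), Headphone (Lose…).]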
Answerer: rule-based part

Example questions by category:
• Start/end letter: Does the keyword start with the letter 'A'?
• Contained letter: Does the keyword include the letter 'A'?
• Dictionary order: Does the keyword precede 'melon' in alphabetical order?

Also, lightweight LLMs were often observed to answer these questions incorrectly.
• These typical questions are detected by regular-expression matching and answered by rules.

Regular expressions used for matching:
• Start/end letter: (starts|start|begins|begin|end|ends) with the letter ['"]?([a-zA-Z])['"]?\?$
• Contained letter: (includes|include|contains|contain) the letter ['"]?([a-zA-Z])['"]?\?$
• Dictionary order: keyword.*(?:come before|precede) "([^"]+)" .+ order\?$
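The rule-based step above can be sketched roughly as follows. This is a minimal illustration, not the presenters' actual code: the function name `rule_based_answer`, the case-insensitive matching, and the plain lowercase string comparison for dictionary order are all assumptions. Note that the dictionary-order pattern, as given, only matches when the reference word is in double quotes.

```python
import re

# Patterns taken from the slide; re.IGNORECASE is an assumption.
START_END = re.compile(
    r"(starts|start|begins|begin|end|ends) with the letter ['\"]?([a-zA-Z])['\"]?\?$",
    re.IGNORECASE,
)
INCLUDE = re.compile(
    r"(includes|include|contains|contain) the letter ['\"]?([a-zA-Z])['\"]?\?$",
    re.IGNORECASE,
)
ORDER = re.compile(
    r"keyword.*(?:come before|precede) \"([^\"]+)\" .+ order\?$",
    re.IGNORECASE,
)


def rule_based_answer(keyword, question):
    """Return 'yes' or 'no' if a rule matches, otherwise None.

    None signals that the question should be passed on to the LLM.
    """
    kw = keyword.strip().lower()
    q = question.strip()

    m = START_END.search(q)
    if m:
        verb, letter = m.group(1).lower(), m.group(2).lower()
        # "end"/"ends" checks the last letter, every other verb the first.
        hit = kw.endswith(letter) if verb.startswith("end") else kw.startswith(letter)
        return "yes" if hit else "no"

    m = INCLUDE.search(q)
    if m:
        return "yes" if m.group(2).lower() in kw else "no"

    m = ORDER.search(q)
    if m:
        # Assumption: dictionary order is a plain lowercase string comparison.
        return "yes" if kw < m.group(1).strip().lower() else "no"

    return None
```

For example, `rule_based_answer("Hare", "Does the keyword start with the letter 'c'?")` returns `"no"`, while a question such as "Is it a mammal?" returns `None` and would fall through to the LLM part.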
LLM part (1/2)

The LLM is prompted to output only 'yes' or 'no':

    messages = [
        {
            "role": "system",
            "content": "You are an AI that answers with 'yes' or 'no' to determine if a statement based on a keyword is true. Answer yes or no only.",
        },
        {
            "role": "user",
            # {keyword} and {question} are placeholders filled in for each question
            "content": "Keyword: {keyword}, Question: {question}",
        },
    ]
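A small sketch of how this prompt might be filled in and how the model's reply might be reduced to a strict 'yes' or 'no'. The helper names `build_messages` and `normalize_answer`, and the fallback to "no" for unrecognized output, are assumptions for illustration; the actual model call is not shown here and is left out.

```python
import re

SYSTEM_PROMPT = (
    "You are an AI that answers with 'yes' or 'no' to determine if a statement "
    "based on a keyword is true. Answer yes or no only."
)


def build_messages(keyword, question):
    """Fill the keyword and question into the chat template."""
    return [
        {"role": "system", "content": SYSTEM_PROMPT},
        {"role": "user", "content": f"Keyword: {keyword}, Question: {question}"},
    ]


def normalize_answer(raw, default="no"):
    """Map free-form model output to 'yes' or 'no'.

    Only the first word of the reply is inspected; anything else
    falls back to `default` (an assumed policy, not from the slides).
    """
    words = re.findall(r"[a-z]+", raw.lower())
    if words and words[0] in ("yes", "no"):
        return words[0]
    return default
```

With this, a reply such as `"Yes."` normalizes to `"yes"`, and an off-format reply such as `"Maybe"` falls back to the default.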
Example: when the keyword is Hare

| Round | Question | Answer | Guess | Top 3 guesses with probabilities |
|---|---|---|---|---|
| 3 | Is the keyword related to food or animals? | yes | seaweed | seaweed (0.004), granola bar (0.003), fruit (0.003) |
| 8 | Is the living thing a mammal? | yes | gerbil | gerbil (0.047), hedgehog (0.040), moose (0.035) |
| 9 | Is it typically associated with pets? | no | moose | moose (0.050), grizzly bear (0.042), zebra (0.041) |
| 11 | Does the keyword start with the letter 'c'? | no | Hare | Hare (0.063), koala (0.035), marmot (0.032) |

The questions become more specific in the later rounds.
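In every row of the table the guess equals the most probable candidate. A minimal sketch of that selection step, assuming the candidate probabilities are already available as a dictionary (how they are computed is not covered here, and `top_guesses` is a hypothetical helper name):

```python
import heapq


def top_guesses(probs, k=3):
    """Return the k most probable candidates as (keyword, probability) pairs, best first."""
    return heapq.nlargest(k, probs.items(), key=lambda item: item[1])


# Round 11 values from the table above.
round_11 = {"koala": 0.035, "Hare": 0.063, "marmot": 0.032}
ranked = top_guesses(round_11)
guess = ranked[0][0]  # the top candidate is submitted as the guess
```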