DEIM2024 / Improving News Recommendation Performance by Adding Category Descriptions Generated with a Large Language Model

Slide 1

Slide 1 text

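Improving News Recommendation Performance by Adding Category Descriptions Generated with a Large Language Model
矢田宙生†, 山名早人†
† Graduate School of Fundamental Science and Engineering, Waseda University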

Slide 2

Slide 2 text

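Executive Summary

News recommendation
● Recommends news articles that a user is likely to prefer, based on the user's browsing history and profile
● Methods that use language models show high performance [1]

News category information
● An important source of information for understanding news content
→ A category name alone (e.g. "tv-golden-glove") does not carry enough information

Proposed method
● Generate a description of each news category with a large language model (GPT-4)
→ Feed the generated description into the recommendation model as additional information

Evaluation experiment
● Baselines:
title only: the article title only
template-based: a sentence made by inserting the category name into a template
● Validated by training NRMS-BERT [1] on MIND (Microsoft News Dataset) [2]. The proposed method achieved the highest performance on all metrics.

Figure 1.1. Generating the article category description. Labels: news article category information is given as input; GPT-4; a detailed description of the article category is generated. Example texts: "The news category is tv-golden-glove"; "The TV-Golden Globes category focuses on news related to the Golden Globe Awards ….."

Figure 1.2. Input of the category description to the recommendation model. Labels: "Pierce Brosnan's Sons Paris and Dylan Named 2020 (news article title), tokens w 1 ... w n; SEP; "The TV-Golden Globes category focuses on news …" (generated article category description), tokens w 1 ... w m; BERT; Additional Layer; news vector.

[1] C. Wu, F. Wu, T. Qi, and Y. Huang, “Empowering News Recommendation with Pre-Trained Language Models,” in Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, Virtual, 2021, pp. 1652–1656.
[2] F. Wu et al., “MIND: A Large-scale Dataset for News Recommendation,” in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Virtual, 2020, pp. 3597–3606.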
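The three input variants compared on this slide (title only, template-based, and generated description) can be sketched roughly as follows. This is a minimal illustration and not the authors' code: the function names, the plain-text "[SEP]" joining, and the lookup table of descriptions are all assumptions made for the example, and the template wording is inferred from the example sentence shown on the slide. In practice a BERT tokenizer would insert the separator token itself, and the descriptions would come from GPT-4.

```python
from typing import Dict, Optional

# Assumption: BERT's separator token written as plain text for illustration.
SEP = "[SEP]"


def template_description(category: str) -> str:
    """Template-based baseline: insert the category name into a fixed sentence.

    The wording is inferred from the slide's example and is an assumption.
    """
    return f"The news category is {category}"


def build_input_text(title: str, description: Optional[str] = None) -> str:
    """Build the text handed to the news encoder.

    With no description this is the "title only" baseline; otherwise the
    title and the category description are joined with a separator.
    """
    if description is None:
        return title
    return f"{title} {SEP} {description}"


# Hypothetical table standing in for descriptions generated offline by GPT-4.
# The text is the truncated example from the slide, kept truncated.
generated_descriptions: Dict[str, str] = {
    "tv-golden-glove": (
        "The TV-Golden Globes category focuses on news related to "
        "the Golden Globe Awards ….."
    ),
}

title = "Pierce Brosnan's Sons Paris and Dylan Named 2020"
category = "tv-golden-glove"

title_only = build_input_text(title)
template_based = build_input_text(title, template_description(category))
proposed = build_input_text(title, generated_descriptions[category])
```

The only difference between the three variants is the second text segment, so any gap in recommendation metrics can be attributed to the information carried by the category text rather than to a change in the encoder.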

Slide 21

Slide 21 text

Related work: applications of text generated by large language models

● In 2021, Yoo et al. [10] proposed GPT3Mix, which applies text augmentation with GPT-3 [11] to document classification. They augmented the training data with GPT-3 and trained a BERT-based document classifier on it. This outperformed training on the unaugmented dataset.
● In 2022, Liu et al. [12] applied text augmentation with GPT-3 to question answering. The question and related knowledge generated by GPT-3 are fed to the model. This outperformed a baseline that takes only the question as input.
● In 2023, Pratt et al. [13] proposed CuPL, which applies text augmentation with a large language model to zero-shot image classification with CLIP. Image caption text is generated with GPT-3 and fed to the model. This outperformed a method that uses captions based on the template "a photo of a {}".

[10] K. M. Yoo et al., “GPT3Mix: Leveraging Large-scale Language Models for Text Augmentation,” in Findings of the Association for Computational Linguistics: EMNLP 2021, Punta Cana, Dominican Republic, 2021, pp. 2225–2239.
[11] T. B. Brown et al., “Language models are few-shot learners,” in Proceedings of the 34th International Conference on Neural Information Processing Systems, Vancouver, BC, Canada: Curran Associates Inc., 2020, pp. 1877–1901.
[12] J. Liu et al., “Generated knowledge prompting for commonsense reasoning,” in Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Dublin, Ireland, 2022, pp. 3154–3169.
[13] S. Pratt, I. Covert, R. Liu, and A. Farhadi, “What does a platypus look like? Generating customized prompts for zero-shot image classification,” in 2023 IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 2023, pp. 15645–15655.
