2024/02/07, 第52回 NLPコロキウム
https://nlp-colloquium-jp.github.io/schedule/2024-02-07_goro-kobayashi/
以下3論文から主要な知見をまとめてご紹介しました。
- Attention is Not Only a Weight: Analyzing Transformers with Vector Norms (EMNLP'20) https://aclanthology.org/2020.emnlp-main.574/
- Incorporating Residual and Normalization Layers into Analysis of Masked Language Models (EMNLP'21) https://aclanthology.org/2021.emnlp-main.373/
- Analyzing Feed-Forward Blocks in Transformers through the Lens of Attention Map (ICLR'24 Spotlight) https://openreview.net/forum?id=mYWsyTuiRp