Slide 6
Prior work: viewing the feed-forward network as a memory (key-value memory) [Geva+'21]
The feed-forward network (= a 2-layer MLP) is similar to the attention mechanism
[Figure: an attention head. Attention weights are computed as inner products with key vectors; the corresponding value vectors are then combined by a weighted sum.]
(Annotations on the equations below: W_h^Q, W_h^K, W_h^V and W_1, W_2 are weight matrices; the two forms are remarkably similar.)
Figure 2: Illustration of how an FFN module in a Transformer block works as a key-value memory. The first linear layer FFN^(key) computes intermediate neurons through inner product. Taking the activation of these neurons as weights, the second linear layer FFN^(val) integrates value vectors through weighted sum. We hypothesize that knowledge neurons in the FFN module are responsible for expressing factual knowledge.
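To make this key-value reading concrete, here is a minimal numpy sketch (my own illustration, not code from either paper; the names d_model, d_ff, W1, W2, h and all sizes are assumptions): the columns of the first weight matrix act as keys matched against the hidden state by inner product, the resulting activations serve as memory coefficients, and the rows of the second weight matrix act as value vectors combined by a weighted sum.

```python
import numpy as np

def gelu(x):
    # tanh approximation of GELU
    return 0.5 * x * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * x**3)))

d_model, d_ff = 8, 32                   # illustrative sizes, not from the paper
rng = np.random.default_rng(0)
W1 = rng.normal(size=(d_model, d_ff))   # columns of W1 act as key vectors
W2 = rng.normal(size=(d_ff, d_model))   # rows of W2 act as value vectors
h = rng.normal(size=d_model)            # one hidden state (one token position)

# Standard form: FFN(h) = gelu(h W1) W2
ffn_out = gelu(h @ W1) @ W2

# Key-value reading: inner products with keys -> activations -> weighted sum of values
activations = gelu(np.array([h @ W1[:, i] for i in range(d_ff)]))
kv_out = sum(activations[i] * W2[i] for i in range(d_ff))

assert np.allclose(ffn_out, kv_out)     # the two forms are algebraically identical
```

The assertion passes because the decomposition only re-groups the same matrix products; the knowledge-neuron view then attends to individual entries of `activations`.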
Extensive analysis shows the effectiveness of the proposed knowledge attribution method. First, suppressing and amplifying knowledge neurons notably affects the expression of the corresponding knowledge. Second, we find that knowledge neurons of a fact tend to be activated more by corresponding knowledge-expressing prompts. Third, given the knowledge neurons of a fact, the top activating prompts retrieved from open-domain texts usually express the corresponding fact, while the bottom activating prompts do not express the correct relation.

In our case studies, we try to leverage knowledge neurons to explicitly edit factual knowledge in pretrained Transformers without any fine-tuning. We present two preliminary studies: updating facts, and erasing relations. After identifying the knowledge neurons, we perform a knowledge surgery that edits the corresponding facts in Transformers, even without any fine-tuning.
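As a concrete picture of what "suppressing and amplifying knowledge neurons" means at the tensor level, below is a hedged PyTorch sketch (the FFN module, the hook-based intervention, and the names scale_neuron, neuron_idx, and scale are my own illustration under assumed shapes, not the authors' released code): a forward hook rescales one intermediate activation, with scale 0.0 for suppression and a value above 1.0 for amplification.

```python
import torch
import torch.nn as nn

class FFN(nn.Module):
    """Two-layer feed-forward block: FFN(H) = gelu(H W1) W2."""
    def __init__(self, d_model=8, d_ff=32):              # illustrative sizes
        super().__init__()
        self.fc1 = nn.Linear(d_model, d_ff, bias=False)  # first layer ("keys")
        self.act = nn.GELU()
        self.fc2 = nn.Linear(d_ff, d_model, bias=False)  # second layer ("values")

    def forward(self, h):
        return self.fc2(self.act(self.fc1(h)))

def scale_neuron(ffn, neuron_idx, scale):
    """Rescale the activation of one intermediate FFN neuron.
    scale=0.0 suppresses the neuron; scale>1.0 amplifies it."""
    def hook(module, inputs, output):
        output = output.clone()
        output[..., neuron_idx] *= scale
        return output                        # returned tensor replaces the activation
    return ffn.act.register_forward_hook(hook)

ffn = FFN()
h = torch.randn(1, 8)
baseline = ffn(h)
handle = scale_neuron(ffn, neuron_idx=5, scale=0.0)      # suppress neuron 5
suppressed = ffn(h)
handle.remove()
print((baseline - suppressed).abs().max())  # how much that one neuron contributed
```

On a real pretrained Transformer the same kind of hook would be attached to the FFN of the identified layer; this sketch only shows the local effect of the intervention.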
2 Background: Transformer
Transformer (Vaswani et al., 2017) is one of the most popular and effective NLP architectures. A Transformer encoder is stacked with L identical blocks. Each Transformer block mainly contains two modules: a self-attention module, and a feed-forward network (abbreviated as FFN) module. Let X ∈ R^{n×d} denote the input matrix; the two modules can be formulated as follows:

Q_h = X W_h^Q,  K_h = X W_h^K,  V_h = X W_h^V,    (1)
Self-Att_h(X) = softmax(Q_h K_h^T) V_h,           (2)
FFN(H) = gelu(H W_1) W_2,                         (3)
where W_h^Q, W_h^K, W_h^V, W_1, W_2 are parameter matrices.
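Read side by side, eqs. (2) and (3) share the same pattern: an inner-product score, a nonlinearity that turns scores into weights (softmax vs. gelu), and a weighted sum over value vectors. The numpy sketch below is a direct, hedged transcription of the three equations with illustrative shapes (a single head, no attention-scaling factor, and assumed names n, d, d_ff); it is not an implementation from either paper.

```python
import numpy as np

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def gelu(z):
    return 0.5 * z * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (z + 0.044715 * z**3)))

n, d, d_ff = 4, 8, 16                     # illustrative sizes
rng = np.random.default_rng(0)
X = rng.normal(size=(n, d))               # input matrix, X in R^{n x d}

# Eq. (1)-(2): one self-attention head (scaling factor omitted, matching the form above)
W_Q, W_K, W_V = (rng.normal(size=(d, d)) for _ in range(3))
Q, K, V = X @ W_Q, X @ W_K, X @ W_V
self_att = softmax(Q @ K.T) @ V           # softmax(Q_h K_h^T) V_h

# Eq. (3): the FFN follows the same score -> weights -> weighted-sum pattern,
# except that its "keys" (columns of W1) and "values" (rows of W2) are parameters.
W1, W2 = rng.normal(size=(d, d_ff)), rng.normal(size=(d_ff, d))
ffn = gelu(X @ W1) @ W2                   # gelu(H W1) W2, with H = X here

print(self_att.shape, ffn.shape)          # both (n, d)
```

The structural difference is where the keys and values come from: in attention they are projections of the input tokens, while in the FFN they are fixed learned parameters, which is what lets the FFN act as a static key-value memory.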