5BTLT"SJUINFUJD<(*MIBSDP *$-3`> த෦େֶϩΰ த෦େֶϩΰ τ = θft − θpre ϑΝΠϯνϡʔχϯάޙͷॏΈ ࣄલֶशࡁΈϞσϧͷॏΈ Published as a conference paper at ICLR 2023 τ τnew = − τ Published as a conference paper at ICLR 2023 τA τB τnew = τA + τB Published as a conference paper at ICLR 2023 τB τA τnew = τC + (τA − τB ) τC ̍ɽࣝʢλεΫʣͷ٫ ̎ɽࣝʢλεΫʣͷՃࢉ ̏ɽλεΫΞφϩδʔ
த෦େֶϩΰ Published as a conference paper at ICLR 2023 Figure 1: An illustration of task vectors and the arithmetic operations we study for editi (a) A task vector is obtained by subtracting the weights of a pre-trained model from the the same model after fine-tuning (Section 2). (b) Negating a task vector degrades perfo τ τnew = − τ ٫ ඞཁɾෆඞཁͷ྆ํͷࣝΛอ࣋ θ θnew = θ + λτnew ඞཁͳࣝͷΈอ࣋ Ϟσϧ 5BTLT"SJUINFUJD<(*MIBSDP *$-3`>
1: An illustration of task vectors and the arithmetic operations we study for (a) A task vector is obtained by subtracting the weights of a pre-trained model from the same model after fine-tuning (Section 2). (b) Negating a task vector degrades p the task, without substantial changes in control tasks (Section 3). (c) Adding task v improves the performance of the pre-trained model on the tasks under considerati (d) When tasks form an analogy relationship such as supervised and unsupervised l different data sources, it is possible to improve performance on a supervised target vectors from the remaining three combinations of objectives and datasets (Section 5) τA τB τnew = τA + τB ֶʹରԠ θmath ຊޠʹରԠ θjpn ӳޠͰֶशͨ͠--. θ ֶɾຊޠͷ྆ରԠ θnew = θ + λτnew ֶɾຊޠͷ྆ରԠ τmath τjpn λεΫϕΫτϧͷՃࢉ τnew = τmath + τjpn ϕΫτϧͷՃࢉʹΑΔ૯ϊϧϜͷେ͖͞Λௐ͢Δ 5BTLT"SJUINFUJD<(*MIBSDP *$-3`>
<44JOHI /FVS*14`>.PEFM'VTJPOWJB0QUJNBM5SBOTQPSU w <('BOH $713`>%FQHSBQI5PXBSET"OZ4USVDUVSBM1SVOJOH w <&+)V *$-3`>-P3"-PX3BOL"EBQUBUJPOPG-BSHF-BOHVBHF.PEFMT w <$)VBOH $0-.`>-PSB)VC& ff i DJFOU$SPTT5BTL(FOFSBMJ[BUJPOWJB%ZOBNJD-P3"$PNQPTJUJPO ࢀߟจݙ த෦େֶϩΰ த෦େֶϩΰ