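(\theta^{x}_{lm}) + \mathcal{L}_{robust}(\theta_{mt}) + \mathcal{L}_{lm}(\theta^{y}_{lm})

Source sentence: $x = x_1, \dots, x_i, \dots, x_I$, with embeddings $e(x) = e(x_1), \dots, e(x_i), \dots, e(x_I)$
Target sentence: $y = y_1, \dots, y_j, \dots, y_J$
Decoder input: $z = z_1, \dots, z_j, \dots, z_J$, with embeddings $e(z) = e(z_1), \dots, e(z_j), \dots, e(z_J)$
Translation model: $P(y \mid x; \theta_{mt}) = \prod_{j=1}^{J} P(y_j \mid z_{\le j}, h; \theta_{mt})$

AdvGen: replace a fixed proportion of the words
・ Restrict the candidate vocabulary based on $Q$
・ Choose the replacement word based on the translation error

$Q_{src}(x_i, x) = P_{lm}(x_i \mid x_{<i}, x_{>i}; \theta^{x}_{lm})$
$Q_{trg}(z_i, z) = \lambda P_{lm}(z_i \mid z_{<i}, z_{>i}; \theta^{y}_{lm}) + (1 - \lambda) P(z_i \mid z_{<i}, x'; \theta_{mt})$
$\bullet'_i = \arg\max_{\bullet \in \mathcal{V}_{\bullet_i}} \mathrm{sim}\big(e(\bullet) - e(\bullet_i),\ \nabla_{e(\bullet_i)}(-\log P(y \mid \bullet; \theta_{mt}))\big)$, where $\bullet \in \{x, z\}$ and $\mathcal{V}_{\bullet_i}$ is the candidate vocabulary restricted by $Q$

Adversarial words and their embeddings: $x'_i$, $e(x'_i)$ on the source side; $z'_j$, $e(z'_j)$ on the decoder side

$\mathcal{L}_{clean}(\theta_{mt}) = \frac{1}{|S|} \sum_{(x, y) \in S} -\log P(y \mid x; \theta_{mt})$
$\mathcal{L}_{robust}(\theta_{mt}) = \frac{1}{|S|} \sum_{(x, y) \in S} -\log P(y \mid x', z'; \theta_{mt})$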
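The two AdvGen steps above (restrict candidates with Q, then pick the word whose embedding shift best aligns with the loss gradient) can be sketched in a few lines of plain Python. This is only an illustrative sketch, not the authors' implementation: the function names (`q_trg`, `top_n_candidates`, `pick_replacement`), the use of cosine similarity for sim, the top-n cutoff, and the toy embeddings, Q scores, and gradient are all assumptions made for the example. In a real system the gradient would come from backpropagating $-\log P(y \mid x; \theta_{mt})$ to the embedding of the word being replaced.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u > 0 and norm_v > 0 else 0.0

def q_trg(p_lm, p_mt, lam):
    """Interpolate LM and translation-model probabilities, as in Q_trg."""
    return {w: lam * p_lm[w] + (1.0 - lam) * p_mt[w] for w in p_lm}

def top_n_candidates(q_scores, n):
    """Step 1: keep the n words with the highest Q score (assumed cutoff rule)."""
    return sorted(q_scores, key=q_scores.get, reverse=True)[:n]

def pick_replacement(word, grad, embed, q_scores, n):
    """Step 2: among the candidates, return the word maximising
    sim(e(candidate) - e(word), grad). Returns `word` if no candidate remains."""
    base = embed[word]
    best, best_sim = word, -math.inf
    for cand in top_n_candidates(q_scores, n):
        if cand == word:
            continue  # replacing a word with itself is not a perturbation
        diff = [c - b for c, b in zip(embed[cand], base)]
        sim = cosine(diff, grad)
        if sim > best_sim:
            best, best_sim = cand, sim
    return best

# Toy data (hypothetical): 2-d embeddings, Q scores, and a loss gradient
# with respect to the embedding of "good".
embed = {"good": [1.0, 0.0], "great": [1.0, 1.0],
         "bad": [-1.0, 0.0], "table": [0.0, -1.0]}
q_scores = {"great": 0.5, "bad": 0.3, "good": 0.19, "table": 0.01}
grad_good = [0.0, 1.0]

chosen = pick_replacement("good", grad_good, embed, q_scores, n=3)
print(chosen)  # great

mixed = q_trg({"a": 0.6, "b": 0.4}, {"a": 0.2, "b": 0.8}, lam=0.5)
```

With n=3 the word "table" is filtered out by Q before the gradient is consulted, so the gradient-based choice is made only among plausible substitutes. Applying `pick_replacement` to a sampled fraction of positions would give $x'$ (and, with `q_trg` scores, $z'$) for the robust loss.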