▫ Repetitive
▫ Incoherent
To address these problems, the following improvements were made:
▪ Intra-attention
▪ New objective function
+ Policy Learning
▪ Teacher Forcing loss L_ml: the maximum-likelihood loss between the output y and the reference y*
▪ Policy Learning loss L_rl: the policy learning loss, computed with a technique called self-critical sequence training
▪ Mixed loss L_mixed
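The three losses listed above can be sketched in a few lines. This is only a minimal illustration, assuming the commonly used self-critical formulation: the reward of a sampled sequence is compared against the reward of a baseline (greedy) sequence, and the mixed loss is a weighted sum of the other two. The function names, the weight `gamma`, and the scalar rewards are hypothetical and do not appear on the slide.

```python
import math

def ml_loss(reference_log_probs):
    """Teacher forcing loss L_ml: negative log-likelihood of the
    reference tokens y*, given per-token log-probabilities."""
    return -sum(reference_log_probs)

def rl_loss(sample_log_probs, sample_reward, baseline_reward):
    """Policy learning loss L_rl (self-critical form, assumed here):
    (r(baseline) - r(sample)) * sum of log p(sampled tokens).
    Minimizing it raises the probability of samples that beat the baseline."""
    return (baseline_reward - sample_reward) * sum(sample_log_probs)

def mixed_loss(l_rl, l_ml, gamma):
    """Mixed loss L_mixed: weighted sum of the two losses.
    gamma is an assumed scaling factor in [0, 1]."""
    return gamma * l_rl + (1.0 - gamma) * l_ml

# Toy per-token probabilities (made-up numbers, for illustration only).
l_ml = ml_loss([math.log(0.5), math.log(0.5)])
l_rl = rl_loss([math.log(0.5), math.log(0.25)],
               sample_reward=0.6, baseline_reward=0.4)
print(l_ml, l_rl, mixed_loss(l_rl, l_ml, gamma=0.5))
```

In an actual model the log-probabilities would come from the decoder and the rewards from an evaluation metric computed on the generated summaries; here they are plain floats so the arithmetic is easy to follow.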
Introduction