Slide 14
Related Work (3)
• CNN + Self-attention
  • augmenting feature maps for image classification [Bello et al., 2019]
  • further processing the output of a CNN using self-attention
    • object detection [Hu et al., 2018; Carion et al., 2020]
    • video processing [Wang et al., 2018; Sun et al., 2019]
    • image classification [Wu et al., 2020]
    • unsupervised object discovery [Locatello et al., 2020]
    • unified text-vision tasks [Chen et al., 2020c; Lu et al., 2019; Li et al., 2019]
2021/4/18 14