Slide 5
Slide 5 text
ൃදऀͷཱɾࢹ
• CV ษڧձͰɺҎԼͷจΛհ͍͖ͤͯͨͩ͞·ͨ͠
• “A Hierarchical Approach for Generating Descriptive Image Paragraphs” [Krause et al., CVPR
2017]
• “An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale” [Dosovitskiy
et al., ICLR 2021]
• “Transitional Adaptation of Pretrained Models for Visual Storytelling” [Yu et al., CVPR 2021]
• “Panoptic Narrative Grounding” [González et al., ICCV 2021]
• “GeoDiff: A Geometric Diffusion Model for Molecular Conformation Generation” [Xu et al.,
ICLR 2022]
• “It is Okay to Not be Okay: Overcoming Emotional Bias in Affective Image Captioning by
Contrastive Data Collection” [Mohamed et al., CVPR 2022]
• “Ego-Body Pose Estimation via Ego-Head Pose Estimation” [Li et al., CVPR 2023]
• ͜ͷϖʔδͰݴ͍͍ͨ͜ͱɿ
ʢࣗͷͷ͍͕ɺʣCV ษڧձͰ༷ʑͳจ͕հ͞ΕɺϩάࢀߟʹͳΔʂ
2023/07/23 ୈճ$7ษڧձ!ؔ౦ 5