plant”

Task: Multimodal Language Understanding for Fetching Instruction
“Grasp the glass in the sink”
Transfer: Simulation → Real-world
Binary classification for each object (each candidate object is labeled Pos. or Neg.)
• Introduce domain transfer to multimodal language understanding
• Extend prototypical contrastive loss for classification problems in two domains
  Performs domain transfer based on contrastive learning, inspired by PCL [Li+, ICLR’21]
Related work: MCDDA [Saito+, CVPR’18]
  Domain transfer for single modality (vision) task
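To make the idea of a prototype-based contrastive loss over two domains concrete, here is a minimal sketch in plain Python. It is not the loss used in this work, and it simplifies PCL. It assumes one prototype per class (the mean of normalized embeddings), prototypes pooled over simulation and real-world samples, and a single fixed temperature. All function names, the toy embeddings, and the temperature value are hypothetical.

```python
import math

def l2_normalize(v):
    # Scale a vector to unit length (zero vectors are left unchanged).
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]

def mean_vector(vectors):
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def class_prototypes(embeddings, labels):
    # Assumption: one prototype per class, the normalized mean embedding.
    groups = {}
    for e, y in zip(embeddings, labels):
        groups.setdefault(y, []).append(l2_normalize(e))
    return {y: l2_normalize(mean_vector(vs)) for y, vs in groups.items()}

def prototype_contrastive_loss(embeddings, labels, prototypes, temperature=0.1):
    # Cross-entropy over prototype similarities: each sample is pulled
    # toward its own class prototype and pushed away from the others.
    total = 0.0
    for e, y in zip(embeddings, labels):
        z = l2_normalize(e)
        logits = {c: dot(z, p) / temperature for c, p in prototypes.items()}
        m = max(logits.values())  # subtract the max for numerical stability
        log_denom = m + math.log(sum(math.exp(v - m) for v in logits.values()))
        total += log_denom - logits[y]
    return total / len(embeddings)

# Hypothetical toy embeddings; label 1 = Pos., label 0 = Neg.
sim_emb, sim_lab = [[1.0, 0.1], [0.9, 0.0], [0.0, 1.0]], [1, 1, 0]
real_emb, real_lab = [[0.8, 0.2], [0.1, 0.9]], [1, 0]

# Assumption: prototypes are shared across both domains.
protos = class_prototypes(sim_emb + real_emb, sim_lab + real_lab)
loss_real = prototype_contrastive_loss(real_emb, real_lab, protos)
loss_flipped = prototype_contrastive_loss(real_emb, [0, 1], protos)
print(loss_real < loss_flipped)
```

In this sketch, real-world samples are scored against prototypes that also draw on simulation samples, which is one simple way the two domains could share structure. The correctly labeled samples yield a lower loss than the same samples with flipped labels.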
down the stairs to the lower balcony area and turn off the lamp on the dresser.”
Real-world: from REVERIE [Qi+, CVPR’20], #sample: 10342
Transfer
“Pick up the tissue box on the desk”
Simulation: from ALFRED [Shridhar+, CVPR’20], #sample: 34286
Method                    Train     Test  Acc. [%]
Target Domain Only        Real      Real  73.0±1.87
MCDDA+ [Saito+, CVPR’18]  Sim Real  Real  74.9±3.94
PCTL (Ours)               Sim+Real  Real  78.1±2.49

Improved by domain transfer: +5.1 (PCTL vs. Target Domain Only)
Outperformed existing method: +3.2 (PCTL vs. MCDDA+)
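The two gains quoted with the results can be checked directly from the reported mean accuracies. The short sketch below uses only the numbers on the slide. The dictionary layout and the `gain` helper are illustrative names, not part of the original work.

```python
# Reported accuracy on the real-world test set: (mean, std) in percent.
results = {
    "Target Domain Only": (73.0, 1.87),
    "MCDDA+": (74.9, 3.94),
    "PCTL (Ours)": (78.1, 2.49),
}

def gain(method, baseline):
    # Difference of mean accuracies in percentage points, rounded to 0.1.
    return round(results[method][0] - results[baseline][0], 1)

gain_over_target_only = gain("PCTL (Ours)", "Target Domain Only")
gain_over_mcdda = gain("PCTL (Ours)", "MCDDA+")
print(gain_over_target_only, gain_over_mcdda)
```

Both differences are in percentage points of mean accuracy. The standard deviations are carried along for reference but not used in the subtraction.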
collection by domain transfer
Novelty:
• Introduce domain transfer to multimodal language understanding
• Extend prototypical contrastive loss for classification problems in two domains
Result: Outperformed the target-domain-only condition and the existing domain transfer method