Slide 1

Slide 1 text

The “Like-Me” Hypothesis Self-experience allows infants to interpret the act of other Self-experience plays an important role in goal inference and imitation Meltzoff, Dev Sci, 2007
 Meltzoff, Acta Psychologica, 2007 Computational Model: Probabilistic instantiation of “Like-me” hypothesis

Slide 2

Slide 2 text

Computational Model : goal : action : start state : final state Friesen & Rao, Cog Sci, 2011

Slide 3

Slide 3 text

Computational Model: Gaze Following : desired fixation location : head motor commands : start head pose : final head pose Friesen & Rao, Cog Sci, 2011

Slide 4

Slide 4 text

Computational Model: Gaze Following : desired fixation location : head motor commands : start head pose : final head pose Random Variables are Continuous Relations are Probabilistic Functions

Slide 5

Slide 5 text

Probabilistic Functions: Gaussian Processes (GP) ‣ nonparametric ‣ probabilistic distribution over functions Effective with small numbers of training examples

Slide 6

Slide 6 text

Modeling Relations Transition model: Learned from “Exploration” or “Body Babbling”

Slide 7

Slide 7 text

Modeling Relations Policy model: Learned from “Rejection sampling” “Planning via Probabilistic Inference” (Verma & Rao, NIPS, 2005)

Slide 8

Slide 8 text

Goal-directed Planning G A B Xi Xf Given goal and start state, infer action. Closed form solutions.

Slide 9

Slide 9 text

Goal Inference G A B Xi Xf Given start state and final state, infer goal. Approximate via Bayesian Monte Carlo.

Slide 10

Slide 10 text

Gaze Following Gm Am Bm Xm i Xm f A G B Xi X f change of coords. Given: mentor start state, mentor final state, start state
 Infer: mentor goal, action OTHER SELF Infer goal

Slide 11

Slide 11 text

Model Simulation 0 200 400 600 −600 −400 −200 0 200 400 x position (cm) y position (cm) Agent Mentor true fixation points inferred fixation points 500 -500 500 100

Slide 12

Slide 12 text

Model Performance 0 200 400 600 −600 −400 −200 0 200 400 x position (cm) y position (cm) Agent Mentor true fixation points inferred fixation points

Slide 13

Slide 13 text

Blindfold Self-Experience Task Gm Am Bm Xm i Xm f A G B Xi X f change of coords.

Slide 14

Slide 14 text

Blindfold Self-Experience Task Baseline Window Opaque 0 1 2 3 4 Looking Score (a) Computational Model (b) Infant Data AGENT (simulation results) INFANT (actual behavior data) Friesen & Rao, Cog Sci, 2011

Slide 15

Slide 15 text

Computational Model: Robotic Tabletop Manipulation : desired tabletop position : control commands : start object location : final object location Random Variables are Discrete Relations are Probability Tables

Slide 16

Slide 16 text

Robotic Tabletop Manipulation Task Chung et al, Tech report, 2014

Slide 17

Slide 17 text

Learning: Transition Model Transition model:

Slide 18

Slide 18 text

Learning: Policy Model Policy model:

Slide 19

Slide 19 text

Goal-Based Imitation and Action Selection Step1: Goal-Inference Step II: Action-Inference Step III:
 Final State Prediction Gm Am Bm Xm i Xm f A G B Xi X f change of coords.

Slide 20

Slide 20 text

Learned Transition Model Place Right Place Off-table Push Right Push Off-table Place Right Place Off-table Push Right Push Off-table

Slide 21

Slide 21 text

Learned Transition Model Place Right Place Off-table Push Right Push Off-table Place Right Place Off-table Push Right Push Off-table

Slide 22

Slide 22 text

Learned Transition Model Place Right Place Off-table Push Right Push Off-table Place Right Place Off-table Push Right Push Off-table

Slide 23

Slide 23 text

Goal Inference Goal ✴MAP Prediction

Slide 24

Slide 24 text

Action Inference Place Left Place Right Place Off Push Left Push Right Push Off Place Left Place Right Place Off Push Left Push Right Push Off Goal ✴MAP Prediction Demonstration: Xi = LEFT, Xf = RIGHT

Slide 25

Slide 25 text

Final State Prediction

Slide 26

Slide 26 text

Imitation Results

Slide 27

Slide 27 text

A Bayesian Developmental Approach to Goal-Based Imitation Learning Michael Jae-Yoon Chung, Abram L. Friesen,
 Dieter Fox, Andrew N. Meltzoff, and Rajesh P.N. Rao University of Washington HRI2015 Workshop Cognition: A Bridge between robotics and interaction

Slide 28

Slide 28 text

Imitation Learning in Humans Not mere trajectory following. Imitation based on goal inference. Demonstration Goal-based imitation Infants above 1.5 years of age can imitate action even from an unsuccessful demonstration (Meltzoff & Brook 1998)

Slide 29

Slide 29 text

Another Example: Gaze Following and Blindfolds 12 month olds Meltzoff & Brooks, Dev Psych Blind fold experience no training after training

Slide 30

Slide 30 text

The “Like-Me” Hypothesis Self-experience allows infants to interpret the act of others. Self-experience plays an important role in goal inference and imitation. Computational Model: Probabilistic instantiation of “Like-me” hypothesis. Meltzoff, Dev Sci, 2007 Metlzoff, Acta Psychologica, 2007

Slide 31

Slide 31 text

Model Performance 0 10 20 30 40 0 0.05 0.1 0.15 0.2 gaze error (degrees) p(error) −40 −20 0 0 0.05 0.1 0.15 0.2 gaze error (degrees) p(error) −40 −20 0 0 0.05 0.1 0.15 0.2 gaze error (degrees) p(error) (a) Forward Inference (b) Reverse Inference (c) Gaze Following