) - policy - policy ( fully observed ) st ot at - state - observation - action o1 s1 a1 o2 s2 a2 o3 s3 a3 1. Drawing a graphically model to relate state, observation, and action
2. Observing previous observations might give you more information p(st+1 ∣ st , at ) p(st+1 ∣ st , at )