T T T T V(S t ) ← E π R t+1 + γV(S t+1 ) [ ] S t = X a ⇡(a|St) X s0,r p(s0, r|St, a)[r + V (s0)] r a s0 http://incompleteideas.net/609%20dropbox/slides%20(pdf%20and%20keynote)/11-12-TD.pdf
T T T T T T T T T T T V(S t ) ←V(S t )+α R t+1 + γV(S t+1 )−V(S t ) [ ] S t R t+1 S t+1 http://incompleteideas.net/609%20dropbox/slides%20(pdf%20and%20keynote)/11-12-TD.pdf