Upgrade to Pro — share decks privately, control downloads, hide ads and more …

TensorFlow & DeepMind Lab & UNREAL

TensorFlow & DeepMind Lab & UNREAL

TensorFlowで実装したUNREALアルゴリズムでDeepMind Labの3D迷路を解く

Kosuke Miyoshi

April 20, 2017
Tweet

More Decks by Kosuke Miyoshi

Other Decks in Technology

Transcript

  1. 1PMJDZ К 7ͷޯ഑ R= = = w 7͸3ʹ͚ۙͮΔ༷ʹߋ৽ w 37͕ਖ਼ͳΒɺऔͬͨBDUJPO͕ग़Δ֬཰Λ૿΍༷͢ʹߋ৽


    37͕ෛͳΒɺऔͬͨBDUJPO͕ग़Δ֬཰ΛݮΒ༷͢ʹߋ৽ 
 V network: Policy network: ˞্هͷදهͰ7͸(SBEJFOU%FTDFOU 1PMJDZ͸(SBEJFOU"TDFOUθv = θv - α * dθv, θ = θ + α * dθ 1PMJDZ 7