Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Metalearning shared Hierarchy

Metalearning shared Hierarchy

Metalearning shared Hierarchy

논문 review

Wonseok Jung

August 28, 2018
Tweet

More Decks by Wonseok Jung

Other Decks in Science

Transcript

  1. 1.2 SOLVE EACH TASK INDEPENDENTLY AND FROM SCRATCH SUPERMARIO WITH

    R.L https://www.youtube.com/watch?v=IjvbhwuCaF0
  2. 2.1 NOTATION Time step Action Transition Function Reward Set of

    states Set of actions Start state Discount factor t a P(s′, r ∣ s, a) r A S S0 γ Set of reward
 
 Policy Reward State R π r REINFORCEMENT LEARNING s
  3. 2.2 NOTATION META LEARNING SHARED HIERARCHIES EJTUSJCVUJPOPWFS.%1T "HFOUחQBSBNFUFSWFDUPSܳ઱ӝ੸ਵ۽VQEBUFೠ׮  పझ௼ٜՙܻҕਬೞח౵ۄ޷ఠ੄૘೤

      пపझ௼౵ۄ޷ఠ੄૘೤ 
 BHFOUоഅ੤పझ௼.ਸߓ਋ݴসؘ੉౟ೞח౵ۄ޷ఠ  PM πθ,ϕ(a∣s) ϕ θ
  4. "DUJPO "HFOU &OWJSPONFOU 3FXBSE At Rt 4UBUF St Rt+1 St+1

    REINFORCEMENT LEARNING 2.3 OBJECTIVE MDP
  5. REINFORCEMENT LEARNING 2.4 NEW MDP &OWJSPONFOU 3FXBSE At Rt St

    Rt+1 St+1 5BQUIFCBMM 1PTJUJWF3FXBSE
 New MDP
  6. SUPERMARIO WITH R.L 2.5 NEW MDP-2 "DUJPO "HFOU &OWJSPONFOU 3FXBSE

    At Rt 4UBUF St Rt+1 St+1 3FXBSE  1FOBMUZ Another New MDP