Slide 13
Slide 13 text
• ैདྷݚڀɺͷධՁͷਝͳߋ৽ʹযΛ͍ͯͯͨ
• ݮਰɺΟϯυɺมԽݕग़ɺঢ়ଶۭؒ
• ͜ΕΒɺมԽޙͷใु͔ΒͷҰఆͷใुαϯϓϧ͕ඞཁ
• ͋Δ࣌ظʹ༗ޮੑͷ͔ͬͨɺͦͦબఆ͞Εͳ͍ͨΊɺධՁͷߋ৽͕
͍͠ɻ
• ͜ͷ՝औΓΜͩઌߦݚڀ[1][2]
ͰɺҰఆͷׂ߹Ͱ୳ࡧ༻ͷࢼߦػձΛ֬
อ͍ͯ͠Δɻ
13
ඇఆৗͳଟόϯσΟοτʹ͓͚ΔมԽͷ
• [1] Fang Liu, Joohyun Lee, and Ness Shroff. 2018. A change-detection based framework for piecewise-stationary multi-armed bandit problem. In Proceedings of the AAAI Conference on Artificial
Intelligence, Vol. 32.
• [2] Yang Cao, Zheng Wen, Branislav Kveton, and Yao Xie. 2019. Nearly optimal adaptive procedure with change detection for piecewise-stationary bandit. In The 22nd International Conference on
Artificial Intelligence and Statistics. PMLR, 418–427.