Slide 152
Slide 152 text
ݚڀ֓ཁ: ଟ༷͔ͭܧଓతʹมԽ͢ΔڥʹదԠ͢ΔใγεςϜ
എܠͱత ՝
Ռ
[1] ࡾ ༔հ, ็ ߃ݑ, Synapse: จ຺ʹԠͯ͡ܧଓతʹਪનख๏ͷબΛ࠷దԽ͢Δਪન
γεςϜ, ిࢠใ௨৴ֶձจࢽD, Vol.J103-D, No.11, pp.764-775, Nov 2020.
[2] ࡾ ༔հ, ็ ߃ݑ, Synapse: จ຺ͱ࣌ؒܦաʹԠͯ͡ਪનख๏ͷબΛ࠷దԽ͢Δϝ
λਪનγεςϜ, ిࢠใ௨৴ֶձจࢽD, Vol.J105-D, No.11, pp.641-652, Nov. 2022.
[3] Yusuke Miyake, Tsunenori Mine, Contextual and Nonstationary Multi-armed Bandits
Using the Linear Gaussian State Space Model for the Meta-Recommender System, 2023
IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp.3138-3145,
Oct 2023.
[4] Yusuke Miyake, Ryuji Watanabe, Tsunenori Mine, Online Nonstationary and Nonlinear
Bandits with Recursive Weighted Gaussian Process, The 48th IEEE International
Conference on Computers, Software, and Applications (COMPSAC 2024) (to appear)
1. ܧଓతʹબΛ࠷దԽ͢ΔใγεςϜͷઃܭ [1]
՝1ʹ͍ͭͯɺਪનγεςϜΛࡐʹɺҎԼͰఏҊ͢ΔػցֶशϞσϧͷಛੑΛߟྀͨ͠ଟόϯσΟοτͷํࡦʹΑͬͯɺࣗಈత͔ͭܧଓతʹબΛ
࠷దԽ͢ΔదԠܕใγεςϜج൫ΛઃܭɾධՁͨ͠ɻ
2. ଟ༷͔ͭܧଓతʹมԽ͢ΔڥͷదԠ [2]
՝2ʹ͍ͭͯɺཻࢠϑΟϧλΛ༻͍ͨจ຺͖͔ͭඇఆৗͳଟόϯσΟοτํࡦΛఏҊ͠ɺଟ༷͔ͭܧଓతʹมԽ͢ΔڥͷదԠྗ্Λ࣮ݱͨ͠ɻ
3. దԠͷߴԽ [3]
՝2ɾ3ʹ͍ͭͯɺઢܗΧϧϚϯϑΟϧλΛ༻͍ͨจ຺͖͔ͭඇఆৗͳଟόϯσΟοτํࡦͷఏҊ͠ɺదԠͷߴԽΛ࣮ݱͨ͠ɻ
4. ඇઢܗੑͷରԠ [4]
՝2ɾ3ɾ4ʹ͍ͭͯɺॏΈ͖ஞ࣍ΨεաఔճؼΛ༻͍ͨඇఆৗ͔ͭඇઢܗͳจ຺͖ଟόϯσΟοτํࡦΛఏҊ͠ɺඇઢܗͳઃఆʹ͓͚Δඇఆৗ
ੑͷରԠͱॲཧͷ্Λ࣮ݱͨ͠ɻ
1. ࣮ڥͰͷධՁʹΑΔػձଛࣦ
ैདྷͷଟόϯσΟοτͷํࡦΛ༻͍ͯϞσϧબͷ࠷దԽΛਤΔใγεςϜͰɺػցֶश
ϞσϧͷಛੑΛߟྀͰ͖ͣɺػձଛࣦΛेʹ͑ΒΕͳ͍ɻ
2. จ຺࣌ؒͷܦաʹΑΔ༗༻ੑͷมಈ
ػցֶशϞσϧͷ༗༻ੑɺར༻ऀγεςϜͷঢ়گʹΑͬͯɺ·ͨಉ͡ঢ়گͰ͋ͬͯ࣌ؒͷܦա
ʹΑͬͯมಈ͢Δɻ
3. దԠʹ͏͕࣌ؒٴ΅͢༗༻ੑͷӨڹ
จ຺࣌ؒͷܦաΛߟྀͨ͠ܧଓతͳൺֱධՁͷΈͷಋೖɺԠʹΕΛҾ͖ى͜͢ɻ
4. ༗༻ੑͷਪఆʹ͓͚Δෳࡶͳؔੑ
จ຺ͱػցֶशϞσϧͷ༗༻ੑͷؒʹɺඇઢܗͳؔੑ͋ΓಘΔɻ
1. ใγεςϜͱڥมԽ
ଟ༷͔ͭܧଓతʹมԽ͢ΔڥͷதͰɺใγεςϜ͕ܧଓతʹػೳ͢ΔʹɺैདྷͷਓखʹΑΔӡ༻
Ͱͳ͘ɺࣗಈԽ͞ΕͨదԠػߏͷ࣮ݱ͕՝ʹͳΔɻ
2. ڥมԽʹࣗΒదԠ͢ΔใγεςϜ
దԠܕใγεςϜͷ࣮ݱʹɺσʔλ͔Βಈతʹಈ࡞Λઃܭ͢ΔػցֶशϞσϧͱͷ౷߹͕ෆՄܽͰ
͋Δ͕ɺͲͷػցֶशϞσϧ͕ਅʹޮՌతͰ͋Δ͔Λ༧ΊΔ͜ͱ͍͠ɻ
3. બͷ࠷దԽ
࣮ڥͰͷධՁʹΑΔػցֶशϞσϧͷબͰɺظతͳධՁʹΑΔػձଛࣦ࠷దͳϞσϧΛݟಀ
͢ϦεΫ͕͏ͨΊɺ͜ͷબաఔΛ࠷దԽ͠ػձଛࣦΛܰݮ͢ΔΈ͕ٻΊΒΕΔɻ
બͷ࠷దԽͷ՝
దԠܕใγεςϜͷ࣮ݱʹ͚ͨબ
ͷ࠷దԽ
બͷ࠷దԽͷ՝ͷղܾ
IUUQTJDPOTDPN