Slide 1

Slide 1 text

+FUTPO/BOPͰಈ͘ਂ૚ڧԽֶशΛ࢖ͬͨ ϥδίϯ޲͚ࣗಈӡసιϑτ΢ΣΞͷ঺հ *P5"-(:"/प೥*P5ࡇ !NBTBUP@LB

Slide 2

Slide 2 text

͋͞ʂ"*3$$BSΛ࢝ΊΑ͏ ࢢൢϥδίϯ4#$ "*ͱΧϝϥͷΈ -&7&-૬౰ͷ૸ߦ ੈքதͰϨʔε͕։࠵͞Ε͍ͯΔ ৄ͘͠͸IUUQNBTBUPLBIBUFOBCMPHDPN

Slide 3

Slide 3 text

"*3$$BS૸ߦͷ࢓૊Έ 4FMG%SJWJOH $JSDVJU ……….. ڭࣔσʔλͷऩू "*ͷֶश ਪ࿦૸ߦ "*Λ࢖͍ೝ஌͔Β൑அ·Ͱ&OEUP&OEͰ૸ߦ͢Δ ৄ͘͠͸IUUQNBTBUPLBIBUFOBCMPHDPN

Slide 4

Slide 4 text

"*3$$BSϨʔεͰউͭͨΊʹʂ ϋʔυ΢ΣΞ "*$7੍ޚ ύΠϩοτదੑ ڭࣔσʔλ͕উഊΛେ͖͘࡞༻͢Δ ৄ͘͠͸IUUQNBTBUPLBIBUFOBCMPHDPN

Slide 5

Slide 5 text

͓खຊͳ͠ʹ૸ΓํΛֶͿਂ૚ڧԽֶश ࢼߦͱใु͔Βྑ͍૸Γͷಛ௃ΛֶͿ 4FMG%SJWJOH $JSDVJU ΞΫγϣϯ Environment AI͕σʔλΛ ूΊΔɻ ※ใुͷ૯࿨͕࠷େͱͳΔΑ͏ͳಈ͖ํΛ୳ࡧ͍ͯ͠Δɻ ใु,ঢ়ଶ ৄ͘͠͸IUUQNBTBUPLBIBUFOBCMPHDPN Jetson NanoͰ10෼ʙ20෼Ͱ૸ߦํ๏Λ֫ಘ

Slide 6

Slide 6 text

࣮૷ͨ͠ιϑτ΢ΣΞͷಛ௃ JETSON NANO SAC VAE Haarnoja, T., Zhou, A., Abbeel, P., and Levine, S. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. arXiv preprint arXiv:1801.01290, 2018. ΤοδͰਂ૚ڧԽֶशΛ͢ΔͨΊͷ޻෉ Antonin Raffin, Learning to Drive Smoothly in Minutes Reinforcement Learning on a Small Racing Car , 2019. ৄ͘͠͸IUUQNBTBUPLBIBUFOBCMPHDPN

Slide 7

Slide 7 text

ιϑτͷެ։ͱ൓ڹ (JUIVCͷ3&"%.&͸ӳޠ (PPHMF຋༁ Ͱެ։ ৄ͘͠͸IUUQNBTBUPLBIBUFOBCMPHDPN /7*%*"͞Μͷ+FUTPO$PNNVOJUZ1SPKFDUT IUUQTEFWFMPQFSOWJEJBDPNFNCFEEFEDPNNVOJUZKFUTPOQSPKFDUTSFJOGPSDFNFOU@KFUCPU

Slide 8

Slide 8 text

໨ࢦͤόʔνϟϧϨʔε ੈքதͷϢʔβ͕ࢀՃͯ͠ΦϯϥΠϯͰରઓ γϛϡϨʔλΛ࢖ͬͨϨʔε ΦϯϥΠϯͰϦΞϧλΠϜରઓ ੈքνϟϯϐΦϯ͸೔ຊਓ ৄ͘͠͸IUUQNBTBUPLBIBUFOBCMPHDPN

Slide 9

Slide 9 text

·ͱΊ "*3$$BSΛ࢝ΊͯΈ·ͤΜ͔ʁ "*Ͱܭଌ੍ޚΛֶͿ"*3$$BS ΤοδσόΠεͰਂ૚ڧԽֶश ੈքͷίϛϡχςΟͱܨ͕Ζ͏ ৄ͘͠͸IUUQNBTBUPLBIBUFOBCMPHDPN

Slide 10

Slide 10 text

No content