Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Autonomous Agent Path Finding
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
hankeceli
January 19, 2021
0
23
Autonomous Agent PathΒ Finding
hankeceli
January 19, 2021
Tweet
Share
More Decks by hankeceli
See All by hankeceli
Introduction to Data Structures
hankeceli
0
95
Featured
See All Featured
Tips & Tricks on How to Get Your First Job In Tech
honzajavorek
0
450
<Decoding/> the Language of Devs - We Love SEO 2024
nikkihalliwell
1
150
Abbi's Birthday
coloredviolet
2
5.3k
Introduction to Domain-Driven Design and Collaborative software design
baasie
1
640
Kristin Tynski - Automating Marketing Tasks With AI
techseoconnect
PRO
0
190
Designing Powerful Visuals for Engaging Learning
tmiket
0
270
Why Your Marketing Sucks and What You Can Do About It - Sophie Logan
marketingsoph
0
110
Ten Tips & Tricks for a π± transition
stuffmc
0
87
Fantastic passwords and where to find them - at NoRuKo
philnash
52
3.6k
Optimizing for Happiness
mojombo
378
71k
Mozcon NYC 2025: Stop Losing SEO Traffic
samtorres
0
180
Darren the Foodie - Storyboard
khoart
PRO
3
2.8k
Transcript
Autonomous Agent Path Finding
Q-learning β’ β’ β’
Deep Q-learning
Building Steps β° β° β° β° β° β°
Modeling the environment. β’ β’ β’ β’
Goal β’ β’ β’ β’
Goal(cont.)
Setting parameters angle: rotation: pos=(self.car.x, self.car.y): velocity=(velocity_x, velocity_y): sensor1:(sensor1_x, sensor1_y):
sensor2:(sensor2_x, sensor2_y): sensor3:(sensor3_x, sensor3_y): signal1: signal2: signal3:
β’ β’ sand β’ sand Setting parameters(cont.) Figure 2: Sand
array get only 1βs in its first rows, remaining element get 0
Setting parameters(cont.) goal_x: goal_y: xx=(goal_x-self.car.x): yy=(goal_y-self.car.y): orientation:
β’ β’ orientation The input states
β’ β’ sensors β’ β’ The input states(cont.)
β’ β’ β’ β’ The output actions
β’ β’ The output actions(cont.)
β’ β’ β’ β’ The rewards
β’ β’ β’ The rewards(cont.)
REFERENCES β° β°
THANKS! Any questions? β° β°