winPIBT: Extended Prioritized Algorithm for Iterative Multi-agent Path Finding

Slide 1

Slide 1 text

winPIBT Extended Prioritized Algorithm for Iterative Multi-Agent Path Finding Keisuke Okumura, Yasumasa Tamura & Xavier Defago 7th Jan. 2021 IJCAI-20 Workshop on MAPF Tokyo Institute of Technology, Japan ౦ژ޻ۀେֶ 5PLZP*OTUJUVUFPG5FDIOPMPHZ virtual conference

Slide 2

Slide 2 text

/28 2 MAPF: Multi-Agent Path Finding given agents (starts) graph goals solution paths without collisions

Slide 3

Slide 3 text

/28 3 Motivation winPIBT: extension of PIBT [okumura+ IJCAI-19] high low result of PIBT ideal anticipating a single-step ahead make PIBT anticipating multi-steps ahead time-window

Slide 4

Slide 4 text

/28 4 PIBT Priority Inheritance with Backtracking [Okumura+ IJCAI-19] solving MAPF iteratively quick & scalable adaptivity for decentralization 500 agents within 0.5 sec Applicable to Multi-agent Pickup & Delivery [Ma+ AAMAS-17] sub-optimal

Slide 5

Slide 5 text

/28 5 locations at t=1 t=2 t=3 repeat one-timestep prioritized planning high low mid How PIBT works – 1/6 … 1 2 3 4 5 6 7 8 9 decision order time-window

Slide 6

Slide 6 text

/28 6 How PIBT works – 2/6 high low mid stuck

Slide 7

Slide 7 text

/28 7 How PIBT works – 3/6 high low mid as high priority inheritance [Sha+ 1990] +backtracking (skip)

Slide 8

Slide 8 text

/28 8 How PIBT works – 4/6 high as high as high as high as high stuck

Slide 9

Slide 9 text

/28 9 How PIBT works – 5/6 invalid valid re-plan re-plan valid You can move invalid You must re-plan, I will stay introduce backtracking

Slide 10

Slide 10 text

/28 10 In biconnected graphs the agent with highest priority can move to any adjacent locations reachability: all agents reach their goals within finite timesteps +dynamic priorities How PIBT works – 6/6

Slide 11

Slide 11 text

/28 11 one-shot MAPF reachability ≠ completeness lifelong scenarios unsolvable, but we can ensure reachability

Slide 12

Slide 12 text

/28 12 winPIBT

Slide 13

Slide 13 text

/28 13 A B C D E F G H E C collision- free potentially collide! E F G H D C G must plan additional paths Paths with Different Lengths – 1/2

Slide 14

Slide 14 text

/28 14 A B C D E F G H E C collision- free safe E A B C D C G G G G can stay at the last location two paths are isolated Paths with Different Lengths – 2/2

Slide 15

Slide 15 text

/28 15 Revisit PIBT A B D E C F high D C F F E D two paths will not be isolated => priority inheritance as high

Slide 16

Slide 16 text

/28 16 Concept of winPIBT repeat until terminate + dynamic priorities In biconnected graphs, winPIBT ensures reachability plan an arbitrary-length path for one agent reserve nodes in the path sequentially when violating the isolation with another agent, make this agent plan a single-step ahead using priority inheritance so that two paths are isolated 1. 2. 3.

Slide 17

Slide 17 text

/28 17 Example of winPIBT high t=0 low t=0 mid t=0 time-window:3 locations at t=1 t=2 t=3 high low mid 1 t=1 goals

Slide 18

Slide 18 text

/28 18 Example time-window:3 locations at t=1 t=2 t=3 high low mid 1 2 t=2 goals low t=0 mid t=0 high t=1

Slide 19

Slide 19 text

/28 19 Example time-window:3 locations at t=1 t=2 t=3 high low mid 1 2 t=3 violate, priority inheritance goals low t=0 mid t=0 high t=2 3

Slide 20

Slide 20 text

/28 20 Example time-window:3 locations at t=1 t=2 t=3 high low mid 1 4 2 3 goals high t=2 low t=0 mid t=0 t=3 t=1

Slide 21

Slide 21 text

/28 21 Example time-window:3 locations at t=1 t=2 t=3 high low mid 1 4 2 3 goals high t=3 low t=1 mid t=0 5 t=1

Slide 22

Slide 22 text

/28 22 Example time-window:3 locations at t=1 t=2 t=3 high low mid 1 4 2 3 goals high t=3 low t=1 mid t=1 5 t=2 violate, priority inheritance 6

Slide 23

Slide 23 text

/28 23 Example time-window:3 locations at t=1 t=2 t=3 high low mid 1 4 2 3 goals high t=3 low t=1 mid t=1 5 t=2 t=2 6 7

Slide 24

Slide 24 text

/28 24 Example time-window:3 locations at t=1 t=2 t=3 high low mid 1 4 2 3 goals high t=3 low t=2 mid t=2 5 t=3 6 7 8

Slide 25

Slide 25 text

/28 25 Example time-window:3 locations at t=1 t=2 t=3 high low mid 1 4 2 3 goals high t=3 low t=2 mid t=3 5 6 7 8 t=3 9

Slide 26

Slide 26 text

/28 26 Example time-window:3 locations at t=1 t=2 t=3 high low mid 1 4 2 3 goals high t=3 low t=3 mid t=3 5 6 7 8 9

Slide 27

Slide 27 text

/28 27 Evaluation with fixed time-windows

Slide 28

Slide 28 text

/28 28 agents cost per agent 2 winPIBT-30 3 10 PIBT agents makespan 2 winPIBT-30 3 10 PIBT empty-32-32 25 instances One-shot MAPF the effect is not significant in open space

Slide 29

Slide 29 text

/28 29 agents runtime (sec) 2 winPIBT-30 3 10 PIBT agents success instances 2 winPIBT-30 3 10 PIBT empty-32-32 25 instances One-shot MAPF c.f., reachability ≠ completeness winPIBT takes time

Slide 30

Slide 30 text

/28 30 Naïve Iterative MAPF kiva-like 2000 targets 10 repetitions agents service time 50 30 5 10 PIBT agents runtime (sec) 50 30 5 10 PIBT longer windows need time winPIBT: good

Slide 31

Slide 31 text

/28 31 Concluding Remarks develop adaptive windows future directions decentralization, c.f., our latest work [Okumura+ AAAI-21] winPIBT to overcome short-sighted behavior of PIBT motivation proposal target solving MAPF iteratively, c.f., reachability