iPolicy: Incremental Policy Algorithms for Feedback Motion Planning
Guoxiang Zhao, Devesh K. Jha, Yebin Wang, Minghui Zhu
TL;DR
This work develops iPolicy, an incremental, policy-based motion-planning framework that couples sampling-based graph construction with set-valued dynamic programming to synthesize feedback controllers for dynamical robots. By transforming the minimal travel time into a Kruzhkov-transformed value function $\varTheta$, iPolicy performs asynchronous value iterations on an expanding graph to converge to $\varTheta^*$ with probability one, leveraging contraction properties over both fixed and evolving graphs. Theoretical analysis establishes convergence under specific resolution and scheduling assumptions, and experiments on a point-mass, a simple car, and a Dubins car demonstrate effective, anytime improvement and robustness to obstacles. A computation-saving variant and extensive numerical results illustrate practical scalability and the potential for acceleration via parallelism and learning-based techniques in future work.
Abstract
This paper presents policy-based motion planning for robotic systems. The motion planning literature has been mostly focused on open-loop trajectory planning which is followed by tracking online. In contrast, we solve the problem of path planning and controller synthesis simultaneously by solving the related feedback control problem. We present a novel incremental policy (iPolicy) algorithm for motion planning, which integrates sampling-based methods and set-valued optimal control methods to compute feedback controllers for the robotic system. In particular, we use sampling to incrementally construct the state space of the system. Asynchronous value iterations are performed on the sampled state space to synthesize the incremental policy feedback controller. We show the convergence of the estimates to the optimal value function in continuous state space. Numerical results with various different dynamical systems (including nonholonomic systems) verify the optimality and effectiveness of iPolicy.
