iA*: Imperative Learning-based A* Search for Path Planning
Xiangyu Chen, Fan Yang, Chen Wang
TL;DR
The paper tackles the efficiency and generalization limits of traditional A* and supervised learning-based path planning. It proposes Imperative A* (iA*), a self-supervised bilevel framework that couples a neural cost predictor with a differentiable A* search, enabling end-to-end gradient-based training without labeled paths. By formulating a three-component BLO framework (upper-level encoder, differentiable lower-level search, and memory bridge) and using a self-supervised loss derived from actual lower-level outcomes, iA* achieves a better balance between search area and path length while improving generalization. Experimental results across MP, Maze, Matterport3D, and simulated environments demonstrate substantial reductions in search area and runtime and improved robustness to unseen maps, highlighting iA*’s practical impact for robot navigation.
Abstract
Path planning, which aims to find a collision-free path between two locations, is critical for numerous applications ranging from mobile robots to self-driving vehicles. Traditional search-based methods like A* search guarantee path optimality but are often computationally expensive when handling large-scale maps. While learning-based methods alleviate this issue by incorporating learned constraints into their search procedures, they often face challenges like overfitting and reliance on extensive labeled datasets. To address these limitations, we propose Imperative A* (iA*), a novel self-supervised path planning framework leveraging bilevel optimization (BLO) and imperative learning (IL). The iA* framework integrates a neural network that predicts node costs with a differentiable A* search mechanism, enabling efficient self-supervised training via bilevel optimization. This integration significantly enhances the balance between search efficiency and path optimality while improving generalization to previously unseen maps. Extensive experiments demonstrate that iA* outperforms both classical and supervised learning-based methods, achieving an average reduction of 65.7\% in search area and 54.4\% in runtime, underscoring its effectiveness in robot path planning tasks.
