Generalizable Motion Planning via Operator Learning
Sharath Matada, Luke Bhan, Yuanyuan Shi, Nikolay Atanasov
TL;DR
This work reframes motion planning as learning the solution operator of an Eikonal PDE, mapping environment cost functions to optimal value functions $V(\bm{x})$ via a neural operator. The authors introduce the Planning Neural Operator (PNO), a resolution-invariant architecture that encodes obstacle geometry and enforces goal-position generalization through a triangle-inequality projection layer, enabling zero-shot super-resolution and cross-environment planning (including 3D iGibson and 4-DOF manipulators). A theoretical result guarantees the existence of a neural-operator approximation $\hat{\Psi}$ to the Eikonal solution operator, with an error bound $\epsilon$, under viscosity-solution and compactness assumptions. Empirically, PNO achieves fast, accurate value-function predictions at resolutions far beyond training (e.g., $16\times$ training resolution), outperforms several baselines on grid-world and real-world maps, and can serve as an $\epsilon$-consistent neural heuristic to substantially reduce node expansions in $A^*$ while preserving near-optimal paths. These advances enable efficient, generalizable motion planning in complex environments and offer practical impact for real-time robotic planning and navigation in changing scenes.
Abstract
In this work, we introduce a planning neural operator (PNO) for predicting the value function of a motion planning problem. We recast value function approximation as learning a single operator from the cost function space to the value function space, which is defined by an Eikonal partial differential equation (PDE). Therefore, our PNO model, despite being trained with a finite number of samples at coarse resolution, inherits the zero-shot super-resolution property of neural operators. We demonstrate accurate value function approximation at $16\times$ the training resolution on the MovingAI lab's 2D city dataset, compare with state-of-the-art neural value function predictors on 3D scenes from the iGibson building dataset and showcase optimal planning with 4-DOF robotic manipulators. Lastly, we investigate employing the value function output of PNO as a heuristic function to accelerate motion planning. We show theoretically that the PNO heuristic is $ε$-consistent by introducing an inductive bias layer that guarantees our value functions satisfy the triangle inequality. With our heuristic, we achieve a $30\%$ decrease in nodes visited while obtaining near optimal path lengths on the MovingAI lab 2D city dataset, compared to classical planning methods ($A^\ast$, $RRT^\ast$).
