NeuPAN: Direct Point Robot Navigation with End-to-End Model-based Learning
Ruihua Han, Shuai Wang, Shuaijun Wang, Zeqing Zhang, Jianjun Chen, Shijie Lin, Chengyang Li, Chengzhong Xu, Yonina C. Eldar, Qi Hao, Jia Pan
TL;DR
NeuPAN tackles dense-scenario, mapless navigation for nonholonomic robots by directly mapping raw lidar points to a latent distance feature space and solving an end-to-end perception-control problem with a tightly coupled PAN-based framework. The system combines a Deep Unfolded Neural Encoder (DUNE) that outputs latent distance features with a Neural Regularized Motion Planner (NRMP) that uses a differentiable, learnable optimization layer to generate safe, efficient motions within a receding horizon $H$. Theoretical grounding via strong duality transforms pointwise collision constraints into a convex-regularized objective, enabling real-time performance and interpretability, while experimental results across ground, wheel-legged, and passenger-vehicle platforms show improved accuracy, robustness, and generalization over state-of-the-art baselines. NeuPAN’s end-to-end, model-based learning paradigm yields perception-aware, physically interpretable motions suitable for cluttered, unknown environments, offering practical impact for autonomous systems in home, office, and urban contexts.
Abstract
Navigating a nonholonomic robot in a cluttered, unknown environment requires accurate perception and precise motion control for real-time collision avoidance. This paper presents NeuPAN: a real-time, highly accurate, map-free, easy-to-deploy, and environment-invariant robot motion planner. Leveraging a tightly coupled perception-to-control framework, NeuPAN has two key innovations compared to existing approaches: 1) it directly maps raw point cloud data to a latent distance feature space for collision-free motion generation, avoiding error propagation from the perception to control pipeline; 2) it is interpretable from an end-to-end model-based learning perspective. The crux of NeuPAN is solving an end-to-end mathematical model with numerous point-level constraints using a plug-and-play (PnP) proximal alternating-minimization network (PAN), incorporating neurons in the loop. This allows NeuPAN to generate real-time, physically interpretable motions. It seamlessly integrates data and knowledge engines, and its network parameters can be fine-tuned via backpropagation. We evaluate NeuPAN on a ground mobile robot, a wheel-legged robot, and an autonomous vehicle, in extensive simulated and real-world environments. Results demonstrate that NeuPAN outperforms existing baselines in terms of accuracy, efficiency, robustness, and generalization capabilities across various environments, including the cluttered sandbox, office, corridor, and parking lot. We show that NeuPAN works well in unknown and unstructured environments with arbitrarily shaped objects, transforming impassable paths into passable ones.
