Neural Randomized Planning for Whole Body Robot Motion
Yunfan Lu, Yuchen Ma, David Hsu, Panpan Cai
TL;DR
The paper tackles the challenge of real-time, long-range whole-body motion planning for high-DOF robots in cluttered homes. It introduces Neural Randomized Planner (NRP), which combines a global SBMP backbone (RRT/IRRT*-based) with a locally learned neural sampler to generate optimal waypoints conditioned on local surroundings, then stitches these local samples into a global plan. The local samplers come in discriminative (classification of optimal samples) and generative (CVAE-based sampling) variants and are trained offline from PRM*-generated ground truth; the global planner preserves probabilistic completeness and asymptotic optimality. Across Gibson-simulated and real Fetch robot experiments, NRPs outperform classical and learning-enhanced SBMP methods in success rates and path quality, and demonstrate zero-shot transfer to novel environments without fine-tuning. The work shows that local, locally-conditioned learning integrated with planning yields data-efficient, scalable, and robust whole-body motion in realistic environments, with future directions including fast re-planning and more expressive generative models.
Abstract
Robot motion planning has made vast advances over the past decades, but the challenge remains: robot mobile manipulators struggle to plan long-range whole-body motion in common household environments in real time, because of high-dimensional robot configuration space and complex environment geometry. To tackle the challenge, this paper proposes Neural Randomized Planner (NRP), which combines a global sampling-based motion planning (SBMP) algorithm and a local neural sampler. Intuitively, NRP uses the search structure inside the global planner to stitch together learned local sampling distributions to form a global sampling distribution adaptively. It benefits from both learning and planning. Locally, it tackles high dimensionality by learning to sample in promising regions from data, with a rich neural network representation. Globally, it composes the local sampling distributions through planning and exploits local geometric similarity to scale up to complex environments. Experiments both in simulation and on a real robot show \NRP yields superior performance compared to some of the best classical and learning-enhanced SBMP algorithms. Further, despite being trained in simulation, NRP demonstrates zero-shot transfer to a real robot operating in novel household environments, without any fine-tuning or manual adaptation.
