RoboArm-NMP: a Learning Environment for Neural Motion Planning
Tom Jurgenson, Matan Sudry, Gal Avineri, Aviv Tamar
TL;DR
RoboArm-NMP introduces a unified learning and evaluation environment for neural motion planning in a 7-DOF robot arm, integrating PyBullet simulation, demonstrations, and multiple obstacle-encoding schemes. The study systematically compares IL and RL approaches, assessing goal-generalization, obstacle-generalization, and inference speed, and finds that combining demonstrations with hindsight improves learning while obstacle generalization remains problematic. Among encoders, VQ-VAE provides the best obstacle encodings, yet even the best policies often underperform a simple go-to-goal baseline in cluttered and variable obstacle settings. The work highlights the need for more robust scene representations and generalization strategies, and positions RoboArm-NMP as a scalable platform for advancing NMP research.
Abstract
We present RoboArm-NMP, a learning and evaluation environment that allows simple and thorough evaluations of Neural Motion Planning (NMP) algorithms, focused on robotic manipulators. Our Python-based environment provides baseline implementations for learning control policies (either supervised or reinforcement learning based), a simulator based on PyBullet, data of solved instances using a classical motion planning solver, various representation learning methods for encoding the obstacles, and a clean interface between the learning and planning frameworks. Using RoboArm-NMP, we compare several prominent NMP design points, and demonstrate that the best methods mostly succeed in generalizing to unseen goals in a scene with fixed obstacles, but have difficulty in generalizing to unseen obstacle configurations, suggesting focus points for future research.
