LASMP: Language Aided Subset Sampling Based Motion Planner
Saswati Bhattacharjee, Anirban Sinha, Chinwe Ekenna
TL;DR
LASMP addresses the inefficiency of traditional sampling-based planners by grounding natural language commands into low-level navigation cues and guiding a subset-based RRT to solve a sequence of subproblems. It integrates Whisper for speech-to-text, RoBERTa for NER to extract turns and destinations, and a modified RRT that samples from a local rectangular subset defined by heading parameters, using ray-casting to detect feasible intersections. The method achieves substantial improvements in sample efficiency (reducing nodes by about 55% and random queries by about 80%) while maintaining safe, collision-free paths, and it demonstrates both simulation and real-world viability in indoor environments. The framework lays groundwork for practical, language-assisted navigation, with future extensions to dynamic obstacles and larger-scale deployment.
Abstract
This paper presents the Language Aided Subset Sampling Based Motion Planner (LASMP), a system that helps mobile robots plan their movements by using natural language instructions. LASMP uses a modified version of the Rapidly Exploring Random Tree (RRT) method, which is guided by user-provided commands processed through a language model (RoBERTa). The system improves efficiency by focusing on specific areas of the robot's workspace based on these instructions, making it faster and less resource-intensive. Compared to traditional RRT methods, LASMP reduces the number of nodes needed by 55% and cuts random sample queries by 80%, while still generating safe, collision-free paths. Tested in both simulated and real-world environments, LASMP has shown better performance in handling complex indoor scenarios. The results highlight the potential of combining language processing with motion planning to make robot navigation more efficient.
