LAMS: LLM-Driven Automatic Mode Switching for Assistive Teleoperation
Yiran Tao, Jehan Yang, Dan Ding, Zackory Erickson
TL;DR
LAMS addresses the challenge of frequent mode switching in assistive teleoperation by using an LLM to map low-DoF joystick inputs to high-DoF robot actions based on language-grounded task context, without task demonstrations. It incrementally improves by incorporating user-generated mode-switching examples into its prompts, and is validated through ablation and a user study with 10 participants on complex, long-horizon tasks, showing fewer manual switches and strong user preference. The approach leverages a three-part input to the LLM (prefix, rules, pose grounding) and a probability-based decoding strategy to select robust mappings, with rule synthesis performed by a separate LLM from user examples. Results indicate LAMS generalizes across tasks better than heuristic or static methods, reduces cognitive load, and learns from user interaction over time, indicating practical value for assistive robotics and teleoperation. However, challenges remain in differentiating certain rotational actions and in grounding 3D orientation in natural language, motivating future exploration of cross-task rule transfer and more nuanced NL descriptions.
Abstract
Teleoperating high degrees-of-freedom (DoF) robotic manipulators via low-DoF controllers like joysticks often requires frequent switching between control modes, where each mode maps controller movements to specific robot actions. Manually performing this frequent switching can make teleoperation cumbersome and inefficient. On the other hand, existing automatic mode-switching solutions, such as heuristic-based or learning-based methods, are often task-specific and lack generalizability. In this paper, we introduce LLM-Driven Automatic Mode Switching (LAMS), a novel approach that leverages Large Language Models (LLMs) to automatically switch control modes based on task context. Unlike existing methods, LAMS requires no prior task demonstrations and incrementally improves by integrating user-generated mode-switching examples. We validate LAMS through an ablation study and a user study with 10 participants on complex, long-horizon tasks, demonstrating that LAMS effectively reduces manual mode switches, is preferred over alternative methods, and improves performance over time. The project website with supplementary materials is at https://lams-assistance.github.io/.
