Don't Yell at Your Robot: Physical Correction as the Collaborative Interface for Language Model Powered Robots
Chuye Zhang, Yifei Simon Shao, Harshil Parekh, Junyao Shi, Pratik Chaudhari, Vijay Kumar, Nadia Figueroa
TL;DR
This work addresses the challenge of reliable long-horizon tasking for LLM-powered robots by replacing exclusive reliance on natural language with physical corrections as a real-time interface. It introduces a DS-based action framework in which semantic actions issued by an LLM are mapped to 6-DoF dynamical-system commands, while a particle-filter maintains a belief over DS parameters that can be updated through human touch and reflected back into LLM prompts. The system combines confidence-based variable impedance control, a particle-filter estimator, and an interface manager to translate between semantic and DS actions, enabling real-time corrections and learning from corrections. In hybrid real+virtual experiments, the approach demonstrates that physical corrections align robot behavior with human intent, enable memory of corrections in the LLM, and support smoother multi-step task execution, highlighting a practical pathway toward proactive, physically guided human–robot collaboration.
Abstract
We present a novel approach for enhancing human-robot collaboration using physical interactions for real-time error correction of large language model (LLM) powered robots. Unlike other methods that rely on verbal or text commands, the robot leverages an LLM to proactively executes 6 DoF linear Dynamical System (DS) commands using a description of the scene in natural language. During motion, a human can provide physical corrections, used to re-estimate the desired intention, also parameterized by linear DS. This corrected DS can be converted to natural language and used as part of the prompt to improve future LLM interactions. We provide proof-of-concept result in a hybrid real+sim experiment, showcasing physical interaction as a new possibility for LLM powered human-robot interface.
