On the Vulnerability of LLM/VLM-Controlled Robotics
Xiyang Wu, Souradip Chakraborty, Ruiqi Xian, Jing Liang, Tianrui Guan, Fuxiao Liu, Brian M. Sadler, Dinesh Manocha, Amrit Singh Bedi
TL;DR
The paper investigates a critical reliability gap in LLM/VLM-controlled robotics: sensitivity to small input-modality variations can trigger misalignment and substantial task failures. It formalizes a vulnerability framework with a mathematical objective for perturbation-triggered failures, and proposes perturbation strategies that operate without modifying models. Through experiments on VIMA and Instruct2Act, it shows that perception-physical world perturbations can dramatically reduce success rates (e.g., up to ~29.9% for VIMA), while task type and generalization level modulate robustness, highlighting the need for improved cross-modal alignment and richer training data. The work offers a foundation for robust, safe deployment of LLM/VLM-enabled robots and points to future directions in automated vulnerability analysis and alignment-enhanced training.
Abstract
In this work, we highlight vulnerabilities in robotic systems integrating large language models (LLMs) and vision-language models (VLMs) due to input modality sensitivities. While LLM/VLM-controlled robots show impressive performance across various tasks, their reliability under slight input variations remains underexplored yet critical. These models are highly sensitive to instruction or perceptual input changes, which can trigger misalignment issues, leading to execution failures with severe real-world consequences. To study this issue, we analyze the misalignment-induced vulnerabilities within LLM/VLM-controlled robotic systems and present a mathematical formulation for failure modes arising from variations in input modalities. We propose empirical perturbation strategies to expose these vulnerabilities and validate their effectiveness through experiments on multiple robot manipulation tasks. Our results show that simple input perturbations reduce task execution success rates by 22.2% and 14.6% in two representative LLM/VLM-controlled robotic systems. These findings underscore the importance of input modality robustness and motivate further research to ensure the safe and reliable deployment of advanced LLM/VLM-controlled robotic systems.
