The Safety Challenge of World Models for Embodied AI Agents: A Review
Lorenzo Baraldi, Zifan Zeng, Chongzhe Zhang, Aradhana Nayak, Hongbo Zhu, Feng Liu, Qunli Zhang, Peng Wang, Shiming Liu, Zheng Hu, Angelo Cangelosi, Lorenzo Baraldi
TL;DR
This review addresses the safety challenges of World Models in embodied AI by surveying scene-generation and control WM approaches in autonomous driving and robotics, and by proposing a pathology-based safety taxonomy. It combines a structured literature analysis with empirical predictions from SoTA models to identify common fault modes and to quantify safety-related pathologies using multimodal and task-specific metrics. The key contributions include a formal pathology criteria framework, a structured experimental protocol, and a forward-looking set of directions (metrics, neurosymbolic control, and MLLM feedback) to improve safety and reliability. The work highlights the gap between current WM predictions and real-world safety requirements, underscoring practical implications for deploying safe, robust embodied agents.
Abstract
The rapid progress in embodied artificial intelligence has highlighted the necessity for more advanced and integrated models that can perceive, interpret, and predict environmental dynamics. In this context, World Models (WMs) have been introduced to provide embodied agents with the abilities to anticipate future environmental states and fill in knowledge gaps, thereby enhancing agents' ability to plan and execute actions. However, when dealing with embodied agents it is fundamental to ensure that predictions are safe for both the agent and the environment. In this article, we conduct a comprehensive literature review of World Models in the domains of autonomous driving and robotics, with a specific focus on the safety implications of scene and control generation tasks. Our review is complemented by an empirical analysis, wherein we collect and examine predictions from state-of-the-art models, identify and categorize common faults (herein referred to as pathologies), and provide a quantitative evaluation of the results.
