From Screens to Scenes: A Survey of Embodied AI in Healthcare
Yihao Liu, Xu Cao, Tingting Chen, Yankai Jiang, Junjie You, Minghua Wu, Xiaosong Wang, Mengling Feng, Yaochu Jin, Jintai Chen
TL;DR
This survey addresses a pressing problem in healthcare by examining how Embodied AI can interact with the physical world to improve diagnosis, treatment, and care delivery. It surveys foundational EmAI capabilities—multimodal perception, low-level actuation, high-level planning, and memory—and maps them to four healthcare domains: clinical intervention, daily care and companionship, infrastructure support, and biomedical research. The paper introduces an intelligent five-level framework, catalogs datasets and benchmarks, and discusses ethical, security, and interoperability challenges, offering a path toward safe, generalizable, and patient-centered EmAI systems. Its practical impact lies in guiding researchers and practitioners to build cohesive, end-to-end EmAI solutions that can augment clinicians, expand access, and accelerate medical innovation while addressing societal implications.
Abstract
Healthcare systems worldwide face persistent challenges in efficiency, accessibility, and personalization. Powered by modern AI technologies such as multimodal large language models and world models, Embodied AI (EmAI) represents a transformative frontier, offering enhanced autonomy and the ability to interact with the physical world to address these challenges. As an interdisciplinary and rapidly evolving research domain, "EmAI in healthcare" spans diverse fields such as algorithms, robotics, and biomedicine. This complexity underscores the importance of timely reviews and analyses to track advancements, address challenges, and foster cross-disciplinary collaboration. In this paper, we provide a comprehensive overview of the "brain" of EmAI for healthcare, wherein we introduce foundational AI algorithms for perception, actuation, planning, and memory, and focus on presenting the healthcare applications spanning clinical interventions, daily care & companionship, infrastructure support, and biomedical research. Despite its promise, the development of EmAI for healthcare is hindered by critical challenges such as safety concerns, gaps between simulation platforms and real-world applications, the absence of standardized benchmarks, and uneven progress across interdisciplinary domains. We discuss the technical barriers and explore ethical considerations, offering a forward-looking perspective on the future of EmAI in healthcare. A hierarchical framework of intelligent levels for EmAI systems is also introduced to guide further development. By providing systematic insights, this work aims to inspire innovation and practical applications, paving the way for a new era of intelligent, patient-centered healthcare.
