EmoBipedNav: Emotion-aware Social Navigation for Bipedal Robots with Deep Reinforcement Learning
Wei Zhu, Abirath Raju, Abdulaziz Shamsah, Anqi Wu, Seth Hutchinson, Ye Zhao
TL;DR
EmoBipedNav proposes an emotion-aware, end-to-end DRL framework for socially navigating bipedal robots. By representing environments with sequential LiDAR grid maps that encode collision regions and emotion-driven discomfort zones, and by training directly with full-body robot dynamics, the approach bridges ROM-controller gaps and achieves robust, socially compliant navigation. Across simulations and hardware demos, EmoBipedNav with time-varying emotions demonstrates superior success rates and effective handling of pedestrian emotions, while LGMs outperform traditional occupancy grids. The work advances practical, emotion-sensitive social navigation for real-world bipedal robots and highlights sim-to-real transfer capabilities.
Abstract
This study presents an emotion-aware navigation framework -- EmoBipedNav -- using deep reinforcement learning (DRL) for bipedal robots walking in socially interactive environments. The inherent locomotion constraints of bipedal robots challenge their safe maneuvering capabilities in dynamic environments. When combined with the intricacies of social environments, including pedestrian interactions and social cues, such as emotions, these challenges become even more pronounced. To address these coupled problems, we propose a two-stage pipeline that considers both bipedal locomotion constraints and complex social environments. Specifically, social navigation scenarios are represented using sequential LiDAR grid maps (LGMs), from which we extract latent features, including collision regions, emotion-related discomfort zones, social interactions, and the spatio-temporal dynamics of evolving environments. The extracted features are directly mapped to the actions of reduced-order models (ROMs) through a DRL architecture. Furthermore, the proposed framework incorporates full-order dynamics and locomotion constraints during training, effectively accounting for tracking errors and restrictions of the locomotion controller while planning the trajectory with ROMs. Comprehensive experiments demonstrate that our approach exceeds both model-based planners and DRL-based baselines. The hardware videos and open-source code are available at https://gatech-lidar.github.io/emobipednav.github.io/.
