Learning Early Social Maneuvers for Enhanced Social Navigation
Yigit Yildirim, Mehmet Suzer, Emre Ugur
TL;DR
This work addresses the challenge of socially compliant mobile navigation by proposing a purely data-driven Learning from Demonstration framework that relies on raw sensory data and explicitly accounts for future pedestrian trajectories. The method combines a Conditional Neural Process–based LfD module with a traditional local planner (e.g., Dynamic Window Approach) and introduces a CNN-based state encoder to incorporate full environmental context, including 360° LiDAR data. It integrates a pedestrian trajectory forecaster (LSTM-based RL) to predict future paths and feed these into the planning module, enabling anticipatory maneuvers. While current results are demonstrated in synthetic environments and component-level evaluations, the authors outline a path toward real-world validation, offline and online integration, and assessments of social trust and acceptance, with plans for multimodal extensions.
Abstract
Socially compliant navigation is an integral part of safety features in Human-Robot Interaction. Traditional approaches to mobile navigation prioritize physical aspects, such as efficiency, but social behaviors gain traction as robots appear more in daily life. Recent techniques to improve the social compliance of navigation often rely on predefined features or reward functions, introducing assumptions about social human behavior. To address this limitation, we propose a novel Learning from Demonstration (LfD) framework for social navigation that exclusively utilizes raw sensory data. Additionally, the proposed system contains mechanisms to consider the future paths of the surrounding pedestrians, acknowledging the temporal aspect of the problem. The final product is expected to reduce the anxiety of people sharing their environment with a mobile robot, helping them trust that the robot is aware of their presence and will not harm them. As the framework is currently being developed, we outline its components, present experimental results, and discuss future work towards realizing this framework.
