Online Context Learning for Socially Compliant Navigation
Iaroslav Okunevich, Alexandre Lombard, Tomas Krajnik, Yassine Ruichek, Zhi Yan
TL;DR
This work addresses the challenge of socially compliant robot navigation across changing human contexts by proposing a two-layer architecture that couples a DRL-based navigation foundation (SARL) with an online, tracklet-driven social module. The online social context learning mechanism updates only the social module in real time, using a GRU-MLP social value network and a labeling strategy based on tracklet dynamics to adapt to new environments without disrupting the core navigation policy. The approach achieves improvements over state-of-the-art methods in simulations and real-robot experiments, including enhanced social efficiency and robustness across diverse scenarios, and is complemented by open-source code and data. Collectively, the method enables fast online adaptation to unseen social contexts, supporting more robust and socially aware long-term robot deployment in human-rich spaces.
Abstract
Robot social navigation needs to adapt to different human factors and environmental contexts. However, since these factors and contexts are difficult to predict and cannot be exhaustively enumerated, traditional learning-based methods have difficulty in ensuring the social attributes of robots in long-term and cross-environment deployments. This letter introduces an online context learning method that aims to empower robots to adapt to new social environments online. The proposed method adopts a two-layer structure. The bottom layer is built using a deep reinforcement learning-based method to ensure the output of basic robot navigation commands. The upper layer is implemented using an online robot learning-based method to socialize the control commands suggested by the bottom layer. Experiments using a community-wide simulator show that our method outperforms the state-of-the-art ones. Experimental results in the most challenging scenarios show that our method improves the performance of the state-of-the-art by 8%. The source code of the proposed method, the data used, and the tools for the per-training step are publicly available at https://github.com/Nedzhaken/SOCSARL-OL.
