RLSLM: A Hybrid Reinforcement Learning Framework Aligning Rule-Based Social Locomotion Model with Human Social Norms

Yitian Kou; Yihe Gu; Chen Zhou; DanDan Zhu; Shuguang Kuai

RLSLM: A Hybrid Reinforcement Learning Framework Aligning Rule-Based Social Locomotion Model with Human Social Norms

Yitian Kou, Yihe Gu, Chen Zhou, DanDan Zhu, Shuguang Kuai

TL;DR

RLSLM addresses socially aware navigation by integrating a psychology-derived social discomfort field into a reinforcement learning reward. The method formulates a multi-objective RL objective with $G=\sum_t \gamma r_t$ and $r_t=R_d(s_t,s_{t-1}) + R_e(s_t) + \sigma R_s(s_t)$, where $R_e(s_t)=-\alpha$, $R_d(s_t,s_{t-1})=(D_{t-1}-D_t)/l$ and $R_s(s_t)$ aggregates a three-component social influence field (HRSC, HISC, CAC). A VR-based evaluation demonstrates that RLSLM achieves higher comfort ratings than rule-based baselines and ablation shows improved interpretability relative to purely data-driven methods. The work offers a scalable, human-centered framework that fuses cognitive science with reinforcement learning for practical social navigation.

Abstract

Navigating human-populated environments without causing discomfort is a critical capability for socially-aware agents. While rule-based approaches offer interpretability through predefined psychological principles, they often lack generalizability and flexibility. Conversely, data-driven methods can learn complex behaviors from large-scale datasets, but are typically inefficient, opaque, and difficult to align with human intuitions. To bridge this gap, we propose RLSLM, a hybrid Reinforcement Learning framework that integrates a rule-based Social Locomotion Model, grounded in empirical behavioral experiments, into the reward function of a reinforcement learning framework. The social locomotion model generates an orientation-sensitive social comfort field that quantifies human comfort across space, enabling socially aligned navigation policies with minimal training. RLSLM then jointly optimizes mechanical energy and social comfort, allowing agents to avoid intrusions into personal or group space. A human-agent interaction experiment using an immersive VR-based setup demonstrates that RLSLM outperforms state-of-the-art rule-based models in user experience. Ablation and sensitivity analyses further show the model's significantly improved interpretability over conventional data-driven methods. This work presents a scalable, human-centered methodology that effectively integrates cognitive science and machine learning for real-world social navigation.

RLSLM: A Hybrid Reinforcement Learning Framework Aligning Rule-Based Social Locomotion Model with Human Social Norms

TL;DR

Abstract

RLSLM: A Hybrid Reinforcement Learning Framework Aligning Rule-Based Social Locomotion Model with Human Social Norms

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)