COLSON: Controllable Learning-Based Social Navigation via Diffusion-Based Reinforcement Learning
Yuki Tomita, Kohei Matsumoto, Yuki Hyodo, Ryo Kurazume
TL;DR
COLSON presents a diffusion-based reinforcement learning framework for social navigation that leverages a graph neural network encoder and Q-score matching to produce multimodal actions in dynamic pedestrian environments. It introduces post-training guidance mechanisms—SDEdit-based action smoothing and obstacle-avoidance guidance—to adapt to unseen conditions like static obstacles without retraining. The approach demonstrates superior performance against baselines in simulated circle-crossing and pedestrian-density scenarios and validates real-world feasibility on a mobile robot. The work advances diffusion-based policies for mobile robotics and highlights practical benefits for smooth, safe, and scalable navigation in human-centric environments.
Abstract
Mobile robot navigation in dynamic environments with pedestrian traffic is a key challenge in the development of autonomous mobile service robots. Recently, deep reinforcement learning-based methods have been actively studied and have outperformed traditional rule-based approaches owing to their optimization capabilities. Among these, methods that assume a continuous action space typically rely on a Gaussian distribution assumption, which limits the flexibility of generated actions. Meanwhile, the application of diffusion models to reinforcement learning has advanced, allowing for more flexible action distributions compared with Gaussian distribution-based approaches. In this study, we applied a diffusion-based reinforcement learning approach to social navigation and validated its effectiveness. Furthermore, by leveraging the characteristics of diffusion models, we propose an extension that enables post-training action smoothing and adaptation to static obstacle scenarios not considered during the training steps.
