Transformable Gaussian Reward Function for Socially-Aware Navigation with Deep Reinforcement Learning
Jinyeob Kim, Sumin Kang, Sungwoo Yang, Beomjoon Kim, Jargalbaatar Yura, Donghan Kim
TL;DR
Problem: reward design for socially-aware robot navigation in crowds is manual and brittle, hindering scalable learning. Approach: Transformable Gaussian Reward Function (TGRF) uses a Gaussian form with $w_{TGRF}$, $\mu_{TGRF}$, and $\sigma_{TGRF}$ and normalization $C_{norm}$ to flexibly shape penalties based on $x_{TGRF}$ (e.g., distance), reducing hyperparameter burden and accelerating DRL learning. Contributions: a low-hyperparameter, adaptable reward-shaping framework validated across multiple reward components and navigation policies, with faster learning and improved safety in crowded simulations and real-world tests. Findings: TGRF improves success rates up to around 95% and reduces intrusion in many settings, while real-world experiments reveal computation and sensor-noise challenges; the method shows strong practicality for socially-aware navigation but requires consideration of physical constraints. Significance: provides a scalable, adaptable reward-shaping tool for human-centric robotics, enabling safer, quicker policy learning in dynamic environments.
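To make the parameters above concrete, the following is a minimal sketch of one way a Gaussian-shaped reward with $w_{TGRF}$, $\mu_{TGRF}$, $\sigma_{TGRF}$, and $C_{norm}$ could be computed. The exact formula is not stated in this summary, so the code assumes that $C_{norm}$ is the peak value of the Gaussian density, which makes the reward equal $w_{TGRF}$ at $x_{TGRF} = \mu_{TGRF}$. The function names `gaussian_pdf` and `tgrf` are hypothetical and are not taken from the authors' repository.

```python
import math


def gaussian_pdf(x: float, mu: float, sigma: float) -> float:
    """Standard Gaussian probability density at x."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def tgrf(x: float, w: float, mu: float, sigma: float) -> float:
    """Gaussian-shaped reward term (illustrative sketch only).

    Assumption: C_norm is the density at its peak (x == mu), so the
    returned value is exactly w at the peak and decays toward 0 as x
    moves away from mu. A negative w yields a penalty.
    """
    if sigma <= 0.0:
        raise ValueError("sigma must be positive")
    c_norm = gaussian_pdf(mu, mu, sigma)  # assumed normalization constant
    return w * gaussian_pdf(x, mu, sigma) / c_norm


if __name__ == "__main__":
    # Hypothetical penalty on robot-human distance (meters): strongest at
    # zero distance, fading as the distance grows.
    for distance in (0.0, 0.5, 1.0, 2.0):
        print(distance, tgrf(distance, w=-1.0, mu=0.0, sigma=0.5))
```

Under this assumption, $w_{TGRF}$ alone sets the magnitude of the term and $\sigma_{TGRF}$ alone sets how quickly it fades, which is one plausible reading of how the Gaussian form keeps the number of hyperparameters small.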
Abstract
Robot navigation has transitioned from prioritizing obstacle avoidance to adopting socially aware navigation strategies that accommodate human presence. As a result, socially aware navigation within dynamic human-centric environments has gained prominence in the field of robotics. Although reinforcement learning techniques have fostered the advancement of socially aware navigation, defining appropriate reward functions, especially in congested environments, has posed a significant challenge. These rewards, which are crucial in guiding robot actions, must be carefully hand-designed because they are complex and cannot be set automatically. The multitude of manually designed rewards leads to hyperparameter redundancy, imbalance among reward terms, and inadequate representation of unique object characteristics. To address these challenges, we introduce a transformable Gaussian reward function (TGRF). The TGRF significantly reduces the burden of hyperparameter tuning, adapts to various reward functions, and accelerates learning, excelling particularly in crowded environments with deep reinforcement learning (DRL). We introduce and validate the TGRF through sections covering its conceptual background, characteristics, experiments, and real-world application, paving the way for a more effective and adaptable approach in robotics. The complete source code is available at https://github.com/JinnnK/TGRF
