Structured Graph Network for Constrained Robot Crowd Navigation with Low Fidelity Simulation
Shuijing Liu, Kaiwen Hong, Neeloy Chakraborty, Katherine Driggs-Campbell
TL;DR
This work tackles constrained robot crowd navigation under sim2real gaps by learning in a low-fidelity simulator using a split scene representation: detected humans and map/robot-localization derived obstacle point clouds. A spatio-temporal graph models interactions among robot, humans, and obstacles, with three dedicated attention networks (HH, OH, RH) and a GRU to produce robust policies trained via PPO. Empirical results in simulation show that full interaction modeling substantially improves navigation success and safety, while ablations highlight the importance of capturing human-human and human-obstacle interactions. Real-world experiments with a TurtleBot 2i demonstrate the approach's practical viability, though adversarial pedestrian behavior remains challenging and motivates future hierarchical planning and more realistic human models. Overall, the paper contributes a scalable, attention-guided framework that reduces sim2real gaps and enhances constrained crowd navigation in both simulated and real environments.
Abstract
We investigate the feasibility of deploying reinforcement learning (RL) policies for constrained crowd navigation using a low-fidelity simulator. We introduce a representation of the dynamic environment, separating human and obstacle representations. Humans are represented through detected states, while obstacles are represented as computed point clouds based on maps and robot localization. This representation enables RL policies trained in a low-fidelity simulator to deploy in real world with a reduced sim2real gap. Additionally, we propose a spatio-temporal graph to model the interactions between agents and obstacles. Based on the graph, we use attention mechanisms to capture the robot-human, human-human, and human-obstacle interactions. Our method significantly improves navigation performance in both simulated and real-world environments. Video demonstrations can be found at https://sites.google.com/view/constrained-crowdnav/home.
