Navigating the Human Maze: Real-Time Robot Pathfinding with Generative Imitation Learning
Martin Moder, Stephen Adhisaputra, Josef Pauli
TL;DR
This work tackles robust robot navigation in crowded environments by combining goal-conditioned generative models trained on human crowd data with Sampling-based Model Predictive Control (SMPC). It introduces goal-conditioned Neural Autoregressive (NAR) and Neural Inverse Autoregressive (NIAR) models to forecast human actions, integrating these forecasts into a Model Predictive Path Integral (MPPI) planner that respects robot dynamics via a Dynamic Window Constraint. A multi-faceted reward structure, including a map-based cost, collision penalties, human-imitation likelihood, and a Social Influence Reward, guides planning toward safe, human-like yet efficient trajectories; an Adaptive Sub-goal Navigation strategy ties local planning to a global map to avoid local minima. Real-world LoCoBot demonstrations, and extensive evaluations on ETH/UCY/Wildtrack datasets, show the MPPI-NAR/NIAR approach yields higher success rates and lower collision rates than baselines including DWA and offline RL methods, while highlighting practical considerations such as data requirements and platform variability. The results underscore the value of integrating goal-conditioned imitation with SMPC for real-time, socially aware navigation in dynamic environments, with promising directions for richer datasets and broader robotic platforms.
Abstract
This paper addresses navigation in crowded environments by integrating goal-conditioned generative models with Sampling-based Model Predictive Control (SMPC). We introduce goal-conditioned autoregressive models to generate crowd behaviors, capturing intricate interactions among individuals. The model processes potential robot trajectory samples and predicts the reactions of surrounding individuals, enabling proactive robotic navigation in complex scenarios. Extensive experiments show that this algorithm enables real-time navigation, significantly reducing collision rates and path lengths, and outperforming selected baseline methods. The practical effectiveness of this algorithm is validated on an actual robotic platform, demonstrating its capability in dynamic settings.
