Uncertainty-Aware DRL for Autonomous Vehicle Crowd Navigation in Shared Space
Mahsa Golchoubian, Moojan Ghafurian, Kerstin Dautenhahn, Nasser Lashgarian Azad
TL;DR
The paper tackles uncertainty in pedestrian trajectories for autonomous vehicle crowd navigation by integrating a data-driven, uncertainty-aware pedestrian predictor with a model-free DRL planner. A novel uncertainty-aware reward and a realism-based Hamburg-derived simulation environment enable training that accounts for prediction covariance, resulting in safer, more human-like navigation. Quantitative results show a 40% reduction in collisions and a 15% improvement in minimum intrusion distance over uncertainty-agnostic baselines, with the proposed method outperforming MPC in both safety and computational efficiency. This work advances practical, real-time AV crowd navigation by explicitly leveraging prediction uncertainty during training and planning, bringing trajectories closer to human driving in shared spaces.
Abstract
Safe, socially compliant, and efficient navigation of low-speed autonomous vehicles (AVs) in pedestrian-rich environments necessitates considering pedestrians' future positions and interactions with the vehicle and others. Despite the inevitable uncertainties associated with pedestrians' predicted trajectories due to their unobserved states (e.g., intent), existing deep reinforcement learning (DRL) algorithms for crowd navigation often neglect these uncertainties when using predicted trajectories to guide policy learning. This omission limits the usability of predictions when diverging from ground truth. This work introduces an integrated prediction and planning approach that incorporates the uncertainties of predicted pedestrian states in the training of a model-free DRL algorithm. A novel reward function encourages the AV to respect pedestrians' personal space, decrease speed during close approaches, and minimize the collision probability with their predicted paths. Unlike previous DRL methods, our model, designed for AV operation in crowded spaces, is trained in a novel simulation environment that reflects realistic pedestrian behaviour in a shared space with vehicles. Results show a 40% decrease in collision rate and a 15% increase in minimum distance to pedestrians compared to the state of the art model that does not account for prediction uncertainty. Additionally, the approach outperforms model predictive control methods that incorporate the same prediction uncertainties in terms of both performance and computational time, while producing trajectories closer to human drivers in similar scenarios.
