Diffusion-Based Environment-Aware Trajectory Prediction
Theodor Westny, Björn Olofsson, Erik Frisk
TL;DR
This work tackles the challenge of predicting future trajectories for multiple road users under multimodal uncertainty by introducing a diffusion-based conditional model that jointly represents inter-agent interactions and environment knowledge via Graph Attention Networks and Graph-GRU. It integrates lane-graph map information and enforces physical feasibility through differential motion constraints, while enabling interaction-aware guidance to modulate samples. The approach achieves state-of-the-art accuracy on real-world highD and roundabout datasets and demonstrates the model's capacity for multimodal predictions with few diffusion steps. The results suggest practical applicability for robust motion planning in autonomous driving, including scenarios with less cooperative agents and varying interaction strength.
Abstract
The ability to predict the future trajectories of traffic participants is crucial for the safe and efficient operation of autonomous vehicles. In this paper, a diffusion-based generative model for multi-agent trajectory prediction is proposed. The model is capable of capturing the complex interactions between traffic participants and the environment, accurately learning the multimodal nature of the data. The effectiveness of the approach is assessed on large-scale datasets of real-world traffic scenarios, showing that our model outperforms several well-established methods in terms of prediction accuracy. By the incorporation of differential motion constraints on the model output, we illustrate that our model is capable of generating a diverse set of realistic future trajectories. Through the use of an interaction-aware guidance signal, we further demonstrate that the model can be adapted to predict the behavior of less cooperative agents, emphasizing its practical applicability under uncertain traffic conditions.
