Snapshot: Towards Application-centered Models for Pedestrian Trajectory Prediction in Urban Traffic Environments
Nico Uhlemann, Yipeng Zhou, Tobias Simeon Mohr, Markus Lienkamp
TL;DR
The paper tackles real-world pedestrian trajectory prediction in urban traffic by introducing Snapshot, a compact, unimodal, feed-forward predictor with two dedicated encoders for social and map information. It couples a novel, agent-centric encoding with a cross-attention map encoder and a CNN-based decoder to forecast up to $T_p=60$ timesteps from a short observation horizon $T_o=10$, while maintaining real-time performance. A dedicated Argoverse 2 pedestrian benchmark is proposed, derived from a large-scale dataset and augmented via sliding-window sampling to produce over $10^6$ training/validation samples; Snapshot achieves an ADE improvement of $8.8 ext{%}$ over state-of-the-art baselines and strong robustness to varying histories. Real-world applicability is demonstrated by integrating Snapshot into an autonomous driving stack, showing reliable predictions under noisy detections and confirming the model's suitability for real-time deployment in urban environments.
Abstract
This paper explores pedestrian trajectory prediction in urban traffic while focusing on both model accuracy and real-world applicability. While promising approaches exist, they often revolve around pedestrian datasets excluding traffic-related information, or resemble architectures that are either not real-time capable or robust. To address these limitations, we first introduce a dedicated benchmark based on Argoverse 2, specifically targeting pedestrians in traffic environments. Following this, we present Snapshot, a modular, feed-forward neural network that outperforms the current state of the art, reducing the Average Displacement Error (ADE) by 8.8% while utilizing significantly less information. Despite its agent-centric encoding scheme, Snapshot demonstrates scalability, real-time performance, and robustness to varying motion histories. Moreover, by integrating Snapshot into a modular autonomous driving software stack, we showcase its real-world applicability.
