Trustworthy Pedestrian Trajectory Prediction via Pattern-Aware Interaction Modeling
Kaiyuan Zhai, Juan Chen, Chao Wang, Zeyi Xu, Guoming Tang
TL;DR
This work tackles trustworthy pedestrian trajectory prediction by addressing the interpretability gap in prior black-box interaction models. It introduces InSyn, a Transformer-based framework with a Pattern-Aware Interaction Encoder, a Trajectory Generator that employs Seq-Start of Seq (SSOS), and a Seq-CVAE for goal sampling, enabling explicit recognition of interaction patterns such as In Sync and Conflict. The approach yields state-of-the-art average ADE on ETH/UCY, demonstrates clear interpretability through case studies, and shows that SSOS reduces initial-step errors by around 6.58%, enhancing stability in sequential predictions. These contributions offer practical benefits for safety-critical applications like autonomous driving by providing both accuracy and transparency in socially aware trajectory forecasting.
Abstract
Accurate and reliable pedestrian trajectory prediction is critical for the application of intelligent applications, yet achieving trustworthy prediction remains highly challenging due to the complexity of interactions among pedestrians. Previous methods often adopt black-box modeling of pedestrian interactions. Despite their strong performance, such opaque modeling limits the reliability of predictions in real-world deployments. To address this issue, we propose InSyn (Interaction-Synchronization Network), a novel Transformer-based model that explicitly captures diverse interaction patterns (e.g., walking in sync or conflicting) while effectively modeling direction-sensitive social behaviors. Additionally, we introduce a training strategy, termed Seq-Start of Seq (SSOS), designed to alleviate the common issue of initial-step divergence in numerical time-series prediction. Experiments on the ETH and UCY datasets demonstrate that our model not only outperforms recent black-box baselines in prediction accuracy, especially under high-density scenarios, but also provides transparent interaction modeling, as shown in the case study. Furthermore, the SSOS strategy proves to be effective in improving sequential prediction performance, reducing the initial-step prediction error by approximately 6.58%. Code is avaliable at https://github.com/rickzky1001/InSyn
