MSMA: Multi-agent Trajectory Prediction in Connected and Autonomous Vehicle Environment with Multi-source Data Integration
Xi Chen, Rahul Bhadani, Zhanbo Sun, Larry Head
TL;DR
This work tackles trajectory prediction for surrounding vehicles in a mixed traffic scenario with a central CAV, leveraging both onboard sensors and V2X communications. It introduces MSMA, an encoder-decoder framework that uses source-specific temporal encoders and a cross-attention fusion module, along with Graph Attention Network–based agent-agent and agent-lane interactions, to jointly predict $D$ future trajectories per agent. The model is trained and evaluated on a customized CARLA Town03 dataset with synthesized latency and noise, and shows that multi-source data fusion improves prediction accuracy, particularly at higher CV market penetration rates, as measured by $\text{ADE}$, $\text{FDE}$, and $\text{MR}$. The study highlights MSMA as a first step toward effective multi-source trajectory forecasting in a CAV environment, while acknowledging limitations such as assuming broadcast-only CV trajectories, vehicle homogeneity, and the need for real-world data for broader validation.
Abstract
The prediction of surrounding vehicle trajectories is crucial for collision-free path planning. In this study, we focus on a scenario where a connected and autonomous vehicle (CAV) serves as the central agent, utilizing both sensors and communication technologies to perceive its surrounding traffics consisting of autonomous vehicles (AVs), connected vehicles (CVs), and human-driven vehicles (HDVs). Our trajectory prediction task is aimed at all the detected surrounding vehicles. To effectively integrate the multi-source data from both sensor and communication technologies, we propose a deep learning framework called MSMA utilizing a cross-attention module for multi-source data fusion. Vector map data is utilized to provide contextual information. The trajectory dataset is collected in CARLA simulator with synthesized data errors introduced. Numerical experiments demonstrate that in a mixed traffic flow scenario, the integration of data from different sources enhances our understanding of the environment. This notably improves trajectory prediction accuracy, particularly in situations with a high CV market penetration rate. The code is available at: https://github.com/xichennn/MSMA.
