Pre-training on Synthetic Driving Data for Trajectory Prediction
Yiheng Li, Seth Z. Zhao, Chenfeng Xu, Chen Tang, Chenran Li, Mingyu Ding, Masayoshi Tomizuka, Wei Zhan
TL;DR
This work tackles data scarcity in trajectory forecasting for autonomous driving by generating synthetic data through HD-map augmentation and rule-based trajectory synthesis, followed by self-supervised pre-training on the synthetic data. The authors adapt Masked AutoEncoder–style pre-training to learn general scene representations, then fine-tune on real data, achieving significant improvements over baselines in $MR_6$, $minADE_6$, and $minFDE_6$ (e.g., $5.04\%$, $3.84\%$, and $8.30\%$ respectively). A key finding is that pre-training on synthetic data, especially via self-supervised strategies, yields larger gains than directly augmenting real data or supervised pre-training, reducing the demand for real driving data. The approach demonstrates a practical pipeline for data expansion and representation learning in trajectory prediction, with substantial implications for data efficiency and cross-domain generalization in autonomous driving.
Abstract
Accumulating substantial volumes of real-world driving data proves pivotal in the realm of trajectory forecasting for autonomous driving. Given the heavy reliance of current trajectory forecasting models on data-driven methodologies, we aim to tackle the challenge of learning general trajectory forecasting representations under limited data availability. We propose a pipeline-level solution to mitigate the issue of data scarcity in trajectory forecasting. The solution is composed of two parts: firstly, we adopt HD map augmentation and trajectory synthesis for generating driving data, and then we learn representations by pre-training on them. Specifically, we apply vector transformations to reshape the maps, and then employ a rule-based model to generate trajectories on both original and augmented scenes; thus enlarging the driving data without collecting additional real ones. To foster the learning of general representations within this augmented dataset, we comprehensively explore the different pre-training strategies, including extending the concept of a Masked AutoEncoder (MAE) for trajectory forecasting. Without bells and whistles, our proposed pipeline-level solution is general, simple, yet effective: we conduct extensive experiments to demonstrate the effectiveness of our data expansion and pre-training strategies, which outperform the baseline prediction model by large margins, e.g. 5.04%, 3.84% and 8.30% in terms of $MR_6$, $minADE_6$ and $minFDE_6$. The pre-training dataset and the codes for pre-training and fine-tuning are released at https://github.com/yhli123/Pretraining_on_Synthetic_Driving_Data_for_Trajectory_Prediction.
