Improving Out-of-Distribution Generalization of Trajectory Prediction for Autonomous Driving via Polynomial Representations
Yue Yao, Shengchao Yan, Daniel Goehring, Wolfram Burgard, Joerg Reichardt
TL;DR
This work tackles the problem that trajectory prediction models are predominantly evaluated on In-Distribution data and often fail to generalize across datasets. It introduces a dataset homogenization protocol to enable Out-of-Distribution testing between Argoverse 2 and Waymo Open, and proposes Everything Polynomial (EP), a compact predictor that represents inputs with Bernstein polynomials of degree $5$ for histories and maps with degree $3$, while predicting future trajectories as $6$-th degree polynomials using a linear reconstruction with a fixed matrix $H$. EP achieves near-SotA In-Distribution performance with a fraction of the parameters and inference time, and demonstrates significantly improved OoD robustness compared to sequence-based baselines, particularly under homogeneous augmentation and polynomial representations. The results advocate adding OoD testing to standard evaluation and show that careful data homogenization and parametric trajectory representations can markedly enhance cross-dataset generalization for autonomous driving trajectory prediction.
Abstract
Robustness against Out-of-Distribution (OoD) samples is a key performance indicator of a trajectory prediction model. However, the development and ranking of state-of-the-art (SotA) models are driven by their In-Distribution (ID) performance on individual competition datasets. We present an OoD testing protocol that homogenizes datasets and prediction tasks across two large-scale motion datasets. We introduce a novel prediction algorithm based on polynomial representations for agent trajectory and road geometry on both the input and output sides of the model. With a much smaller model size, training effort, and inference time, we reach near SotA performance for ID testing and significantly improve robustness in OoD testing. Within our OoD testing protocol, we further study two augmentation strategies of SotA models and their effects on model generalization. Highlighting the contrast between ID and OoD performance, we suggest adding OoD testing to the evaluation criteria of trajectory prediction models.
