Adapting to Length Shift: FlexiLength Network for Trajectory Prediction
Yi Xu, Yun Fu
TL;DR
The paper addresses Observation Length Shift in Transformer-based trajectory prediction by introducing the FlexiLength Network (FLN), a general framework that trains on multiple observation lengths and delivers temporal invariant representations. FLN comprises FlexiLength Calibration (FLC), which uses shared encoders and temporal distillation to align predictions across short, medium, and long inputs, and FlexiLength Adaptation (FLA), which employs independent positional encoding and specialized layer normalization to reduce length-induced discrepancies. Empirical results on ETH/UCY, nuScenes, and Argoverse 1 show FLN consistently improves prediction accuracy over Isolated Training and adapts to unseen lengths with a one-time training process, applicable to Transformer-based models such as AgentFormer and HiVT. The work also includes ablations and normalization-shift analyses to justify design choices and demonstrates practical implications for robust trajectory forecasting in real-world, length-variable scenarios.
Abstract
Trajectory prediction plays an important role in various applications, including autonomous driving, robotics, and scene understanding. Existing approaches mainly focus on developing compact neural networks to increase prediction precision on public datasets, typically employing a standardized input duration. However, a notable issue arises when these models are evaluated with varying observation lengths, leading to a significant performance drop, a phenomenon we term the Observation Length Shift. To address this issue, we introduce a general and effective framework, the FlexiLength Network (FLN), to enhance the robustness of existing trajectory prediction techniques against varying observation periods. Specifically, FLN integrates trajectory data with diverse observation lengths, incorporates FlexiLength Calibration (FLC) to acquire temporal invariant representations, and employs FlexiLength Adaptation (FLA) to further refine these representations for more accurate future trajectory predictions. Comprehensive experiments on multiple datasets, ie, ETH/UCY, nuScenes, and Argoverse 1, demonstrate the effectiveness and flexibility of our proposed FLN framework.
