RedMotion: Motion Prediction via Redundancy Reduction
Royden Wagner, Omer Sahin Tas, Marvin Klemp, Carlos Fernandez, Christoph Stiller
TL;DR
RedMotion tackles self-driving motion prediction by learning augmentation-invariant road-environment representations through two redundancy-reduction mechanisms: (i) an internal transformer-based RED token decoder that compresses a variable local token set into a fixed global embedding, and (ii) Road Barlow Twins self-supervision applied to embeddings from augmented views. The architecture combines a trajectory encoder with a road-environment encoder that produces local and global context, fusing them via efficient cross-attention to generate multiple trajectory proposals. Empirical results on Waymo Open Motion and Argoverse 2 show improvements over contrastive, self-distillation, and masked-autoencoding baselines, with competitive performance against HPTR and MTR++ in Waymo. The work provides a universal approach to convert variable-length local road context into stable global representations, enabling more data-efficient pre-training for motion prediction and offering open-source code for adoption across multi-modal inputs in autonomous driving scenarios.
Abstract
We introduce RedMotion, a transformer model for motion prediction in self-driving vehicles that learns environment representations via redundancy reduction. Our first type of redundancy reduction is induced by an internal transformer decoder and reduces a variable-sized set of local road environment tokens, representing road graphs and agent data, to a fixed-sized global embedding. The second type of redundancy reduction is obtained by self-supervised learning and applies the redundancy reduction principle to embeddings generated from augmented views of road environments. Our experiments reveal that our representation learning approach outperforms PreTraM, Traj-MAE, and GraphDINO in a semi-supervised setting. Moreover, RedMotion achieves competitive results compared to HPTR or MTR++ in the Waymo Motion Prediction Challenge. Our open-source implementation is available at: https://github.com/kit-mrt/future-motion
