Controllable Generative Trajectory Prediction via Weak Preference Alignment
Yongxi Cao, Julian F. Schumann, Jens Kober, Joni Pajarinen, Arkady Zgonnikov
TL;DR
PrefCVAE introduces a weak preference alignment mechanism to a conditional VAE, enabling controllable trajectory prediction by embedding semantic attributes into the latent space without degrading predictive accuracy. By sampling paired latent values and using a differentiable preference loss tied to a trajectory utility metric, the model learns a monotonic, semantically meaningful latent space that can steer predictions (e.g., average velocity). Evaluations on nuScenes with a Beta-augmented AgentFormer show that latent control is achievable, the encoder better recovers the latent attributes, and the approach remains competitive on standard accuracy metrics. This work offers a cost-effective method to integrate semantic planning cues into generative models for safer, more informed autonomous driving decisions.
Abstract
Deep generative models such as conditional variational autoencoders (CVAEs) have shown great promise for predicting trajectories of surrounding agents in autonomous vehicle planning. State-of-the-art models have achieved remarkable accuracy in such prediction tasks. Besides accuracy, diversity is also crucial for safe planning because human behaviors are inherently uncertain and multimodal. However, existing methods generally lack a scheme to generate controllably diverse trajectories, which is arguably more useful than randomly diversified trajectories, to the end of safe planning. To address this, we propose PrefCVAE, an augmented CVAE framework that uses weakly labeled preference pairs to imbue latent variables with semantic attributes. Using average velocity as an example attribute, we demonstrate that PrefCVAE enables controllable, semantically meaningful predictions without degrading baseline accuracy. Our results show the effectiveness of preference supervision as a cost-effective way to enhance sampling-based generative models.
