KiGRAS: Kinematic-Driven Generative Model for Realistic Agent Simulation
Jianbo Zhao, Jiaheng Zhuang, Qibin Zhou, Taiyu Ban, Ziyao Xu, Hangning Zhou, Junhe Wang, Guoan Wang, Zhiheng Li, Bin Li
TL;DR
KiGRAS redefines autonomous-driving trajectory generation by learning in the action space and using a kinematic model to map actions to physically feasible trajectories, thereby removing large portions of redundant state-space representations. A discrete action space (63×63) pairs with a kinematic bicycle model and a unified scene encoder to support a transformer-based autoregressor with causal attention, enabling compact, realistic multi-agent simulations. The approach includes an inverse kinematic transformation via MPC for training labels, plus a Discriminative Policy Optimization fine-tuning pathway to customize driving habits such as safety, speed, and smoothness. Empirical results on Waymo SimAgents show state-of-the-art realism with a small parameter footprint, supported by qualitative scenarios and ablation analyses that validate the design choices.
Abstract
Trajectory generation is a pivotal task in autonomous driving. Recent studies have introduced the autoregressive paradigm, leveraging the state transition model to approximate future trajectory distributions. This paradigm closely mirrors the real-world trajectory generation process and has achieved notable success. However, its potential is limited by the ineffective representation of realistic trajectories within the redundant state space. To address this limitation, we propose the Kinematic-Driven Generative Model for Realistic Agent Simulation (KiGRAS). Instead of modeling in the state space, KiGRAS factorizes the driving scene into action probability distributions at each time step, providing a compact space to represent realistic driving patterns. By establishing physical causality from actions (cause) to trajectories (effect) through the kinematic model, KiGRAS eliminates massive redundant trajectories. All states derived from actions in the cause space are constrained to be physically feasible. Furthermore, redundant trajectories representing identical action sequences are mapped to the same representation, reflecting their underlying actions. This approach significantly reduces task complexity and ensures physical feasibility. KiGRAS achieves state-of-the-art performance in Waymo's SimAgents Challenge, ranking first on the WOMD leaderboard with significantly fewer parameters than other models. The video documentation is available at \url{https://kigras-mach.github.io/KiGRAS/}.
