Discrete Contrastive Learning for Diffusion Policies in Autonomous Driving
Kalle Kujanpää, Daulet Baimukashev, Farzeen Munir, Shoaib Azam, Tomasz Piotr Kucner, Joni Pajarinen, Ville Kyrki
TL;DR
This work tackles the problem of realistic autonomous-driving simulation by explicitly modeling diverse human driving styles. It introduces Discrete Style Diffusion Policy (DSDP), which first learns discrete driving styles through contrastive learning with the InfoNCE loss and LFQ discretization, then trains a conditional DDPM to generate actions conditioned on both observations and style. Empirical evaluation on NGSIM and Highway-ENV shows that DSDP yields safer and more human-like trajectories than strong baselines, with ablations confirming the importance of contrastive style extraction and discrete style conditioning. The approach improves realism in driving simulations, enhancing the fidelity of AV evaluation and the potential for better sim-to-real transfer.
Abstract
Learning to perform accurate and rich simulations of human driving behaviors from data for autonomous vehicle testing remains challenging due to human driving styles' high diversity and variance. We address this challenge by proposing a novel approach that leverages contrastive learning to extract a dictionary of driving styles from pre-existing human driving data. We discretize these styles with quantization, and the styles are used to learn a conditional diffusion policy for simulating human drivers. Our empirical evaluation confirms that the behaviors generated by our approach are both safer and more human-like than those of the machine-learning-based baseline methods. We believe this has the potential to enable higher realism and more effective techniques for evaluating and improving the performance of autonomous vehicles.
