A Knowledge-Driven Diffusion Policy for End-to-End Autonomous Driving Based on Expert Routing
Chengkai Xu, Jiaqi Liu, Yicheng Guo, Peng Hang, Jian Sun
TL;DR
End-to-end autonomous driving faces challenges of generalization, multi-modality, and long-horizon planning. The paper presents a knowledge-driven diffusion policy (KDP) that integrates diffusion-based generative modeling with a sparse mixture-of-experts router to produce temporally coherent action sequences and compose reusable driving knowledge. Key contributions include modeling driving knowledge as abstract experts, a Top-K routing mechanism with load balancing and mutual information regularization, and a Transformer-backed diffusion backbone that achieves superior performance in ramp, intersection, and roundabout scenarios. Results show higher success rates, lower collision risk, and smoother control compared to baselines, with ablations validating the necessity of sparse routing and temporal modeling for cross-scenario generalization and knowledge reuse.
Abstract
End-to-end autonomous driving remains constrained by the difficulty of producing adaptive, robust, and interpretable decision-making across diverse scenarios. Existing methods often collapse diverse driving behaviors, lack long-horizon consistency, or require task-specific engineering that limits generalization. This paper presents KDP, a knowledge-driven diffusion policy that integrates generative diffusion modeling with a sparse mixture-of-experts routing mechanism. The diffusion component generates temporally coherent action sequences, while the expert routing mechanism activates specialized and reusable experts according to context, enabling modular knowledge composition. Extensive experiments across representative driving scenarios demonstrate that KDP achieves consistently higher success rates, reduced collision risk, and smoother control compared to prevailing paradigms. Ablation studies highlight the effectiveness of sparse expert activation and the Transformer backbone, and activation analyses reveal structured specialization and cross-scenario reuse of experts. These results establish diffusion with expert routing as a scalable and interpretable paradigm for knowledge-driven end-to-end autonomous driving.
