Enhanced Safety in Autonomous Driving: Integrating Latent State Diffusion Model for End-to-End Navigation
Detian Chu, Linyuan Bai, Jianuo Huang, Zhenlong Fang, Peng Zhang, Wei Kang, Haifeng Lin
TL;DR
The paper addresses safety in end-to-end autonomous driving by formulating navigation as a safety-constrained CMDP and introducing ESAD-LEND, which combines latent diffusion-based state representation, a worst-case exploration strategy, and a CVaR-enhanced Soft Actor-Critic within an augmented Lagrangian framework. It enables latent imagination of future trajectories and safety-guaranteed planning, backed by distributional RL to handle risk via CVaR. Evaluations in CARLA and real-world-like settings show ESAD-LEND achieving superior safety, efficiency, and generalization compared with several baselines. This work advances practical safe exploration in high-dimensional autonomous driving and offers a principled, model-based approach for end-to-end navigation with rigorous safety guarantees.
Abstract
With the advancement of autonomous driving, ensuring safety during motion planning and navigation is becoming more and more important. However, most end-to-end planning methods suffer from a lack of safety. This research addresses the safety issue in the control optimization problem of autonomous driving, formulated as Constrained Markov Decision Processes (CMDPs). We propose a novel, model-based approach for policy optimization, utilizing a conditional Value-at-Risk based Soft Actor Critic to manage constraints in complex, high-dimensional state spaces effectively. Our method introduces a worst-case actor to guide safe exploration, ensuring rigorous adherence to safety requirements even in unpredictable scenarios. The policy optimization employs the Augmented Lagrangian method and leverages latent diffusion models to predict and simulate future trajectories. This dual approach not only aids in navigating environments safely but also refines the policy's performance by integrating distribution modeling to account for environmental uncertainties. Empirical evaluations conducted in both simulated and real environment demonstrate that our approach outperforms existing methods in terms of safety, efficiency, and decision-making capabilities.
