Risk Map As Middleware: Towards Interpretable Cooperative End-to-end Autonomous Driving for Risk-Aware Planning
Mingyue Lei, Zewei Zhou, Hongchen Li, Jiaqi Ma, Jia Hu
TL;DR
This work tackles occlusion-driven risk and interpretability in end-to-end autonomous driving by introducing Risk Map as Middleware (RiskMM), a cooperative framework that learns a spatiotemporal risk map from multi-agent representations and uses it to condition a differentiable, learning-based MPC planner with explicit physical constraints. The methodology combines V2X-assisted scenario awareness, occupancy/risk prediction via attention-based fusion, and a quadratic-cost MPC whose weights are derived from the learned risk map, enabling transparent interpretation of planning decisions. Evaluations on the V2XPnP-Seq dataset show that RiskMM improves risk-aware planning, provides interpretable planner behavior through learned MPC weights, and maintains robustness to communication noise; ablations confirm the critical roles of the risk map and modular training. Overall, RiskMM advances interpretable, risk-aware cooperative driving by uniting explicit risk modeling with end-to-end optimization, with potential for broader adoption across vehicle types and scenarios.
Abstract
End-to-end paradigm has emerged as a promising approach to autonomous driving. However, existing single-agent end-to-end pipelines are often constrained by occlusion and limited perception range, resulting in hazardous driving. Furthermore, their black-box nature prevents the interpretability of the driving behavior, leading to an untrustworthiness system. To address these limitations, we introduce Risk Map as Middleware (RiskMM) and propose an interpretable cooperative end-to-end driving framework. The risk map learns directly from the driving data and provides an interpretable spatiotemporal representation of the scenario from the upstream perception and the interactions between the ego vehicle and the surrounding environment for downstream planning. RiskMM first constructs a multi-agent spatiotemporal representation with unified Transformer-based architecture, then derives risk-aware representations by modeling interactions among surrounding environments with attention. These representations are subsequently fed into a learning-based Model Predictive Control (MPC) module. The MPC planner inherently accommodates physical constraints and different vehicle types and can provide interpretation by aligning learned parameters with explicit MPC elements. Evaluations conducted on the real-world V2XPnP-Seq dataset confirm that RiskMM achieves superior and robust performance in risk-aware trajectory planning, significantly enhancing the interpretability of the cooperative end-to-end driving framework. The codebase will be released to facilitate future research in this field.
