Integrating Decision-Making Into Differentiable Optimization Guided Learning for End-to-End Planning of Autonomous Vehicles

Wenru Liu; Yongkang Song; Chengzhen Meng; Zhiyu Huang; Haochen Liu; Chen Lv; Jun Ma

TL;DR

The paper tackles end-to-end autonomous-vehicle planning by embedding decision-making into a differentiable optimization framework that jointly optimizes lane decisions and ego-vehicle trajectories. It combines a transformer-based motion predictor with a differentiable optimizer and a kinematic bicycle vehicle model, yielding an end-to-end trainable pipeline that is trained on the Waymo Open Motion Dataset. The main contributions are (i) a differentiable constrained optimization formulation for lane decisions and planning with learned initialization, (ii) a bilevel training scheme that backpropagates through the optimizer, and (iii) open-loop and closed-loop evaluations together with ablation studies. The results show improved safety, traveling efficiency, and riding comfort relative to baselines, and indicate that optimized decisions can yield better driving performance even when they diverge from expert demonstrations.
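For readers unfamiliar with the kinematic bicycle model mentioned above, the following is a minimal sketch of a rear-axle kinematic bicycle rollout with forward-Euler integration. It is an illustration only: the state layout, the function names `bicycle_step` and `rollout`, the time step, and the wheelbase value are assumptions made here, not details taken from the paper.

```python
import math


def bicycle_step(state, accel, steer, dt=0.1, wheelbase=2.8):
    """Advance a rear-axle kinematic bicycle model by one Euler step.

    state is (x, y, heading, speed); accel is in m/s^2, steer in radians.
    dt and wheelbase are illustrative values, not taken from the paper.
    """
    x, y, heading, speed = state
    x += speed * math.cos(heading) * dt
    y += speed * math.sin(heading) * dt
    heading += speed / wheelbase * math.tan(steer) * dt
    speed = max(0.0, speed + accel * dt)  # this sketch does not model reversing
    return (x, y, heading, speed)


def rollout(state, controls, dt=0.1, wheelbase=2.8):
    """Apply a sequence of (accel, steer) controls and return all visited states."""
    trajectory = [state]
    for accel, steer in controls:
        state = bicycle_step(state, accel, steer, dt, wheelbase)
        trajectory.append(state)
    return trajectory
```

Calling `rollout((0.0, 0.0, 0.0, 10.0), [(0.0, 0.0)] * 10)` drives straight along the x-axis at constant speed. In a planner of the kind summarized here, the controls would be the decision variables and the resulting states would feed the cost terms.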

Abstract

We address the decision-making capability within an end-to-end planning framework that focuses on motion prediction, decision-making, and trajectory planning. Specifically, we formulate decision-making and trajectory planning as a differentiable nonlinear optimization problem, which ensures compatibility with learning-based modules to establish an end-to-end trainable architecture. This optimization introduces explicit objectives related to safety, traveling efficiency, and riding comfort, guiding the learning process in our proposed pipeline. Intrinsic constraints resulting from the decision-making task are integrated into the optimization formulation and preserved throughout the learning process. By integrating the differentiable optimizer with a neural network predictor, the proposed framework is end-to-end trainable, aligning various driving tasks with ultimate performance goals defined by the optimization objectives. The proposed framework is trained and validated using the Waymo Open Motion Dataset. Open-loop testing reveals that while the planning outcomes using our method do not always resemble the expert trajectory, they consistently outperform baseline approaches with improved safety, traveling efficiency, and riding comfort. Closed-loop testing further demonstrates the effectiveness of optimizing decisions and improving driving performance. Ablation studies demonstrate that the initialization provided by the learning-based prediction module is essential for the convergence of the optimizer as well as the overall driving performance.
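To make the idea of training through a differentiable optimizer concrete, here is a toy sketch under strong simplifying assumptions. The inner problem is a scalar quadratic solved by unrolled gradient descent, standing in for the paper's constrained nonlinear optimizer; `theta` stands in for a quantity produced by the learned predictor. The cost, the function names `inner_solve` and `train_theta`, and all step sizes are hypothetical choices made for this illustration and do not come from the paper.

```python
def inner_solve(theta, lam=0.5, lr=0.2, steps=50, x0=0.0):
    """Minimize 0.5*(x - theta)**2 + 0.5*lam*x**2 by unrolled gradient descent.

    Returns the final iterate x and its sensitivity dx/dtheta, obtained by
    differentiating every update step with the chain rule (forward mode).
    """
    x, dx_dtheta = x0, 0.0
    for _ in range(steps):
        grad = (x - theta) + lam * x
        # Derivative of the update x - lr * grad with respect to theta.
        dx_dtheta = dx_dtheta - lr * ((dx_dtheta - 1.0) + lam * dx_dtheta)
        x = x - lr * grad
    return x, dx_dtheta


def train_theta(target, theta=0.0, outer_lr=1.0, outer_steps=100):
    """Outer loop: adjust theta so the inner solution matches a target value.

    The outer loss is 0.5*(x - target)**2; its gradient with respect to theta
    is (x - target) * dx/dtheta, which is where the optimizer's sensitivity
    enters the learning signal.
    """
    for _ in range(outer_steps):
        x, dx_dtheta = inner_solve(theta)
        theta -= outer_lr * (x - target) * dx_dtheta
    return theta
```

For this quadratic the inner minimizer is `theta / (1 + lam)` in closed form, so the sensitivity returned by `inner_solve` can be checked against `1 / (1 + lam)`. A real pipeline would typically rely on an automatic-differentiation framework rather than hand-written derivatives, and would have to handle constraints that this sketch omits.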
