Learning the References of Online Model Predictive Control for Urban Self-Driving
Yubin Wang, Zengqi Peng, Yusen Xie, Yulin Li, Hakim Ghazzai, Jun Ma
TL;DR
The paper tackles safe and efficient urban autonomous driving under dynamic traffic by fusing model-based MPC with a step-based DRL policy that outputs instantaneous references to modulate MPC costs. It introduces a learnable reference state $oldsymbol{x}_{ref}$ and a learnable weight $oldsymbol{Q}_{ref}$, forming a cost term $J_{ref,k}$ that latent-encodes safety, and uses SAC to learn real-time references from partial sensor observations. The approach is evaluated in CARLA with nine traffic participants and demonstrates superior safety, speed, and computational efficiency compared to baselines, plus successful zero-shot sim-to-real transfer and some robustness to noise and vehicle type changes. The work provides open-source code and shows potential for scalable, real-time, safety-aware planning in complex urban environments, with future directions including broader generalization and robustness enhancements.
Abstract
In this work, we propose a novel learning-based model predictive control (MPC) framework for motion planning and control of urban self-driving. In this framework, instantaneous references and cost functions of online MPC are learned from raw sensor data without relying on any oracle or predicted states of traffic. Moreover, driving safety conditions are latently encoded via the introduction of a learnable instantaneous reference vector. In particular, we implement a deep reinforcement learning (DRL) framework for policy search, where practical and lightweight raw observations are processed to reason about the traffic and provide the online MPC with instantaneous references. The proposed approach is validated in a high-fidelity simulator, where our development manifests remarkable adaptiveness to complex and dynamic traffic. Furthermore, sim-to-real deployments are also conducted to evaluate the generalizability of the proposed framework in various real-world applications. Also, we provide the open-source code and video demonstrations at the project website: https://latent-mpc.github.io/.
