Scalable Multi-modal Model Predictive Control via Duality-based Interaction Predictions
Hansung Kim, Siddharth H. Nair, Francesco Borrelli
TL;DR
This paper tackles real-time motion planning in complex, multi-modal traffic by integrating a duality-based interaction predictor with a reduced stochastic MPC. The key idea is to screen which AV–TV interactions are relevant along the horizon using RAID-Net, an attention-based recurrent network that leverages Lagrangian duality to identify active constraints, and then solve a reduced MPC that enforces only the necessary collision-avoidance constraints. Empirically, the approach yields up to a 12× speed-up over full MPC on a traffic-intersection scenario, while maintaining high safety with a low false-negative rate and a manageable number of active constraints. The work demonstrates practical impact for scalable autonomous driving in interactive environments and outlines avenues for improved guarantees, learning-from-demonstrations refinements, and generalization to other topologies.
Abstract
We propose a hierarchical architecture designed for scalable real-time Model Predictive Control (MPC) in complex, multi-modal traffic scenarios. This architecture comprises two key components: 1) RAID-Net, a novel attention-based Recurrent Neural Network that predicts relevant interactions along the MPC prediction horizon between the autonomous vehicle and the surrounding vehicles using Lagrangian duality, and 2) a reduced Stochastic MPC problem that eliminates irrelevant collision avoidance constraints, enhancing computational efficiency. Our approach is demonstrated in a simulated traffic intersection with interactive surrounding vehicles, showcasing a 12x speed-up in solving the motion planning problem. A video demonstrating the proposed architecture in multiple complex traffic scenarios can be found here: https://youtu.be/-pRiOnPb9_c. GitHub: https://github.com/MPC-Berkeley/hmpc_raidnet
