Strategizing at Speed: A Learned Model Predictive Game for Multi-Agent Drone Racing
Andrei-Carlo Papuc, Lasse Peters, Sihao Sun, Laura Ferranti, Javier Alonso-Mora
TL;DR
This work analyzes the tension between deep, interaction-aware planning and fast, reactive decision-making in multi-agent drone racing. It shows that while Model Predictive Game (MPG) can outperform contouring MPC at moderate speeds, its benefits erode under latency, prompting the development of Learned Model Predictive Game (LMPG) to amortize computation offline. LMPG uses a differentiable trajectory-optimization layer and offline training to predict Nash strategies with low inference latency (about $3.5$ ms), achieving superior performance in both simulation and real-world races compared to MPG and MPC. The approach offers a practical path to real-time, interaction-aware planning for high-speed aerial robotics, with potential extensions to onboard perception and cluttered environments.
Abstract
Autonomous drone racing pushes the boundaries of high-speed motion planning and multi-agent strategic decision-making. Success in this domain requires drones not only to navigate at their limits but also to anticipate and counteract competitors' actions. In this paper, we study a fundamental question that arises in this domain: how deeply should an agent strategize before taking an action? To this end, we compare two planning paradigms: the Model Predictive Game (MPG), which finds interaction-aware strategies at the expense of longer computation times, and contouring Model Predictive Control (MPC), which computes strategies rapidly but does not reason about interactions. We perform extensive experiments to study this trade-off, revealing that MPG outperforms MPC at moderate velocities but loses its advantage at higher speeds due to latency. To address this shortcoming, we propose a Learned Model Predictive Game (LMPG) approach that amortizes model predictive gameplay to reduce latency. In both simulation and hardware experiments, we benchmark our approach against MPG and MPC in head-to-head races, finding that LMPG outperforms both baselines.
