α-RACER: Real-Time Algorithm for Game-Theoretic Motion Planning and Control in Autonomous Racing using Near-Potential Function
Dvij Kalaria, Chinmay Maheshwari, Shankar Sastry
TL;DR
This work tackles real-time, opponent-aware motion planning for multi-car autonomous racing by formulating the problem as an infinite-horizon dynamic game and introducing α-RACER, a two-phase method that learns a dynamic $\alpha$-potential function offline and uses it online to compute approximate Nash equilibria. The policy parameterization combines MPC-based controllers with a five-parameter reference trajectory that encodes competitive maneuvers such as overtaking and blocking, while incorporating nonlinear tire dynamics via a dynamic bicycle model. The key contributions are (i) a novel policy parametrization and per-step utility that capture competitive behaviors, (ii) a dynamic $\alpha$-potential framework that enables tractable NE approximation, and (iii) a two-phase offline/online procedure that yields real-time decisions validated in 3-car racing against multiple baselines, including IBR and self-play RL, with favorable approximation gaps and Nash-regret. The approach offers a principled, interpretable path to long-horizon, competitive planning in multi-agent racing with potential applicability to other domains requiring real-time game-theoretic control. Overall, α-RACER demonstrates superior performance and provides a scalable, near-equilibrium planning tool for high-speed autonomous racing.
Abstract
Autonomous racing extends beyond the challenge of controlling a racecar at its physical limits. Professional racers employ strategic maneuvers to outwit other competing opponents to secure victory. While modern control algorithms can achieve human-level performance by computing offline racing lines for single-car scenarios, research on real-time algorithms for multi-car autonomous racing is limited. To bridge this gap, we develop game-theoretic modeling framework that incorporates the competitive aspect of autonomous racing like overtaking and blocking through a novel policy parametrization, while operating the car at its limit. Furthermore, we propose an algorithmic approach to compute the (approximate) Nash equilibrium strategy, which represents the optimal approach in the presence of competing agents. Specifically, we introduce an algorithm inspired by recently introduced framework of dynamic near-potential function, enabling real-time computation of the Nash equilibrium. Our approach comprises two phases: offline and online. During the offline phase, we use simulated racing data to learn a near-potential function that approximates utility changes for agents. This function facilitates the online computation of approximate Nash equilibria by maximizing its value. We evaluate our method in a head-to-head 3-car racing scenario, demonstrating superior performance compared to several existing baselines.
