Intent Demonstration in General-Sum Dynamic Games via Iterative Linear-Quadratic Approximations
Jingqi Li, Anand Siththaranjan, Somayeh Sojoudi, Claire Tomlin, Andrea Bajcsy
TL;DR
This work addresses coordinating $N$ agents in general-sum dynamic games under incomplete information by enabling a certain agent to strategically demonstrate its intent to uncertain opponents. The authors develop an algorithm based on iterative linear-quadratic approximations that alternates between solving complete-information LQ games for a set of candidate intents and optimizing the certain agent’s joint physical-estimate trajectory, with convergence guarantees on belief alignment and potential improvements in task performance. They extend the framework to nonlinear dynamics via iLQG/iLQR and nonlinear belief updates, including Bayesian updates, and discuss potential integrations with deep reinforcement learning. Empirical validation across four multi-agent tasks demonstrates faster belief learning, reduced regret for the certain agent, and robust task performance when intent demonstration is strategically employed, highlighting practical benefits for autonomous driving, multi-robot collaboration, and shared-control systems.
Abstract
Autonomous agents should coordinate effectively without prior knowledge of others' intents. While prior work has focused on intent inference, we address the inverse problem: how agents can strategically demonstrate their intents within general-sum dynamic games. We model this problem and propose an algorithm that balances intent demonstration with task performance. To handle nonlinear dynamic games with continuous state-action spaces, our method leverages iterative linear-quadratic game approximations and provides efficient intent-teaching guarantees: the uncertain agent's belief can be driven rapidly to the ground truth, while the demonstrating agent avoids expending effort on unnecessary belief alignment when it does not improve task performance. Theoretical analysis and hardware experiments confirm that our approach enables the demonstrating agent to reconcile task execution with belief alignment and strategically manage the information asymmetry among agents, even as its intent evolves during deployment.
