Double Oracle Algorithm for Game-Theoretic Robot Allocation on Graphs
Zijian An, Lifeng Zhou
TL;DR
This work models two players allocating multi-type robots on a directed graph to contest multiple sites, extending the Colonel Blotto framework to capture graph constraints and cyclic dominance among robot types. It develops a two-pronged approach: (i) a DOA-based method to compute Nash equilibria by iteratively solving a sequence of subgames with best-response MILPs, and (ii) a novel CDH-specific construction consisting of an elimination-based transformation, an outcome interface $\\pi_{oi}$, and a piecewise-linear utility $u_{CDH}$ that preserves a meaningful win/lose/draw demarcation. The authors prove the correctness of the CDH utility construction, linearize it for MILP-based best responses, and demonstrate convergence to equilibrium across homogeneous, linear heterogeneous, and CDH robot allocations on graphs. The findings enable tractable equilibrium computation in realistic, constrained, on-graph Blotto settings and offer insights into how graph structure and robot-type interactions influence optimal strategies. This has practical implications for coordinating competitive multi-robot deployments in environments with connectivity and heterogeneity constraints.
Abstract
We study the problem of game-theoretic robot allocation where two players strategically allocate robots to compete for multiple sites of interest. Robots possess offensive or defensive capabilities to interfere and weaken their opponents to take over a competing site. This problem belongs to the conventional Colonel Blotto Game. Considering the robots' heterogeneous capabilities and environmental factors, we generalize the conventional Blotto game by incorporating heterogeneous robot types and graph constraints that capture the robot transitions between sites. Then we employ the Double Oracle Algorithm (DOA) to solve for the Nash equilibrium of the generalized Blotto game. Particularly, for cyclic-dominance-heterogeneous (CDH) robots that inhibit each other, we define a new transformation rule between any two robot types. Building on the transformation, we design a novel utility function to measure the game's outcome quantitatively. Moreover, we rigorously prove the correctness of the designed utility function. Finally, we conduct extensive simulations to demonstrate the effectiveness of DOA on computing Nash equilibrium for homogeneous, linear heterogeneous, and CDH robot allocation on graphs.
