Double Oracle Algorithm for Game-Theoretic Robot Allocation on Graphs

Zijian An; Lifeng Zhou

Double Oracle Algorithm for Game-Theoretic Robot Allocation on Graphs

Zijian An, Lifeng Zhou

TL;DR

This work models two players allocating multi-type robots on a directed graph to contest multiple sites, extending the Colonel Blotto framework to capture graph constraints and cyclic dominance among robot types. It develops a two-pronged approach: (i) a DOA-based method to compute Nash equilibria by iteratively solving a sequence of subgames with best-response MILPs, and (ii) a novel CDH-specific construction consisting of an elimination-based transformation, an outcome interface $\\pi_{oi}$, and a piecewise-linear utility $u_{CDH}$ that preserves a meaningful win/lose/draw demarcation. The authors prove the correctness of the CDH utility construction, linearize it for MILP-based best responses, and demonstrate convergence to equilibrium across homogeneous, linear heterogeneous, and CDH robot allocations on graphs. The findings enable tractable equilibrium computation in realistic, constrained, on-graph Blotto settings and offer insights into how graph structure and robot-type interactions influence optimal strategies. This has practical implications for coordinating competitive multi-robot deployments in environments with connectivity and heterogeneity constraints.

Abstract

We study the problem of game-theoretic robot allocation where two players strategically allocate robots to compete for multiple sites of interest. Robots possess offensive or defensive capabilities to interfere and weaken their opponents to take over a competing site. This problem belongs to the conventional Colonel Blotto Game. Considering the robots' heterogeneous capabilities and environmental factors, we generalize the conventional Blotto game by incorporating heterogeneous robot types and graph constraints that capture the robot transitions between sites. Then we employ the Double Oracle Algorithm (DOA) to solve for the Nash equilibrium of the generalized Blotto game. Particularly, for cyclic-dominance-heterogeneous (CDH) robots that inhibit each other, we define a new transformation rule between any two robot types. Building on the transformation, we design a novel utility function to measure the game's outcome quantitatively. Moreover, we rigorously prove the correctness of the designed utility function. Finally, we conduct extensive simulations to demonstrate the effectiveness of DOA on computing Nash equilibrium for homogeneous, linear heterogeneous, and CDH robot allocation on graphs.

Double Oracle Algorithm for Game-Theoretic Robot Allocation on Graphs

TL;DR

, and a piecewise-linear utility

that preserves a meaningful win/lose/draw demarcation. The authors prove the correctness of the CDH utility construction, linearize it for MILP-based best responses, and demonstrate convergence to equilibrium across homogeneous, linear heterogeneous, and CDH robot allocations on graphs. The findings enable tractable equilibrium computation in realistic, constrained, on-graph Blotto settings and offer insights into how graph structure and robot-type interactions influence optimal strategies. This has practical implications for coordinating competitive multi-robot deployments in environments with connectivity and heterogeneity constraints.

Abstract

Paper Structure (18 sections, 7 theorems, 50 equations, 12 figures, 1 algorithm)

This paper contains 18 sections, 7 theorems, 50 equations, 12 figures, 1 algorithm.

Introduction
Conventions and Problem Formulation
Robot Allocation between Two Players on Graphs
Colonel Blotto Game
Problem Formulation
Preliminaries
Double Oracle Algorithm
Reachable Strategy Set
Approach
Approach for Problem \ref{['prob:1']}
Approach for Problem \ref{['prob:3']}
Construction of outcome interface
Construction of Utility Function
DOA with CDH robots
Numerical Evaluation
...and 3 more sections

Key Result

Lemma 1

A mixed strategy group $(\mathbf{\Delta}_X^*, \mathbf{\Delta}_Y^*)$ is equilibrium of a continuous game $\mathbb{C}$ if, for all $(\mathbf{\Delta}_X, \mathbf{\Delta}_Y)$,

Figures (12)

Figure 1: Game-theoretic robot allocation on a graph. Two players ("red" and "blue") allocate three types of robots to compete for five sites in an area. The area is abstracted into a graph with five sites as the nodes and their connections as edges.
Figure 2: Examples of allocation games with (a) homogeneous and (b) heterogeneous robots. The number of robots allocated on each node is demonstrated as the length of the color bar, with red representing Player 1, and blue representing Player 2. In (a), at time $t$ (left), Player 1 wins node 3 and Player 2 wins node 5. At time $t+1$ (right). Player 1 moves one robot from node 4 into node 1, and Player 2 moves one robot from node 3 to node 2. Thus Player 1 wins nodes 1 and 3 while Player 2 wins the remaining nodes. In (b), Player 1 adopts mixed strategy $\mathbf{\Delta}_X \sim \text{Binomial}(0.4, 0.6)$ and Player 2 adopts pure strategy $\mathbf{\Delta}_Y=\mathbf{S}_y^1$. The win or loss cannot be claimed solely based on the number of robots because of the CDH robots.
Figure 3: The top panel (a) illustrates the composition of the function's definition space $\Gamma$, which is partitioned into three subspaces $Q_1$, $Q_2$, $Q_3$. In each subspace, the function $\pi_{\texttt{oi}}(\mathbf{x})$ is linear, that is, $\pi_{\texttt{oi}}(\mathbf{x}) = g_i(\mathbf{x})$ for $\mathbf{x} \in Q_i$ ($Q_i$ extending indefinitely outward into space). The plot on the right of (a) shows the positions where $g_i(\mathbf{x}) = 0$. (b) displays the relative positioning of the surface $\pi_{\texttt{oi}}(\mathbf{x}) = 0$ in the Cartesian coordinate system. The red half-plane is $g_1$ in \ref{['g_1']}, the green half-plane is $g_2$ in \ref{['g_2']}, and the yellow half-plane is $g_3$ in \ref{['g_3']}. These three half-planes together form the outcome interface, i.e., $\pi_{\texttt{oi}}(\mathbf{x}) = 0$.
Figure 4: Illustration of utility function $u_{\texttt{CDH}}$. The surface $\pi_{\texttt{oi}}(\mathbf{S}_x-\mathbf{S}_y)$ in the middle is the outcome interface in Fig. \ref{['fig: PI']}.
Figure 5: Three different graphs: $G_1$ (left), $G_2$ (middle), $G_3$ (right).
...and 7 more figures

Theorems & Definitions (8)

Lemma 1
Lemma 2
Definition 1: ovchinnikov2000max
Theorem 1: ovchinnikov2000max
Theorem 2
Theorem 3
Lemma 3: adam2021double
Lemma 4

Double Oracle Algorithm for Game-Theoretic Robot Allocation on Graphs

TL;DR

Abstract

Double Oracle Algorithm for Game-Theoretic Robot Allocation on Graphs

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (12)

Theorems & Definitions (8)