Optimal Teaming for Coordination with Bounded Rationality via Convex Optimization
Zhewei Wang, Olugbenga Moses Anubi, Marcos M. Vasconcelos
TL;DR
This paper studies optimal teaming for coordination on a network of $N$ agents with binary actions under bounded rationality modeled by log-linear learning. The designer selects the weight matrix $W$ to maximize the stationary probability of convergence to a Nash equilibrium while penalizing connectivity by $(\rho/2) 1^T W 1$; the resulting optimization is convex. A key finding is that, with no sparsity constraints, the optimal $W$ takes the uniform off-diagonal form $W^* = w^*(11^T - I)$ with $w^* > 0$, and $w^*$ decreases as the rationality parameter $\beta$ increases. The paper extends the analysis to arbitrary sparsity patterns, showing convexity remains but optimal weights can be asymmetric depending on the pattern; numerical examples compare topologies (line, star, hybrid) and illustrate topology-dependent performance. Overall, the work provides a principled method to balance learning probability and communication cost in networks of boundedly rational agents, with implications for distributed task allocation and network design.
Abstract
Teaming is the process of establishing connections among agents within a system to enable collaboration toward achieving a collective goal. This paper examines teaming in the context of a network of agents learning to coordinate with bounded rationality. In our framework, the team structure is represented via a weighted graph, and the agents use log-linear learning. We formulate the design of the graph's weight matrix as a convex optimization problem whose objective is to maximize the probability of learning a Nash equilibrium while minimizing a connectivity cost. Despite its convexity, solving this optimization problem is computationally challenging, as the objective function involves the summation over the action profile space, which grows exponentially with the number of agents. Leveraging the underlying symmetry and convexity properties of the problem, when there are no sparsity constraints, we prove that there exists an optimal solution corresponding to a uniformly weighted graph, simplifying to a one-dimensional convex optimization problem. Additionally, we show that the optimal weight decreases monotonically with the agent's rationality, implying that when the agents become more rational the optimal team requires less connectivity.
