Deviate or Not: Learning Coalition Structures with Multiple-bit Observations in Games
Yixuan Even Xu, Zhe Feng, Fei Fang
TL;DR
The paper addresses learning hidden coalition structures among $n$ agents by actively designing a sequence of games and observing multi-bit deviation feedback. It introduces a multi-bit observation oracle that yields $n$-bit feedback per round, enabling sublinear-round CSL across normal-form, congestion, graphical, and auction games. It establishes an information-theoretic lower bound of $\log_2 n - O(\log\log n)$ rounds and matches this bound (up to constants) in several settings with tailored algorithms, including $\log_2 n+2$ rounds for normal-form and congestion, and $2n/d+2\log_2 d+1$ for graphical games, as well as $(1+\log_2 n)(1+c)+1$ rounds for auctions. These results significantly reduce the round complexity required to identify coalition structures, enabling practical deployment in real-world multi-agent systems and mechanism design.
Abstract
We consider the Coalition Structure Learning (CSL) problem in multi-agent systems, motivated by the existence of coalitions in many real-world systems, e.g., trading platforms and auction systems. In this problem, there is a hidden coalition structure within a set of $n$ agents, which affects the behavior of the agents in games. Our goal is to actively design a sequence of games for the agents to play, such that observations in these games can be used to learn the hidden coalition structure. In particular, we consider the setting where in each round, we design and present a game together with a strategy profile to the agents, and receive a multiple-bit observation -- for each agent, we observe whether or not they would like to deviate from the specified strategy. We show that we can learn the coalition structure in $O(\log n)$ rounds if we are allowed to design any normal-form game, matching the information-theoretical lower bound. For practicality, we extend the result to settings where we can only choose games of a specific format, and design algorithms to learn the coalition structure in these settings. For most settings, our complexity matches the theoretical lower bound up to a constant factor.
