Deviate or Not: Learning Coalition Structures with Multiple-bit Observations in Games

Yixuan Even Xu; Zhe Feng; Fei Fang

Deviate or Not: Learning Coalition Structures with Multiple-bit Observations in Games

Yixuan Even Xu, Zhe Feng, Fei Fang

TL;DR

The paper addresses learning hidden coalition structures among $n$ agents by actively designing a sequence of games and observing multi-bit deviation feedback. It introduces a multi-bit observation oracle that yields $n$-bit feedback per round, enabling sublinear-round CSL across normal-form, congestion, graphical, and auction games. It establishes an information-theoretic lower bound of $\log_2 n - O(\log\log n)$ rounds and matches this bound (up to constants) in several settings with tailored algorithms, including $\log_2 n+2$ rounds for normal-form and congestion, and $2n/d+2\log_2 d+1$ for graphical games, as well as $(1+\log_2 n)(1+c)+1$ rounds for auctions. These results significantly reduce the round complexity required to identify coalition structures, enabling practical deployment in real-world multi-agent systems and mechanism design.

Abstract

We consider the Coalition Structure Learning (CSL) problem in multi-agent systems, motivated by the existence of coalitions in many real-world systems, e.g., trading platforms and auction systems. In this problem, there is a hidden coalition structure within a set of $n$ agents, which affects the behavior of the agents in games. Our goal is to actively design a sequence of games for the agents to play, such that observations in these games can be used to learn the hidden coalition structure. In particular, we consider the setting where in each round, we design and present a game together with a strategy profile to the agents, and receive a multiple-bit observation -- for each agent, we observe whether or not they would like to deviate from the specified strategy. We show that we can learn the coalition structure in $O(\log n)$ rounds if we are allowed to design any normal-form game, matching the information-theoretical lower bound. For practicality, we extend the result to settings where we can only choose games of a specific format, and design algorithms to learn the coalition structure in these settings. For most settings, our complexity matches the theoretical lower bound up to a constant factor.

Deviate or Not: Learning Coalition Structures with Multiple-bit Observations in Games

TL;DR

The paper addresses learning hidden coalition structures among

agents by actively designing a sequence of games and observing multi-bit deviation feedback. It introduces a multi-bit observation oracle that yields

-bit feedback per round, enabling sublinear-round CSL across normal-form, congestion, graphical, and auction games. It establishes an information-theoretic lower bound of

rounds and matches this bound (up to constants) in several settings with tailored algorithms, including

rounds for normal-form and congestion, and

for graphical games, as well as

rounds for auctions. These results significantly reduce the round complexity required to identify coalition structures, enabling practical deployment in real-world multi-agent systems and mechanism design.

Abstract

agents, which affects the behavior of the agents in games. Our goal is to actively design a sequence of games for the agents to play, such that observations in these games can be used to learn the hidden coalition structure. In particular, we consider the setting where in each round, we design and present a game together with a strategy profile to the agents, and receive a multiple-bit observation -- for each agent, we observe whether or not they would like to deviate from the specified strategy. We show that we can learn the coalition structure in

rounds if we are allowed to design any normal-form game, matching the information-theoretical lower bound. For practicality, we extend the result to settings where we can only choose games of a specific format, and design algorithms to learn the coalition structure in these settings. For most settings, our complexity matches the theoretical lower bound up to a constant factor.

Deviate or Not: Learning Coalition Structures with Multiple-bit Observations in Games

TL;DR

Abstract

Deviate or Not: Learning Coalition Structures with Multiple-bit Observations in Games

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (40)