Optimal Rates for Feasible Payoff Set Estimation in Games

Annalisa Barbara; Riccardo Poiani; Martino Bernasconi; Andrea Celli

Optimal Rates for Feasible Payoff Set Estimation in Games

Annalisa Barbara, Riccardo Poiani, Martino Bernasconi, Andrea Celli

TL;DR

The paper addresses the problem of learning the full feasible payoff set consistent with observed equilibrium behavior in an unknown bimatrix game. It develops minimax-optimal sample complexity rates for estimating this set under Hausdorff error, covering both general-sum and zero-sum games and both exact ($\alpha=0$) and approximate ($\alpha>0$) equilibria. The main approach uses a combination of set-valued mappings, linear-inequality representations, and concentration arguments, including a fractional-knapsack-inspired analysis for the approximate-equilibrium case. The results provide a principled learning-theoretic foundation for set-valued payoff inference in multi-agent environments and have downstream use in counterfactual analysis, mechanism design, and robustness to payoff ambiguity.

Abstract

We study a setting in which two players play a (possibly approximate) Nash equilibrium of a bimatrix game, while a learner observes only their actions and has no knowledge of the equilibrium or the underlying game. A natural question is whether the learner can rationalize the observed behavior by inferring the players' payoff functions. Rather than producing a single payoff estimate, inverse game theory aims to identify the entire set of payoffs consistent with observed behavior, enabling downstream use in, e.g., counterfactual analysis and mechanism design across applications like auctions, pricing, and security games. We focus on the problem of estimating the set of feasible payoffs with high probability and up to precision $ε$ on the Hausdorff metric. We provide the first minimax-optimal rates for both exact and approximate equilibrium play, in zero-sum as well as general-sum games. Our results provide learning-theoretic foundations for set-valued payoff inference in multi-agent environments.

Optimal Rates for Feasible Payoff Set Estimation in Games

TL;DR

) and approximate (

) equilibria. The main approach uses a combination of set-valued mappings, linear-inequality representations, and concentration arguments, including a fractional-knapsack-inspired analysis for the approximate-equilibrium case. The results provide a principled learning-theoretic foundation for set-valued payoff inference in multi-agent environments and have downstream use in counterfactual analysis, mechanism design, and robustness to payoff ambiguity.

Abstract

on the Hausdorff metric. We provide the first minimax-optimal rates for both exact and approximate equilibrium play, in zero-sum as well as general-sum games. Our results provide learning-theoretic foundations for set-valued payoff inference in multi-agent environments.

Paper Structure (58 sections, 37 theorems, 254 equations, 1 figure, 1 table, 1 algorithm)

This paper contains 58 sections, 37 theorems, 254 equations, 1 figure, 1 table, 1 algorithm.

Introduction
Technical challenges
Related Works
Preliminaries
Mathematical notation
General-Sum Games and Nash-Equilibrium
Zero-Sum Games and Nash-Equilibrium
Learning Framework
Sample Complexity Lower Bound
On Exact Equilibria
On Approximate Equilibria
Algorithm
Representation of $\mathcal{G}_\alpha(\hat{x}, \hat{y})$ and $\mathcal{Z}_\alpha(\hat{x}, \hat{y})$
Optimal Rates for Exact Equilibria
Proof sketch of \ref{['thm:ub-alpha-0']} (General-Sum Games)
...and 43 more sections

Key Result

Theorem 3.1

Let $\alpha = 0$ and $\epsilon < 1$. Then, there exists a problem instance $(x, y) \in \Delta_n^2$ such that, for any $\pi \in (0,1)$ and $(\epsilon, \delta)$-correct algorithm, it holds that: This holds both for General and Zero-Sum Games.

Figures (1)

Figure 1: Visual representations of payoff sets $\mathcal{G}^x_\alpha(\cdot, \cdot)$ for several pairs of instances $(x^0, y^0)$ (striped red region) and $(x^1, y^1)$ (solid green region). In each figure, a point $P^*$ is used to illustrate the Hausdorff distance between the two areas. The $y$ vector is $(1,0)$ in all problems. (Left): we set $\alpha=0$, $x^0=(0,1)$, $x^1=(\pi, 1-\pi)$ for any $\pi \in (0,1)$; (Center): $\alpha = 0.2$, $x^0=(0,1)$, and $x^1=(\pi, 1-\pi)$ for $\pi = 0.05$; (Right): $\alpha=0.1$, $\epsilon=0.1$, $x^0 = (\tfrac{\alpha}{2}, 1-\tfrac{\alpha}{2})$ and $x^1 =(\tfrac{\alpha(1+3\epsilon)}{2}, 1-\tfrac{\alpha(1+3\epsilon)}{2})$.

Theorems & Definitions (69)

Remark 2.1
Theorem 3.1
Theorem 3.2
Theorem 3.3
Theorem 4.1
Theorem 4.2
proof
proof : Proof of \ref{['thm:lb-alpha-0']}
Theorem 1.1
proof
...and 59 more

Optimal Rates for Feasible Payoff Set Estimation in Games

TL;DR

Abstract

Optimal Rates for Feasible Payoff Set Estimation in Games

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (69)