Table of Contents
Fetching ...

ε-Stationary Nash Equilibria in Multi-player Stochastic Graph Games

Ali Asadi, Léonard Brice, Krishnendu Chatterjee, K. S. Thejaswini

TL;DR

This work studies constrained, $ε$-approximate Nash equilibria in multi-player turn-based stochastic graph games with stationary strategies. It presents an $\mathrm{FNP}^{\mathrm{NP}}$ algorithm that, under a promise of an exact constrained NE, computes an $ε$-NE that satisfies the same payoff constraints up to $ε$ by encoding strategies with floating-point numbers and reducing unilateral deviations to MDP computations solvable with an $\mathrm{NP}$ oracle. The paper proves the exact constrained NE problem is $\exists\mathbb{R}$-complete and that the associated decision problem is $\mathrm{NP}$-hard, supported by a concrete five-player construction showing double-exponential probability requirements for exact equilibria. It also discusses gaps between upper and lower bounds and raises open questions about fixed-point representations for $ε$-NEs and their implications for algorithmic tractability in broader game settings.

Abstract

A strategy profile in a multi-player game is a Nash equilibrium if no player can unilaterally deviate to achieve a strictly better payoff. A profile is an $ε$-Nash equilibrium if no player can gain more than $ε$ by unilaterally deviating from their strategy. In this work, we use $ε$-Nash equilibria to approximate the computation of Nash equilibria. Specifically, we focus on turn-based, multiplayer stochastic games played on graphs, where players are restricted to stationary strategies -- strategies that use randomness but not memory. The problem of deciding the constrained existence of stationary Nash equilibria -- where each player's payoff must lie within a given interval -- is known to be $\exists\mathbb{R}$-complete in such a setting (Hansen and Sølvsten, 2020). We extend this line of work to stationary $ε$-Nash equilibria and present an algorithm that solves the following promise problem: given a game with a Nash equilibrium satisfying the constraints, compute an $ε$-Nash equilibrium that $ε$-satisfies those same constraints -- satisfies the constraints up to an $ε$ additive error. Our algorithm runs in FNP^NP time. To achieve this, we first show that if a constrained Nash equilibrium exists, then one exists where the non-zero probabilities are at least an inverse of a double-exponential in the input. We further prove that such a strategy can be encoded using floating-point representations, as in the work of Frederiksen and Miltersen (2013), which finally gives us our FNP^NP algorithm. We further show that the decision version of the promise problem is NP-hard. Finally, we show a partial tightness result by proving a lower bound for such techniques: if a constrained Nash equilibrium exists, then there must be one that where the probabilities in the strategies are double-exponentially small.

ε-Stationary Nash Equilibria in Multi-player Stochastic Graph Games

TL;DR

This work studies constrained, -approximate Nash equilibria in multi-player turn-based stochastic graph games with stationary strategies. It presents an algorithm that, under a promise of an exact constrained NE, computes an -NE that satisfies the same payoff constraints up to by encoding strategies with floating-point numbers and reducing unilateral deviations to MDP computations solvable with an oracle. The paper proves the exact constrained NE problem is -complete and that the associated decision problem is -hard, supported by a concrete five-player construction showing double-exponential probability requirements for exact equilibria. It also discusses gaps between upper and lower bounds and raises open questions about fixed-point representations for -NEs and their implications for algorithmic tractability in broader game settings.

Abstract

A strategy profile in a multi-player game is a Nash equilibrium if no player can unilaterally deviate to achieve a strictly better payoff. A profile is an -Nash equilibrium if no player can gain more than by unilaterally deviating from their strategy. In this work, we use -Nash equilibria to approximate the computation of Nash equilibria. Specifically, we focus on turn-based, multiplayer stochastic games played on graphs, where players are restricted to stationary strategies -- strategies that use randomness but not memory. The problem of deciding the constrained existence of stationary Nash equilibria -- where each player's payoff must lie within a given interval -- is known to be -complete in such a setting (Hansen and Sølvsten, 2020). We extend this line of work to stationary -Nash equilibria and present an algorithm that solves the following promise problem: given a game with a Nash equilibrium satisfying the constraints, compute an -Nash equilibrium that -satisfies those same constraints -- satisfies the constraints up to an additive error. Our algorithm runs in FNP^NP time. To achieve this, we first show that if a constrained Nash equilibrium exists, then one exists where the non-zero probabilities are at least an inverse of a double-exponential in the input. We further prove that such a strategy can be encoded using floating-point representations, as in the work of Frederiksen and Miltersen (2013), which finally gives us our FNP^NP algorithm. We further show that the decision version of the promise problem is NP-hard. Finally, we show a partial tightness result by proving a lower bound for such techniques: if a constrained Nash equilibrium exists, then there must be one that where the probabilities in the strategies are double-exponentially small.

Paper Structure

This paper contains 12 sections, 14 theorems, 14 equations, 2 figures.

Key Result

Theorem 5

There exists an $\FNP^\NP$ procedure to solve the functional approximated constrained problem of NEs.

Figures (2)

  • Figure 1: A game where very small probabilities are necessary. The rewards corresponding to all the players not explicitly mentioned are zero.
  • Figure 2: A reduction from $3\SAT$

Theorems & Definitions (20)

  • Definition 1: Game
  • Definition 2: Markov decision process, Markov chain
  • Definition 3: Nash equilibrium
  • Definition 4: $\varepsilon$-Nash equilibrium
  • Theorem 5
  • Lemma 6
  • Lemma 8
  • Lemma 11: FM13
  • Lemma 12
  • Theorem 13
  • ...and 10 more