Reducing Stochastic Games to Semidefinite Program Feasibility
Manuel Bodirsky, Georg Loho, Mateusz Skomra
TL;DR
The paper presents a polynomial-time chain of reductions that converts max-plus-average constraint problems into semidefinite program feasibility, thereby subsuming several key game-theoretic problems (mean payoff games, simple stochastic games, parity games) within SDP feasibility. The core innovations are lifting to non-Archimedean SDPs using Puiseux series and then translating to real SDPs through two steps: substituting a large real parameter and encoding large constants with small LMIs via spectrahedral projections. This approach not only links convex optimisation with extensive game-theoretic problems but also suggests that polynomial-time SDP solvers, if available, would yield polynomial-time solutions for the entire referenced class of games. Moreover, the work highlights potential cross-fertilization between convex algebraic geometry and verification, offering new avenues for algorithmic strategies in SDPs inspired by game-theoretic constructions.
Abstract
We present a polynomial-time reduction from max-plus-average constraints to the feasibility problem for semidefinite programs. This shows that Condon's simple stochastic games, stochastic mean payoff games, and in particular mean payoff games and parity games can all be reduced to semidefinite programming.
