Learning in Quantum Common-Interest Games and the Separability Problem
Wayne Lin, Georgios Piliouras, Ryann Sim, Antonios Varvitsiotis
TL;DR
This paper introduces quantum common-interest games (CIGs) where players’ strategies are density matrices and share a common bilinear utility, linking the Nash equilibria to the KKT points of the Best Separable State (BSS) problem. It develops non-commutative analogues of classical learning dynamics—the Linear Quantum Replicator Dynamics (lin-QREP) and Linear Matrix Multiplicative Weights Update (lin-MMWU)—and analyzes their fixed points, Lyapunov structure, and convergence properties via a quantum Shahshahani metric. The authors prove that NE are fixed points for these dynamics, that limit points comprise fixed points (potentially larger than NE), and provide extensive experiments showing lin-QREP converges to NE in many instances while lin-MMWU can converge to non-NE fixed points unless perturbed. They also demonstrate alternating Best Response dynamics converge to NE in two-player QCIGs and present BSS-focused experiments comparing BR with MMWU variants against PPT-SDP ground truths, including large-scale and perturbation-enhanced results. Overall, the work bridges optimization and quantum game theory, offering decentralized learning avenues for BSS and advancing understanding of quantum-cooperative dynamics with practical implications for entanglement-aware optimization and quantum information tasks.
Abstract
Learning in games has emerged as a powerful tool for machine learning with numerous applications. Quantum games model interactions between strategic players who have access to quantum resources, and several recent works have studied {learning in} the competitive regime of quantum zero-sum games. Going beyond this setting, we introduce quantum common-interest games (CIGs) where players have density matrices as strategies and their interests are perfectly aligned. We bridge the gap between optimization and game theory by establishing the equivalence between KKT (first-order stationary) points of an instance of the Best Separable State (BSS) problem and the Nash equilibria of its corresponding quantum CIG. This allows learning dynamics for the quantum CIG to be seen as decentralized algorithms for the BSS problem. Taking the perspective of learning in games, we then introduce non-commutative extensions of the continuous-time replicator dynamics and the discrete-time best response dynamics/linear multiplicative weights update for learning in quantum CIGs. We prove analogues of classical convergence results of the dynamics and explore differences which arise in the quantum setting. Finally, we corroborate our theoretical findings through extensive experiments.
