Blind Inverse Game Theory: Jointly Decoding Rewards and Rationality in Entropy-Regularized Competitive Games
Hamza Virk, Sandro Amaglobeli, Zuhayr Syed
TL;DR
The paper tackles the identifiability problem in inverse game theory when the entropy-regularized rationality parameter $\tau$ is unknown, causing a scale ambiguity with rewards $\theta$. It introduces Blind-IGT, which uses a normalization constraint $\|\theta^*\|_2=1$ to disentangle scale and recovers both $\theta$ and $\tau$ via a Normalized Least Squares estimator, achieving the optimal $\mathcal{O}(N^{-1/2})$ convergence rate. The authors establish necessary and sufficient identifiability conditions (rank and non-uniformity) and provide partial identification via confidence sets when those conditions fail, with extensions to Markov games where transition dynamics may be unknown. Empirical results show convergence rates matching theory, demonstrate the necessity of the blind approach, and reveal robustness to misspecified normalization and unknown dynamics. Overall, Blind-IGT offers a rigorous statistical foundation for recovering objectives and bounded rationality in entropy-regularized competitive environments, enabling reliable reward decoding and interpretability in multi-agent systems.
Abstract
Inverse Game Theory (IGT) methods based on the entropy-regularized Quantal Response Equilibrium (QRE) offer a tractable approach for competitive settings, but critically assume the agents' rationality parameter (temperature $τ$) is known a priori. When $τ$ is unknown, a fundamental scale ambiguity emerges that couples $τ$ with the reward parameters ($θ$), making them statistically unidentifiable. We introduce Blind-IGT, the first statistical framework to jointly recover both $θ$ and $τ$ from observed behavior. We analyze this bilinear inverse problem and establish necessary and sufficient conditions for unique identification by introducing a normalization constraint that resolves the scale ambiguity. We propose an efficient Normalized Least Squares (NLS) estimator and prove it achieves the optimal $\mathcal{O}(N^{-1/2})$ convergence rate for joint parameter recovery. When strong identifiability conditions fail, we provide partial identification guarantees through confidence set construction. We extend our framework to Markov games and demonstrate optimal convergence rates with strong empirical performance even when transition dynamics are unknown.
