State-Constrained Zero-Sum Differential Games with One-Sided Information
Mukesh Ghimire, Lei Zhang, Zhe Xu, Yi Ren
TL;DR
This work extends the theory of zero-sum differential games with one-sided information to settings with state constraints, proving the existence of a value and deriving primal and dual subdynamic principles that underpin computational strategy synthesis. The authors develop a backward-induction framework that leverages belief splitting to convexify the value and a dual game for the conjugate value, enabling explicit construction of behavioral strategies for both players. They address numerical challenges—value discontinuities, partial convexity, and convexification error—via physics-informed neural networks, partially convex value networks, and convex hull techniques, and demonstrate these methods on Hexner-like and football-style case studies. The results reveal how information asymmetry and state constraints shape optimal strategies and belief dynamics, with implications for scalable learning in continuous-action, long-horizon differential games.
Abstract
We study zero-sum differential games with state constraints and one-sided information, where the informed player (Player 1) has a categorical payoff type unknown to the uninformed player (Player 2). The goal of Player 1 is to minimize his payoff without violating the constraints, while that of Player 2 is to violate the state constraints if possible, or to maximize the payoff otherwise. One example of the game is a man-to-man matchup in football. Without state constraints, Cardaliaguet (2007) showed that the value of such a game exists and is convex to the common belief of players. Our theoretical contribution is an extension of this result to games with state constraints and the derivation of the primal and dual subdynamic principles necessary for computing behavioral strategies. Different from existing works that are concerned about the scalability of no-regret learning in games with discrete dynamics, our study reveals the underlying structure of strategies for belief manipulation resulting from information asymmetry and state constraints. This structure will be necessary for scalable learning on games with continuous actions and long time windows. We use a simplified football game to demonstrate the utility of this work, where we reveal player positions and belief states in which the attacker should (or should not) play specific random deceptive moves to take advantage of information asymmetry, and compute how the defender should respond.
