Upper and Lower Bounds for a Class of Constrained Linear Time-Varying Games
Vincent Liu, Chris Manzie, Peter M. Dower
TL;DR
Addresses bounding the value function $v$ for constrained linear time-varying zero-sum differential games with a convex terminal cost $g$. The authors develop upper and lower bounds by constructing a viscosity supersolution $\bar{v}$ via trimmed-input trajectories and a viscosity subsolution $\underline{v}$ via hyperplanes, proving $\bar{v}\ge v\ge \underline{v}$ and achieving tightness on a characteristic set. An algorithmic procedure combines offline pre-computation of trajectory-costate data with online evaluation through a quadratic program or hyperplane maximization, yielding scalability not tied to state dimension. In reachability, these bounds translate to inner and outer approximations of backwards reachable sets, enabling rigorous safety analysis without grid-based solvers. The approach delivers substantial computational savings and rigorous guarantees for moderately high-dimensional systems, with tunable accuracy by the number of trajectories and hyperplanes.
Abstract
This paper develops an algorithm for upper- and lower-bounding the value function for a class of linear time-varying games subject to convex control sets. In particular, a two-player zero-sum differential game is considered where the respective players aim to minimise and maximise a convex terminal state cost. A collection of solutions of a single-player dynamical system subject to a trimmed control set is used to characterise a viscosity supersolution of a Hamilton-Jacobi (HJ) equation, which in turn yields an upper bound for the value function. Analogously, a collection of hyperplanes is used to characterise a viscosity subsolution of the HJ equation, which yields a lower bound. The computational complexity and memory requirement of the proposed algorithm scales with the number of solutions and hyperplanes that characterise the bounds, which is not explicitly tied to the number of system states. Thus, the algorithm is tractable for systems of moderately high dimension whilst preserving rigorous guarantees for optimal control and differential game applications.
