Identifying Time-varying Costs in Finite-horizon Linear Quadratic Gaussian Games
Kai Ren, Maryam Kamgarpour
TL;DR
This work tackles identifying time-varying costs in finite-horizon linear-quadratic Gaussian games from observed Nash policies or trajectories. It derives a backward-propagating, null-space characterization that places each $\theta_t^i$ in the null space of $M_t^i$ and presents a constrained least-squares backpropagation algorithm to recover $\{Q_t^i, l_t^i, R_t^i\}$. It further provides finite-sample probabilistic bounds on the error in $\theta_t^i$ when the Nash policy is estimated from demonstrations, accounting for active-set stability. The approach is validated on numerical and driving simulations, demonstrating accurate reconstruction of policies and trajectories given sufficient demonstrations and highlighting data requirements. Overall, the method enables prediction and planning for multi-agent interactions under time-varying objectives in robotics and autonomous systems.
Abstract
We address cost identification in a finite-horizon linear quadratic Gaussian game. We characterize the set of cost parameters that generate a given Nash equilibrium policy. We propose a backpropagation algorithm to identify the time-varying cost parameters. We derive a probabilistic error bound when the cost parameters are identified from finite trajectories. We test our method in numerical and driving simulations. Our algorithm identifies the cost parameters that can reproduce the Nash equilibrium policy and trajectory observations.
