Identifying Time-varying Costs in Finite-horizon Linear Quadratic Gaussian Games

Kai Ren; Maryam Kamgarpour

Identifying Time-varying Costs in Finite-horizon Linear Quadratic Gaussian Games

Kai Ren, Maryam Kamgarpour

TL;DR

This work tackles identifying time-varying costs in finite-horizon linear-quadratic Gaussian games from observed Nash policies or trajectories. It derives a backward-propagating, null-space characterization that places each $\theta_t^i$ in the null space of $M_t^i$ and presents a constrained least-squares backpropagation algorithm to recover $\{Q_t^i, l_t^i, R_t^i\}$. It further provides finite-sample probabilistic bounds on the error in $\theta_t^i$ when the Nash policy is estimated from demonstrations, accounting for active-set stability. The approach is validated on numerical and driving simulations, demonstrating accurate reconstruction of policies and trajectories given sufficient demonstrations and highlighting data requirements. Overall, the method enables prediction and planning for multi-agent interactions under time-varying objectives in robotics and autonomous systems.

Abstract

We address cost identification in a finite-horizon linear quadratic Gaussian game. We characterize the set of cost parameters that generate a given Nash equilibrium policy. We propose a backpropagation algorithm to identify the time-varying cost parameters. We derive a probabilistic error bound when the cost parameters are identified from finite trajectories. We test our method in numerical and driving simulations. Our algorithm identifies the cost parameters that can reproduce the Nash equilibrium policy and trajectory observations.

Identifying Time-varying Costs in Finite-horizon Linear Quadratic Gaussian Games

TL;DR

in the null space of

and presents a constrained least-squares backpropagation algorithm to recover

. It further provides finite-sample probabilistic bounds on the error in

when the Nash policy is estimated from demonstrations, accounting for active-set stability. The approach is validated on numerical and driving simulations, demonstrating accurate reconstruction of policies and trajectories given sufficient demonstrations and highlighting data requirements. Overall, the method enables prediction and planning for multi-agent interactions under time-varying objectives in robotics and autonomous systems.

Identifying Time-varying Costs in Finite-horizon Linear Quadratic Gaussian Games

TL;DR

Abstract

Identifying Time-varying Costs in Finite-horizon Linear Quadratic Gaussian Games

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (13)