PACE: A Framework for Learning and Control in Linear Incomplete-Information Differential Games

Seyed Yousef Soltanian; Wenlong Zhang

PACE: A Framework for Learning and Control in Linear Incomplete-Information Differential Games

Seyed Yousef Soltanian, Wenlong Zhang

TL;DR

This work tackles two-player linear-quadratic differential games with incomplete information, where each agent lacks knowledge of the other's cost parameters. It introduces the Peer-Aware Cost Estimation (PACE) framework, which treats the peer as a learning agent and jointly learns the opponent's cost parameters and Riccati solution in real time by modeling the peer's learning dynamics from past state trajectories. The method provides convergence guarantees for parameter estimates and system stability, and demonstrates superior robustness and faster convergence compared with complete-information peer-approximation baselines in numerical experiments. The approach promises practical impact for human-robot interaction and multi-agent control by enabling real-time, data-driven inference of opponents' objectives using only shared state observations.

Abstract

In this paper, we address the problem of a two-player linear quadratic differential game with incomplete information, a scenario commonly encountered in multi-agent control, human-robot interaction (HRI), and approximation methods for solving general-sum differential games. While solutions to such linear differential games are typically obtained through coupled Riccati equations, the complexity increases when agents have incomplete information, particularly when neither is aware of the other's cost function. To tackle this challenge, we propose a model-based Peer-Aware Cost Estimation (PACE) framework for learning the cost parameters of the other agent. In PACE, each agent treats its peer as a learning agent rather than a stationary optimal agent, models their learning dynamics, and leverages this dynamic to infer the cost function parameters of the other agent. This approach enables agents to infer each other's objective function in real time based solely on their previous state observations and dynamically adapt their control policies. Furthermore, we provide a theoretical guarantee for the convergence of parameter estimation and the stability of system states in PACE. Additionally, in our numerical studies, we demonstrate how modeling the learning dynamics of the other agent benefits PACE, compared to approaches that approximate the other agent as having complete information, particularly in terms of stability and convergence speed.

PACE: A Framework for Learning and Control in Linear Incomplete-Information Differential Games

TL;DR

Abstract

PACE: A Framework for Learning and Control in Linear Incomplete-Information Differential Games

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (6)