Table of Contents
Fetching ...

What Do Agents Think One Another Want? Level-2 Inverse Games for Inferring Agents' Estimates of Others' Objectives

Hamzah I. Khan, Jingqi Li, David Fridovich-Keil

TL;DR

The paper tackles the challenge of interpreting multi-agent strategic interactions when agents hold heterogeneous beliefs about others' objectives, arguing that standard level-1 inverse game methods are insufficient. It develops a level-2 inverse game framework where each agent optimizes its own objective while forming estimates of others' objectives, and a third party infers both the objectives and the beliefs by solving a series of coupled Nash problems. The authors prove non-convexity in level-2 inference for linear-quadratic games and derive loss bounds illustrating level-1 inadequacy under belief misalignment, then propose a differentiable MCP-based gradient solver that scales to offline and online settings. Empirical results on a lane-change driving scenario show level-2 inference uncovers critical misalignments and deadlock conditions that level-1 cannot explain, suggesting practical benefits for safer autonomous planning and regulator-informed traffic optimization. The work lays groundwork for extending level-2 reasoning to more complex nonlinear, stochastic environments and for formal analysis of observability of level-2 parameters.

Abstract

Effectively interpreting strategic interactions among multiple agents requires us to infer each agent's objective from limited information. Existing inverse game-theoretic approaches frame this challenge in terms of a "level-1" inference problem, in which we take the perspective of a third-party observer and assume that individual agents share complete knowledge of one another's objectives. However, this assumption breaks down in decentralized, real-world scenarios like urban driving and bargaining, in which agents may act based on conflicting views of one another's objectives. We demonstrate the necessity of inferring agents' different estimates of each other's objectives through empirical examples, and by theoretically characterizing the prediction error of level-1 inference on fictitious gameplay data from linear-quadratic games. To address this fundamental issue, we propose a framework for level-2 inference to address the question: "What does each agent believe about other agents' objectives?" We prove that the level-2 inference problem is non-convex even in benign settings like linear-quadratic games, and we develop an efficient gradient-based approach for identifying local solutions. Experiments on a synthetic urban driving example show that our approach uncovers nuanced misalignments that level-1 methods miss.

What Do Agents Think One Another Want? Level-2 Inverse Games for Inferring Agents' Estimates of Others' Objectives

TL;DR

The paper tackles the challenge of interpreting multi-agent strategic interactions when agents hold heterogeneous beliefs about others' objectives, arguing that standard level-1 inverse game methods are insufficient. It develops a level-2 inverse game framework where each agent optimizes its own objective while forming estimates of others' objectives, and a third party infers both the objectives and the beliefs by solving a series of coupled Nash problems. The authors prove non-convexity in level-2 inference for linear-quadratic games and derive loss bounds illustrating level-1 inadequacy under belief misalignment, then propose a differentiable MCP-based gradient solver that scales to offline and online settings. Empirical results on a lane-change driving scenario show level-2 inference uncovers critical misalignments and deadlock conditions that level-1 cannot explain, suggesting practical benefits for safer autonomous planning and regulator-informed traffic optimization. The work lays groundwork for extending level-2 reasoning to more complex nonlinear, stochastic environments and for formal analysis of observability of level-2 parameters.

Abstract

Effectively interpreting strategic interactions among multiple agents requires us to infer each agent's objective from limited information. Existing inverse game-theoretic approaches frame this challenge in terms of a "level-1" inference problem, in which we take the perspective of a third-party observer and assume that individual agents share complete knowledge of one another's objectives. However, this assumption breaks down in decentralized, real-world scenarios like urban driving and bargaining, in which agents may act based on conflicting views of one another's objectives. We demonstrate the necessity of inferring agents' different estimates of each other's objectives through empirical examples, and by theoretically characterizing the prediction error of level-1 inference on fictitious gameplay data from linear-quadratic games. To address this fundamental issue, we propose a framework for level-2 inference to address the question: "What does each agent believe about other agents' objectives?" We prove that the level-2 inference problem is non-convex even in benign settings like linear-quadratic games, and we develop an efficient gradient-based approach for identifying local solutions. Experiments on a synthetic urban driving example show that our approach uncovers nuanced misalignments that level-1 methods miss.

Paper Structure

This paper contains 41 sections, 2 theorems, 60 equations, 5 figures, 8 tables, 1 algorithm.

Key Result

Proposition 1

Let $\{\Theta^{i*}\}_{i=1}^N$ be the ground truth parameters of a level-2 inverse game. Suppose that, in each agent $i$'s hypothesized game $\Gamma(\Theta^{i*})$, all agents' cost functions are strongly convex in $(\mathbf{x}^{}, \mathbf{u}^{})$. Define the observed data $\mathbf{y}^{}$ as the union

Figures (5)

  • Figure 1: Schematic of the proposed approach for level-2 game-theoretic model inference. Given observations of a multi-agent interaction, our method infers each agent's objective and its estimate of the others' objectives, thus accounting for potential mismatch in each agent's understanding of the interaction. By contrast, existing "level-1" methods assume that each agent knows the others' objectives, and lack the flexibility to explain nuanced behavior that results from that mismatch.
  • Figure 2: Level-2 inference achieves lower inverse game loss values than level-1 inference when observed agents' estimates of others' objectives differ significantly. We use \ref{['alg:level-2 inference']} to solve the level-2 inverse game, with further details provided in Supplement C.
  • Figure 3: Fictitious play between two agents frequently, but not always, leads to unsuccessful interaction. Both agents wish to track the center of the top lane (1m). When these agents incorrectly estimate each other's objectives, it can lead to a variety of both successful (\ref{['fig:forward-lc-mismatched-safe']}) and unsuccessful (\ref{['fig:forward-lc-mismatched-deadlock']}) interactions. We simulate 64 instances of fictitious play in which agents' estimates of the each other's desired lane offset vary uniformly from 0.5m to 4m. \ref{['fig:lane-change-times']} shows the time it takes for agent 2 to successfully lane change in each instance. In many of these instances, agent 2 successfully changes its lane early in the simulation (\ref{['fig:forward-lc-mismatched-safe']}); this can occur safely, but it can also occur abruptly or aggressively due to mismatch (not shown). In other instances, agent 2's lane change is delayed or fails due to deadlock (\ref{['fig:forward-lc-mismatched-deadlock']}).
  • Figure 4: Level-1 inference on the lane change in \ref{['fig:front-figure']}.
  • Figure 5: Level-2 inference on the lane change in \ref{['fig:front-figure']}.

Theorems & Definitions (5)

  • Remark 1
  • Remark 2
  • Proposition 1
  • Proposition 2
  • Remark 3