On Feasible Rewards in Multi-Agent Inverse Reinforcement Learning
Till Freihaut, Giorgia Ramponi
TL;DR
This work tackles MAIRL by highlighting that observing a single Nash equilibrium is insufficient to identify underlying rewards due to equilibrium multiplicity. It introduces entropy-regularized Markov games to obtain a unique equilibrium (QRE) and develops a theoretical framework for MAIRL in this regime, including an error-propagation analysis and a generative-model-based sample complexity bound. The authors prove that, in general, reward identifiability is only achievable in the average-reward sense, but show that it becomes possible under the assumption of linearly separable rewards. The results provide a rigorous foundation for MAIRL, offering concrete conditions, bounds, and directions for designing algorithms and studying identifiability under practical multi-agent settings.
Abstract
Multi-agent Inverse Reinforcement Learning (MAIRL) aims to recover agent reward functions from expert demonstrations. We characterize the feasible reward set in Markov games, identifying all reward functions that rationalize a given equilibrium. However, equilibrium-based observations are often ambiguous: a single Nash equilibrium can correspond to many reward structures, potentially changing the game's nature in multi-agent systems. We address this by introducing entropy-regularized Markov games, which yield a unique equilibrium while preserving strategic incentives. For this setting, we provide a sample complexity analysis detailing how errors affect learned policy performance. Our work establishes theoretical foundations and practical insights for MAIRL.
