Inverse Mixed Strategy Games with Generative Trajectory Models
Max Muchen Sun, Pete Trautman, Todd Murphey
TL;DR
This work tackles the inverse game problem under uncertainty for multi-agent robotics by introducing an inverse mixed-strategy framework that leverages a CVAE-based generative trajectory model as the nominal mixed strategy and a differentiable BRNE solver. By learning the inter-agent cost $l_{\theta}$ and using online samples from the CVAE, the method infers Nash-optimal actions even with noisy observations and unknown objectives. Offline data informs the nominal strategy while online observations drive adaptation, yielding robust performance comparable to ground-truth and oracle baselines in simulated navigation with an unknown game model. The approach offers a practical path toward safe, real-time multi-agent coordination in human-robot scenarios, with potential extensions to more complex, heterogeneous agent settings.
Abstract
Game-theoretic models are effective tools for modeling multi-agent interactions, especially when robots need to coordinate with humans. However, applying these models requires inferring their specifications from observed behaviors -- a challenging task known as the inverse game problem. Existing inverse game approaches often struggle to account for behavioral uncertainty and measurement noise, and leverage both offline and online data. To address these limitations, we propose an inverse game method that integrates a generative trajectory model into a differentiable mixed-strategy game framework. By representing the mixed strategy with a conditional variational autoencoder (CVAE), our method can infer high-dimensional, multi-modal behavior distributions from noisy measurements while adapting in real-time to new observations. We extensively evaluate our method in a simulated navigation benchmark, where the observations are generated by an unknown game model. Despite the model mismatch, our method can infer Nash-optimal actions comparable to those of the ground-truth model and the oracle inverse game baseline, even in the presence of uncertain agent objectives and noisy measurements.
