Mean Field Correlated Imitation Learning
Zhiyu Zhao, Qirui Mi, Ning Yang, Xue Yan, Haifeng Zhang, Jun Wang, Yaodong Yang
TL;DR
This paper introduces Adaptive Mean Field Correlated Equilibrium ($AMFCE$) to extend mean-field game frameworks to scenarios with time-varying correlated signals, addressing limitations of $MFNE$ and existing MFCE models that assume fixed future signals. Building on this equilibrium, the authors propose Mean Field Correlated Imitation Learning ($MFCIL$), a GAN-based imitation-learning framework that recovers both the agent policy and the evolving correlation device from expert demonstrations. They prove existence of $AMFCE$, show that $MFNE$ is a special case of $AMFCE$, and derive finite-horizon, polynomial-in-$T$ bounds on imitation gaps, improving the tractability of practical MFG-IL. Empirically, $MFCIL$ outperforms state-of-the-art baselines on tasks including Squeeze, RPS, Flock, and real-world traffic flow and TaxAI simulations, illustrating robust recovery of correlated policies and improved population-level predictions. The work provides a principled, scalable approach for modeling and learning in large populations where external, time-varying signals influence collective behavior.
Abstract
We investigate multi-agent imitation learning (IL) within the framework of mean field games (MFGs), considering the presence of time-varying correlated signals. Existing MFG IL algorithms assume demonstrations are sampled from Mean Field Nash Equilibria (MFNE), limiting their adaptability to real-world scenarios. For example, in the traffic network equilibrium influenced by public routing recommendations, recommendations introduce time-varying correlated signals into the game, not captured by MFNE and other existing correlated equilibrium concepts. To address this gap, we propose Adaptive Mean Field Correlated Equilibrium (AMFCE), a general equilibrium incorporating time-varying correlated signals. We establish the existence of AMFCE under mild conditions and prove that MFNE is a subclass of AMFCE. We further propose Correlated Mean Field Imitation Learning (CMFIL), a novel IL framework designed to recover the AMFCE, accompanied by a theoretical guarantee on the quality of the recovered policy. Experimental results, including a real-world traffic flow prediction problem, demonstrate the superiority of CMFIL over state-of-the-art IL baselines, highlighting the potential of CMFIL in understanding large population behavior under correlated signals.
