Table of Contents
Fetching ...

Learning Mean Field Games on Sparse Graphs: A Hybrid Graphex Approach

Christian Fabian, Kai Cui, Heinz Koeppl

TL;DR

GXMFGs extend mean field games to sparse networks by leveraging graphexes, addressing scalability and realism in agent interactions. The authors propose a Hybrid graphex learning algorithm (HOMD) that decouples a highly connected core from a sparse periphery, and provide theoretical results showing convergence of the finite-system MF to the limiting GXMFG framework along with approximate optimality guarantees. They validate the approach on synthetic graphex networks and real-world graphs, reporting accurate mean-field predictions (typically within a few percent) and superior performance over LPGMFG baselines. The work advances scalable equilibrium learning for large, realistic networks and lays groundwork for extensions to continuous state/action spaces and time.

Abstract

Learning the behavior of large agent populations is an important task for numerous research areas. Although the field of multi-agent reinforcement learning (MARL) has made significant progress towards solving these systems, solutions for many agents often remain computationally infeasible and lack theoretical guarantees. Mean Field Games (MFGs) address both of these issues and can be extended to Graphon MFGs (GMFGs) to include network structures between agents. Despite their merits, the real world applicability of GMFGs is limited by the fact that graphons only capture dense graphs. Since most empirically observed networks show some degree of sparsity, such as power law graphs, the GMFG framework is insufficient for capturing these network topologies. Thus, we introduce the novel concept of Graphex MFGs (GXMFGs) which builds on the graph theoretical concept of graphexes. Graphexes are the limiting objects to sparse graph sequences that also have other desirable features such as the small world property. Learning equilibria in these games is challenging due to the rich and sparse structure of the underlying graphs. To tackle these challenges, we design a new learning algorithm tailored to the GXMFG setup. This hybrid graphex learning approach leverages that the system mainly consists of a highly connected core and a sparse periphery. After defining the system and providing a theoretical analysis, we state our learning approach and demonstrate its learning capabilities on both synthetic graphs and real-world networks. This comparison shows that our GXMFG learning algorithm successfully extends MFGs to a highly relevant class of hard, realistic learning problems that are not accurately addressed by current MARL and MFG methods.

Learning Mean Field Games on Sparse Graphs: A Hybrid Graphex Approach

TL;DR

GXMFGs extend mean field games to sparse networks by leveraging graphexes, addressing scalability and realism in agent interactions. The authors propose a Hybrid graphex learning algorithm (HOMD) that decouples a highly connected core from a sparse periphery, and provide theoretical results showing convergence of the finite-system MF to the limiting GXMFG framework along with approximate optimality guarantees. They validate the approach on synthetic graphex networks and real-world graphs, reporting accurate mean-field predictions (typically within a few percent) and superior performance over LPGMFG baselines. The work advances scalable equilibrium learning for large, realistic networks and lays groundwork for extensions to continuous state/action spaces and time.

Abstract

Learning the behavior of large agent populations is an important task for numerous research areas. Although the field of multi-agent reinforcement learning (MARL) has made significant progress towards solving these systems, solutions for many agents often remain computationally infeasible and lack theoretical guarantees. Mean Field Games (MFGs) address both of these issues and can be extended to Graphon MFGs (GMFGs) to include network structures between agents. Despite their merits, the real world applicability of GMFGs is limited by the fact that graphons only capture dense graphs. Since most empirically observed networks show some degree of sparsity, such as power law graphs, the GMFG framework is insufficient for capturing these network topologies. Thus, we introduce the novel concept of Graphex MFGs (GXMFGs) which builds on the graph theoretical concept of graphexes. Graphexes are the limiting objects to sparse graph sequences that also have other desirable features such as the small world property. Learning equilibria in these games is challenging due to the rich and sparse structure of the underlying graphs. To tackle these challenges, we design a new learning algorithm tailored to the GXMFG setup. This hybrid graphex learning approach leverages that the system mainly consists of a highly connected core and a sparse periphery. After defining the system and providing a theoretical analysis, we state our learning approach and demonstrate its learning capabilities on both synthetic graphs and real-world networks. This comparison shows that our GXMFG learning algorithm successfully extends MFGs to a highly relevant class of hard, realistic learning problems that are not accurately addressed by current MARL and MFG methods.
Paper Structure (47 sections, 8 theorems, 122 equations, 4 figures, 3 tables, 1 algorithm)

This paper contains 47 sections, 8 theorems, 122 equations, 4 figures, 3 tables, 1 algorithm.

Key Result

Lemma 1

Under Assumption as:lipschitz there exists a mean field core equilibrium $(\boldsymbol{\mu}, \boldsymbol{\pi}) \in \boldsymbol{\mathcal{M}}^\infty \times \boldsymbol{\Pi}^\infty$.

Figures (4)

  • Figure 1: Four networks, each with around 15k nodes and 50k edges. The Erdős-Rényi graph (left) generated by a standard graphon does not have any high degree nodes. The Lp graphon graph (middle left) has a few high degree nodes expressed by the flat degree distribution tail. The graphex (middle right) and real subsampled Flickr network (right) (data from mislove2007measurementkunegis2013konect) show a very similar degree distribution with rather broad tails and a core-periphery like structure.
  • Figure 2: The core exploitability $\mathcal{E}$ is optimized with respect to the iterations of Algorithm \ref{['algo:homd']}. (a): SIS; (b): SIR; (c): RS.
  • Figure 3: Difference $\Delta \mu^k = \frac{1}{2T} \operatorname{\mathbb E} \left[ \sum_t \lVert \hat{\mu}^k_t - \mu^k_t \rVert_1 \right]$ and $\Delta \mu^\infty = \frac{1}{2T} \operatorname{\mathbb E} \left[ \sum_t \lVert \hat{\mu}^\infty_t - \mu^\infty_t \rVert_1 \right]$ of the periphery $k$-degree and core MFs against the empirical $k$-degree and $k > k_{\max}$ MFs with respect to $\nu$ for sampled graphs ($\pm$ 95% confidence interval, 20 trials). The parameters $\nu = 10$ and $\nu = 750$ corresponds to approximately $N=40$ and $N=26750$ nodes. (a): SIS; (b): SIR; (c): RS.
  • Figure 4: Comparison of the empirically observed MF on a real network with the predicted equilibrium for (a) SIS: $x=I$ on Prosper; (b-c) SIR: $x=I$ and $x=R$ on Dogster; and (d) RS: $x=U$ on Pokec.

Theorems & Definitions (17)

  • Definition 1
  • Definition 2
  • Lemma 1
  • Remark 1
  • Theorem 1: Core MF convergence
  • Theorem 2: Periphery MF convergence
  • Theorem 3: Core approximate optimality
  • Theorem 4: Periphery approximate optimality
  • Lemma 2
  • proof : Proof of Lemma \ref{['lem:impl_conv_op']}
  • ...and 7 more