Table of Contents
Fetching ...

Learning Mean Field Control on Sparse Graphs

Christian Fabian, Kai Cui, Heinz Koeppl

TL;DR

This work tackles cooperative MARL on extremely sparse graphs where the average degree remains finite, a regime where graphon-based mean-field models underperform. It introduces Locally Weak Mean Field Control (LWMFC), leveraging local weak convergence to define degree-specific mean fields and a limiting system, together with a two-systems approximation to manage high-degree neighborhoods. The authors prove MF and objective convergence and develop scalable learning algorithms, including a limiting MFC MDP solver and a MARL method that operates directly on real networks. Empirical results on synthetic Chung-Lu graphs and eight real networks demonstrate that LWMFC and its extensive variant outperform existing mean-field approaches, providing practical tools for scalable policy learning in sparse networks with diverging degree variance.

Abstract

Large agent networks are abundant in applications and nature and pose difficult challenges in the field of multi-agent reinforcement learning (MARL) due to their computational and theoretical complexity. While graphon mean field games and their extensions provide efficient learning algorithms for dense and moderately sparse agent networks, the case of realistic sparser graphs remains largely unsolved. Thus, we propose a novel mean field control model inspired by local weak convergence to include sparse graphs such as power law networks with coefficients above two. Besides a theoretical analysis, we design scalable learning algorithms which apply to the challenging class of graph sequences with finite first moment. We compare our model and algorithms for various examples on synthetic and real world networks with mean field algorithms based on Lp graphons and graphexes. As it turns out, our approach outperforms existing methods in many examples and on various networks due to the special design aiming at an important, but so far hard to solve class of MARL problems.

Learning Mean Field Control on Sparse Graphs

TL;DR

This work tackles cooperative MARL on extremely sparse graphs where the average degree remains finite, a regime where graphon-based mean-field models underperform. It introduces Locally Weak Mean Field Control (LWMFC), leveraging local weak convergence to define degree-specific mean fields and a limiting system, together with a two-systems approximation to manage high-degree neighborhoods. The authors prove MF and objective convergence and develop scalable learning algorithms, including a limiting MFC MDP solver and a MARL method that operates directly on real networks. Empirical results on synthetic Chung-Lu graphs and eight real networks demonstrate that LWMFC and its extensive variant outperform existing mean-field approaches, providing practical tools for scalable policy learning in sparse networks with diverging degree variance.

Abstract

Large agent networks are abundant in applications and nature and pose difficult challenges in the field of multi-agent reinforcement learning (MARL) due to their computational and theoretical complexity. While graphon mean field games and their extensions provide efficient learning algorithms for dense and moderately sparse agent networks, the case of realistic sparser graphs remains largely unsolved. Thus, we propose a novel mean field control model inspired by local weak convergence to include sparse graphs such as power law networks with coefficients above two. Besides a theoretical analysis, we design scalable learning algorithms which apply to the challenging class of graph sequences with finite first moment. We compare our model and algorithms for various examples on synthetic and real world networks with mean field algorithms based on Lp graphons and graphexes. As it turns out, our approach outperforms existing methods in many examples and on various networks due to the special design aiming at an important, but so far hard to solve class of MARL problems.

Paper Structure

This paper contains 35 sections, 4 theorems, 75 equations, 4 figures, 2 tables, 2 algorithms.

Key Result

Theorem 3.1

Under Assumption as:weak_conv, for any fixed policy ensemble $\pi$, the empirical MFs converge to the limiting MFs such that for all $k \in \mathbb{N}$ and all $t \in \mathcal{T}$

Figures (4)

  • Figure 1: Four networks, first two generated by an Lp graphon and graphex, third is a CL graph and fourth is a real subsampled YouTube (YT) network mislove2009onlinekunegis2013konect, highly connected nodes are depicted larger. Each network has around 14.5k nodes and 13k edges, except graphex has around 16.5k edges; all networks are plotted in the prefuse force directed layout (software: cytoscape). While the Lp graphon graph lacks sufficiently many high degree nodes, the tail of the graphex degree distribution is too heavy. In contrast, the CL graph is qualitatively close to the real YT network.
  • Figure 2: Overall MF evolution on real networks (50 trials, with two std. devs.), for our approx. (LW), our extensive approx. (LW$^*$), graphex (GX), and Lp graphon (LP) models: (a) SIS on Enron, (b) SIR on Slashdot, (c) Color on CAIDA (without LW$^*$), (d) Rumor on Cities (without LW$^*$).
  • Figure 3: Training curves of LWMFC policy gradient, LWMFMARL, and IPPO on a random CL graph with 406 nodes for: (a) SIS, (b) SIR, (c) Color, (d) Rumor.
  • Figure 4: Training curves of LWMFC policy gradient and LWMFMARL for four different examples: (a) SIS on Enron, (b) SIR on Slashdot, (c) Color on CAIDA, (d) Rumor on Cities.

Theorems & Definitions (11)

  • Definition 2.1: Local weak convergence in probability
  • Example 1: Power law
  • Theorem 3.1: MF convergence
  • Proposition 3.3: Objective convergence
  • Corollary 3.4: Optimal policy
  • Lemma 4.1
  • Example 2
  • proof
  • proof
  • proof
  • ...and 1 more