Table of Contents
Fetching ...

Matching Algorithms in the Sparse Stochastic Block Model

Anna Brandenberger, Byron Chin, Nathan S. Sheffield, Divya Shyamal

TL;DR

This work extends the classical Karp–Sipser analysis from sparse Erdős–Rényi graphs to the stochastic block model, focusing on the sparse regime $p_{ij}=\Theta(1/n)$ and distinguishing regimes where simple offline and online matching algorithms remain near-optimal. The authors show that the Karp–Sipser algorithm achieves near-optimal offline matchings in equitable SBMs, the sub-critical regime, and certain bipartite ER-like cases, but can fail on more general block structures; they develop a blocked configuration model and a differential-equation framework (via Wormald’s theorem) to formalize these results. For online matching, they analyze four linear-time heuristics, proving GREEDY is optimal in the equitable case and BRUTE-FORCE is asymptotically optimal in general, with the others lacking universal optimality; they provide counterexamples and discuss potential label-aware improvements. Overall, the paper advances understanding of matching in structured random graphs, clarifying when simple algorithms suffice and when more sophisticated strategies are necessary, with implications for scalable online allocation in networks with community structure. The results bridge offline and online perspectives and offer a foundation for further refinement of algorithms tailored to stochastic block models.

Abstract

The stochastic block model (SBM) is a generalization of the Erdős--Rényi model of random graphs that describes the interaction of a finite number of distinct communities. In sparse Erdős--Rényi graphs, it is known that a linear-time algorithm of Karp and Sipser achieves near-optimal matching sizes asymptotically almost surely, giving a law-of-large numbers for the matching sizes of such graphs in terms of solutions to an ODE. We provide an extension of this analysis, identifying broad ranges of stochastic block model parameters for which the Karp--Sipser algorithm achieves near-optimal matching sizes, but demonstrating that it cannot perform optimally on general SBM instances. We also consider the problem of constructing a matching online, in which the vertices of one half of a bipartite stochastic block model arrive one-at-a-time, and must be matched as they arrive. We show that the competitive ratio lower bound of 0.837 found by Mastin and Jaillet for the Erdős--Rényi case is tight whenever the expected degrees in all communities are equal. We propose several linear-time algorithms for online matching in the general stochastic block model, but prove that despite very good experimental performance, none of these achieve online asymptotic optimality.

Matching Algorithms in the Sparse Stochastic Block Model

TL;DR

This work extends the classical Karp–Sipser analysis from sparse Erdős–Rényi graphs to the stochastic block model, focusing on the sparse regime and distinguishing regimes where simple offline and online matching algorithms remain near-optimal. The authors show that the Karp–Sipser algorithm achieves near-optimal offline matchings in equitable SBMs, the sub-critical regime, and certain bipartite ER-like cases, but can fail on more general block structures; they develop a blocked configuration model and a differential-equation framework (via Wormald’s theorem) to formalize these results. For online matching, they analyze four linear-time heuristics, proving GREEDY is optimal in the equitable case and BRUTE-FORCE is asymptotically optimal in general, with the others lacking universal optimality; they provide counterexamples and discuss potential label-aware improvements. Overall, the paper advances understanding of matching in structured random graphs, clarifying when simple algorithms suffice and when more sophisticated strategies are necessary, with implications for scalable online allocation in networks with community structure. The results bridge offline and online perspectives and offer a foundation for further refinement of algorithms tailored to stochastic block models.

Abstract

The stochastic block model (SBM) is a generalization of the Erdős--Rényi model of random graphs that describes the interaction of a finite number of distinct communities. In sparse Erdős--Rényi graphs, it is known that a linear-time algorithm of Karp and Sipser achieves near-optimal matching sizes asymptotically almost surely, giving a law-of-large numbers for the matching sizes of such graphs in terms of solutions to an ODE. We provide an extension of this analysis, identifying broad ranges of stochastic block model parameters for which the Karp--Sipser algorithm achieves near-optimal matching sizes, but demonstrating that it cannot perform optimally on general SBM instances. We also consider the problem of constructing a matching online, in which the vertices of one half of a bipartite stochastic block model arrive one-at-a-time, and must be matched as they arrive. We show that the competitive ratio lower bound of 0.837 found by Mastin and Jaillet for the Erdős--Rényi case is tight whenever the expected degrees in all communities are equal. We propose several linear-time algorithms for online matching in the general stochastic block model, but prove that despite very good experimental performance, none of these achieve online asymptotic optimality.
Paper Structure (32 sections, 20 theorems, 39 equations, 7 figures, 6 algorithms)

This paper contains 32 sections, 20 theorems, 39 equations, 7 figures, 6 algorithms.

Key Result

Lemma 2.2

Suppose we generate a graph from the blocked configuration model and run the Karp--Sipser algorithm for an arbitrary number of steps. Then conditioned on the following, the resulting graph is still distributed according to the blocked configuration model. Thus, if we collect all of these values into a tuple $Y = (E_{ij}, T_i, F_i)$, the algorithm's progress can be described by a Markov chain on $

Figures (7)

  • Figure 1: A plot showing that for $c_{11} = 5.6$, $c_{12} = 5.6$, $c_{22} = 0$, the system has only 1 fixed point, and so we are subcritical. Here the horizontal axis is $x_1$ and the vertical axis is $x_2$; the green line is the set of points fixing $x_1$ and the yellow line is the set of points fixing $x_2$. Note that they have only a single intersection.
  • Figure 2: An example on which the Karp--Sipser algorithm performs suboptimally.
  • Figure 3: DEGREEDY beats GREEDY. $|L_0|=n$, $|R_0|=|R_1|=\frac{n}{2}$
  • Figure 4: SHORTSIGHTED beats DEGREEDY. All classes of size $\frac{n}{2}$
  • Figure 5: BRUTE-FORCE beats SHORTSIGHTED. All classes of size $\frac{n}{2}$
  • ...and 2 more figures

Theorems & Definitions (45)

  • Definition 1.1: Stochastic block model
  • Definition 1.2: Matching
  • Definition 1.3: Online bipartite matching problem
  • Definition 2.1: Blocked configuration model
  • Lemma 2.2
  • Lemma 2.3
  • Corollary 2.4
  • Corollary 2.5
  • proof
  • Lemma 2.6
  • ...and 35 more