Label propagation on binomial random graphs

Marcos Kiwi; Lyuben Lichev; Dieter Mitsche; Paweł Prałat

Label propagation on binomial random graphs

Marcos Kiwi, Lyuben Lichev, Dieter Mitsche, Paweł Prałat

TL;DR

This work analyzes a non-binary label propagation algorithm on the Erdős-Rényi random graph $\mathcal{G}(n,p)$, where vertices iteratively adopt the majority label in their neighborhood with specific tie-breaking rules. The authors develop a multi-stage edge-exposure framework and introduce the ALAP procedure to decouple dependencies, enabling rigorous analysis of convergence behavior. They prove that for $n^{5/8+\varepsilon}\le np\ll n$ the process a.a.s. ends with a single label after five rounds, and they precisely characterize the surviving label in broader regimes: it is typically $1$ when $n^{2/3}\ll np\ll n$, but not when $n^{5/8+\varepsilon}\le np\ll n^{2/3}$; at the critical scaling $np=\Theta(n^{2/3})$ the surviving label has a nontrivial distribution. A key technical tool is a monotonicity lemma for binomial tails (Lemma bis), together with couplings and Berry–Esseen/Slud-type bounds, which may have independent interest for related stochastic processes. The results illuminate how random initial labeling and dense graph topology interact to drive consensus (or a limited diversity) in majority-dynamics like LPA, with implications for understanding community detection under randomness.

Abstract

We study the behavior of a label propagation algorithm (LPA) on the Erdős-Rényi random graph $\mathcal{G}(n,p)$. Initially, given a network, each vertex starts with a random label in the interval $[0,1]$. Then, in each round of LPA, every vertex switches its label to the majority label in its neighborhood (including its own label). At the first round, ties are broken towards smaller labels, while at each of the next rounds, ties are broken uniformly at random. The algorithm terminates once all labels stay the same in two consecutive iterations. LPA is successfully used in practice for detecting communities in networks (corresponding to vertex sets with the same label after termination of the algorithm). Perhaps surprisingly, LPA's performance on dense random graphs is hard to analyze, and so far convergence to consensus was known only when $np\ge n^{3/4+\varepsilon}$, where LPA converges in three rounds. By defining an alternative label attribution procedure which converges to the label propagation algorithm after three rounds, a careful multi-stage exposure of the edges allows us to break the $n^{3/4+\varepsilon}$ barrier and show that, when $np \ge n^{5/8+\varepsilon}$, a.a.s.\ the algorithm terminates with a single label. Moreover, we show that, if $np\gg n^{2/3}$, a.a.s.\ this label is the smallest one, whereas if $n^{5/8+\varepsilon}\le np\ll n^{2/3}$, the surviving label is a.a.s.\ not the smallest one. En passant, we show a presumably new monotonicity lemma for Binomial random variables that might be of independent interest.

Label propagation on binomial random graphs

TL;DR

This work analyzes a non-binary label propagation algorithm on the Erdős-Rényi random graph

, where vertices iteratively adopt the majority label in their neighborhood with specific tie-breaking rules. The authors develop a multi-stage edge-exposure framework and introduce the ALAP procedure to decouple dependencies, enabling rigorous analysis of convergence behavior. They prove that for

the process a.a.s. ends with a single label after five rounds, and they precisely characterize the surviving label in broader regimes: it is typically

when

, but not when

; at the critical scaling

the surviving label has a nontrivial distribution. A key technical tool is a monotonicity lemma for binomial tails (Lemma bis), together with couplings and Berry–Esseen/Slud-type bounds, which may have independent interest for related stochastic processes. The results illuminate how random initial labeling and dense graph topology interact to drive consensus (or a limited diversity) in majority-dynamics like LPA, with implications for understanding community detection under randomness.

Abstract

We study the behavior of a label propagation algorithm (LPA) on the Erdős-Rényi random graph

. Initially, given a network, each vertex starts with a random label in the interval

. Then, in each round of LPA, every vertex switches its label to the majority label in its neighborhood (including its own label). At the first round, ties are broken towards smaller labels, while at each of the next rounds, ties are broken uniformly at random. The algorithm terminates once all labels stay the same in two consecutive iterations. LPA is successfully used in practice for detecting communities in networks (corresponding to vertex sets with the same label after termination of the algorithm). Perhaps surprisingly, LPA's performance on dense random graphs is hard to analyze, and so far convergence to consensus was known only when

, where LPA converges in three rounds. By defining an alternative label attribution procedure which converges to the label propagation algorithm after three rounds, a careful multi-stage exposure of the edges allows us to break the

barrier and show that, when

, a.a.s.\ the algorithm terminates with a single label. Moreover, we show that, if

, a.a.s.\ this label is the smallest one, whereas if

, the surviving label is a.a.s.\ not the smallest one. En passant, we show a presumably new monotonicity lemma for Binomial random variables that might be of independent interest.

Paper Structure (26 sections, 28 theorems, 247 equations)

This paper contains 26 sections, 28 theorems, 247 equations.

Introduction
Main results
Outline of the proof
Technical contributions
Preliminaries and Notation
Notation
Preliminaries
First Two Rounds
Properties of the basins
Consequences of the basin sizes: the second round
The Alternative Label Attribution Procedure (ALAP).
Step 1.
Step 2.
Proof of Lemma \ref{['lem:bis']}
Showing I1.
...and 11 more sections

Key Result

Theorem 1.1

Suppose that $\varepsilon\in (0,1/24)$ and $n^{5/8+\varepsilon}\le np \ll n$. Then, a.a.s. after five rounds of the process, all vertices carry the label that was most represented after the first round. Moreover,

Theorems & Definitions (66)

Theorem 1.1
Remark 1.2
Lemma 2.1
Lemma 2.2: see Lemma 2.1 in Slu77
Lemma 2.3
proof
Remark 2.4
Remark 2.5
Lemma 2.6
Lemma 3.1
...and 56 more

Label propagation on binomial random graphs

TL;DR

Abstract

Label propagation on binomial random graphs

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (66)