Spectral Recovery in the Labeled SBM

Julia Gaudio; Heming Liu

Spectral Recovery in the Labeled SBM

Julia Gaudio, Heming Liu

TL;DR

This work addresses exact community recovery in the Labeled Stochastic Block Model (LSBM), a labeled extension of the SBM. It introduces a simple spectral algorithm that uses $L$ labeled matrices to emulate the likelihood structure and demonstrates exact recovery down to the information-theoretic threshold in the log-degree regime, under a distinct, nonzero eigenvalue condition on the parameter matrices. The results extend prior CSBM (the $L=2$ case) to general $L$, showing that appropriate spectral encodings can achieve the IT threshold for nearly all parameters. The analysis combines entrywise eigenvector perturbation with degree-profile separation via Chernoff-type bounds, contributing a principled, scalable approach to spectral methods for labeled network models.

Abstract

We consider the problem of exact community recovery in the Labeled Stochastic Block Model (LSBM) with $k$ communities, where each pair of vertices is associated with a label from the set $\{0,1, \dots, L\}$. A pair of vertices from communities $i,j$ is given label $\ell$ with probability $p_{ij}^{(\ell)}$, and the goal is to recover the community partition. We propose a simple spectral algorithm for exact community recovery, and show that it achieves the information-theoretic threshold in the logarithmic-degree regime, under the assumption that the eigenvalues of certain parameter matrices are distinct and nonzero. Our results generalize recent work of Dhara, Gaudio, Mossel, and Sandon (2023), who showed that a spectral algorithm achieves the information-theoretic threshold in the Censored SBM, which is equivalent to the LSBM with $L = 2$. Interestingly, their algorithm uses eigenvectors from two matrix representations of the graph, while our algorithm uses eigenvectors from $L$ matrices.

Spectral Recovery in the Labeled SBM

TL;DR

This work addresses exact community recovery in the Labeled Stochastic Block Model (LSBM), a labeled extension of the SBM. It introduces a simple spectral algorithm that uses

labeled matrices to emulate the likelihood structure and demonstrates exact recovery down to the information-theoretic threshold in the log-degree regime, under a distinct, nonzero eigenvalue condition on the parameter matrices. The results extend prior CSBM (the

case) to general

, showing that appropriate spectral encodings can achieve the IT threshold for nearly all parameters. The analysis combines entrywise eigenvector perturbation with degree-profile separation via Chernoff-type bounds, contributing a principled, scalable approach to spectral methods for labeled network models.

Abstract

We consider the problem of exact community recovery in the Labeled Stochastic Block Model (LSBM) with

communities, where each pair of vertices is associated with a label from the set

. A pair of vertices from communities

is given label

with probability

, and the goal is to recover the community partition. We propose a simple spectral algorithm for exact community recovery, and show that it achieves the information-theoretic threshold in the logarithmic-degree regime, under the assumption that the eigenvalues of certain parameter matrices are distinct and nonzero. Our results generalize recent work of Dhara, Gaudio, Mossel, and Sandon (2023), who showed that a spectral algorithm achieves the information-theoretic threshold in the Censored SBM, which is equivalent to the LSBM with

. Interestingly, their algorithm uses eigenvectors from two matrix representations of the graph, while our algorithm uses eigenvectors from

matrices.

Paper Structure (7 sections, 5 theorems, 34 equations)

This paper contains 7 sections, 5 theorems, 34 equations.

Introduction
Model and Information-Theoretic Threshold
Model Definition
Information-Theoretic Threshold
Spectral Algorithm
Proof of Exact Recovery
Conclusion

Key Result

Theorem 2.3

Consider $\pi \in \mathbb{R}^k$, $\{q_{ij}^{(\ell)}\}_{i,j \in [k], \ell \in [L]}$, and $t > 0$. Let $G_n \sim \text{LSBM}(\pi, q, t, n)$. For $i \in [k]$, let $\theta_i$ be the $k \times L$ matrix whose $(j,\ell)$ entry is $\pi_j q_{ij}^{(\ell)}$. Define

Theorems & Definitions (11)

Definition 2.1
Definition 2.2
Theorem 2.3
Theorem 3.1
Lemma 4.1
Lemma 4.2
proof
proof : Proof of Theorem \ref{['theorem:spectral']}
Definition 5.1
Theorem 5.2: Theorem 3, Yun2016
...and 1 more

Spectral Recovery in the Labeled SBM

TL;DR

Abstract

Spectral Recovery in the Labeled SBM

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (11)