Stochastic Block Model for Hypergraphs: Statistical limits and a semidefinite programming approach

Chiheon Kim; Afonso S. Bandeira; Michel X. Goemans

Stochastic Block Model for Hypergraphs: Statistical limits and a semidefinite programming approach

Chiheon Kim, Afonso S. Bandeira, Michel X. Goemans

TL;DR

The paper establishes a sharp information-theoretic threshold for exact recovery in the stochastic block model on $k$-uniform hypergraphs, showing that recovery is possible via ML when $I( ext{alpha}, ext{beta})>1$ and impossible when $I( ext{alpha}, ext{beta})<1$. It introduces a tractable truncate-and-relax SDP algorithm and proves exact recovery under a related, parameter-dependent threshold $I_{ ext{sdp}}( ext{alpha}, ext{beta})>1$, while also giving a complementary lower bound $I_2( ext{alpha}, ext{beta})$ governing the algorithm’s failure; simulations indicate the truncation threshold aligns with $I_2( ext{alpha}, ext{beta})=1$, suggesting a gap between statistical limits and SDP performance for larger $k$. The results generalize two-community graph SBM phase transitions to hypergraphs and quantify the trade-off between statistical limits and computationally efficient recovery methods. Overall, the work clarifies when convex relaxations can attain the information-theoretic limits and where computational gaps persist, thereby guiding future algorithm design for higher-order relational data.

Abstract

We study the problem of community detection in a random hypergraph model which we call the stochastic block model for $k$-uniform hypergraphs ($k$-SBM). We investigate the exact recovery problem in $k$-SBM and show that a sharp phase transition occurs around a threshold: below the threshold it is impossible to recover the communities with non-vanishing probability, yet above the threshold there is an estimator which recovers the communities almost asymptotically surely. We also consider a simple, efficient algorithm for the exact recovery problem which is based on a semidefinite relaxation technique.

Stochastic Block Model for Hypergraphs: Statistical limits and a semidefinite programming approach

TL;DR

The paper establishes a sharp information-theoretic threshold for exact recovery in the stochastic block model on

-uniform hypergraphs, showing that recovery is possible via ML when

and impossible when

. It introduces a tractable truncate-and-relax SDP algorithm and proves exact recovery under a related, parameter-dependent threshold

, while also giving a complementary lower bound

governing the algorithm’s failure; simulations indicate the truncation threshold aligns with

, suggesting a gap between statistical limits and SDP performance for larger

. The results generalize two-community graph SBM phase transitions to hypergraphs and quantify the trade-off between statistical limits and computationally efficient recovery methods. Overall, the work clarifies when convex relaxations can attain the information-theoretic limits and where computational gaps persist, thereby guiding future algorithm design for higher-order relational data.

Abstract

We study the problem of community detection in a random hypergraph model which we call the stochastic block model for

-uniform hypergraphs (

-SBM). We investigate the exact recovery problem in

-SBM and show that a sharp phase transition occurs around a threshold: below the threshold it is impossible to recover the communities with non-vanishing probability, yet above the threshold there is an estimator which recovers the communities almost asymptotically surely. We also consider a simple, efficient algorithm for the exact recovery problem which is based on a semidefinite relaxation technique.

Stochastic Block Model for Hypergraphs: Statistical limits and a semidefinite programming approach

TL;DR

Abstract

Stochastic Block Model for Hypergraphs: Statistical limits and a semidefinite programming approach

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (30)