Stochastic Block Model for Hypergraphs: Statistical limits and a semidefinite programming approach
Chiheon Kim, Afonso S. Bandeira, Michel X. Goemans
TL;DR
The paper establishes a sharp information-theoretic threshold for exact recovery in the stochastic block model on $k$-uniform hypergraphs, showing that recovery is possible via ML when $I( ext{alpha}, ext{beta})>1$ and impossible when $I( ext{alpha}, ext{beta})<1$. It introduces a tractable truncate-and-relax SDP algorithm and proves exact recovery under a related, parameter-dependent threshold $I_{ ext{sdp}}( ext{alpha}, ext{beta})>1$, while also giving a complementary lower bound $I_2( ext{alpha}, ext{beta})$ governing the algorithm’s failure; simulations indicate the truncation threshold aligns with $I_2( ext{alpha}, ext{beta})=1$, suggesting a gap between statistical limits and SDP performance for larger $k$. The results generalize two-community graph SBM phase transitions to hypergraphs and quantify the trade-off between statistical limits and computationally efficient recovery methods. Overall, the work clarifies when convex relaxations can attain the information-theoretic limits and where computational gaps persist, thereby guiding future algorithm design for higher-order relational data.
Abstract
We study the problem of community detection in a random hypergraph model which we call the stochastic block model for $k$-uniform hypergraphs ($k$-SBM). We investigate the exact recovery problem in $k$-SBM and show that a sharp phase transition occurs around a threshold: below the threshold it is impossible to recover the communities with non-vanishing probability, yet above the threshold there is an estimator which recovers the communities almost asymptotically surely. We also consider a simple, efficient algorithm for the exact recovery problem which is based on a semidefinite relaxation technique.
