PriME: Privacy-aware Membership profile Estimation in networks
Abhinav Chakraborty, Sayak Chatterjee, Sagnik Nandy
TL;DR
This work tackles private estimation of membership profiles in networks generated by the Degree-Corrected Mixed Membership Stochastic Block Model (DCMM) under $\varepsilon$-edge Local Differential Privacy. It introduces PriME, a privacy-preserving spectral method that first privatizes the observed graph via a symmetric edge-flip mechanism and then recovers the membership matrix $\Pi$ using post-PCA SCORE normalization and a Sketched Vertex Hunting step. The authors establish a minimax lower bound on estimation risk under privacy and prove that PriME achieves the corresponding rate upper bound, up to logarithmic factors, demonstrating minimax optimality under edge-LDP. Empirical results on synthetic data and real networks (Facebook Ego Networks and Political Blogs) illustrate the privacy-utility trade-off and the method’s practical viability, while discussing how degree heterogeneity and signal strength modulate performance. The work advances privacy-aware network analysis by providing the first minimax-rate results for private estimation in MMSBMs with mixed memberships and a concrete algorithm to achieve them.
Abstract
This paper presents a novel approach to estimating community membership probabilities for network vertices generated by the Degree Corrected Mixed Membership Stochastic Block Model while preserving individual edge privacy. Operating within the $\varepsilon$-edge local differential privacy framework, we introduce an optimal private algorithm based on a symmetric edge flip mechanism and spectral clustering for accurate estimation of vertex community memberships. We conduct a comprehensive analysis of the estimation risk and establish the optimality of our procedure by providing matching lower bounds to the minimax risk under privacy constraints. To validate our approach, we demonstrate its performance through numerical simulations and its practical application to real-world data. This work represents a significant step forward in balancing accurate community membership estimation with stringent privacy preservation in network data analysis.
