Private graphon estimation via sum-of-squares
Hongjie Chen, Jingqiu Ding, Tommaso d'Orsi, Yiding Hua, Chih-Hung Liu, David Steurer
TL;DR
This work delivers the first polynomial-time, node-differentially private algorithms for learning stochastic block models and graphon estimation with fixed numbers of blocks, achieving statistical utilities comparable to the best exponential-time private mechanisms. The core idea is to run an exponential mechanism over a sum-of-squares relaxation of the underlying combinatorial score, with the sos level determined by the target block count. A key technical contribution is showing that graphon distances reduce to quadratic optimization over the Birkhoff polytope, enabling low-degree sos certificates and multiplicative approximation guarantees. The authors also develop robust Lipschitz-extension techniques within the sos framework to extend privacy guarantees beyond maximum-degree constraints and establish lower bounds on sample complexity, as well as non-private improvements in the same computational regime. Overall, the results bridge computational tractability and privacy, yielding practical, provably-private graphon and SBM learning algorithms with strong theoretical guarantees.
Abstract
We develop the first pure node-differentially-private algorithms for learning stochastic block models and for graphon estimation with polynomial running time for any constant number of blocks. The statistical utility guarantees match those of the previous best information-theoretic (exponential-time) node-private mechanisms for these problems. The algorithm is based on an exponential mechanism for a score function defined in terms of a sum-of-squares relaxation whose level depends on the number of blocks. The key ingredients of our results are (1) a characterization of the distance between the block graphons in terms of a quadratic optimization over the polytope of doubly stochastic matrices, (2) a general sum-of-squares convergence result for polynomial optimization over arbitrary polytopes, and (3) a general approach to perform Lipschitz extensions of score functions as part of the sum-of-squares algorithmic paradigm.
