Finding planted cliques using gradient descent
Reza Gheissari, Aukosh Jagannath, Yiming Xu
TL;DR
This work shows that a black-box optimization approach based on a Lagrange-multiplier Hamiltonian H_G,γ(U) can recover planted cliques at the information-theoretic √n scale when allowed to optimize over all subgraphs and initialized from the full vertex set. The authors analyze both gradient descent and a low-temperature Gibbs sampler, proving that the planted clique is the global minimizer for γ>1 and that the full-graph initialization enables efficient recovery in O(n) steps, while the natural empty initialization fails due to entropic barriers. They characterize a rugged energy landscape with many small local minima and demonstrate sharp initialization-dependent behavior, complemented by a robustness extension to semi-random contaminations. The results bridge a gap between problem-specific algorithms and black-box optimization, offering insight into initialization sensitivity, landscape intricacy, and applicability to robust variants in planted-clique-like settings.
Abstract
The planted clique problem is a paradigmatic model of statistical-to-computational gaps: the planted clique is information-theoretically detectable if its size $k\ge 2\log_2 n$ but polynomial-time algorithms only exist for the recovery task when $k= Ω(\sqrt{n})$. By now, there are many algorithms that succeed as soon as $k = Ω(\sqrt{n})$. Glaringly, however, no black-box optimization method, e.g., gradient descent or the Metropolis process, has been shown to work. In fact, Chen, Mossel, and Zadik recently showed that any Metropolis process whose state space is the set of cliques fails to find any sub-linear sized planted clique in polynomial time if initialized naturally from the empty set. We show that using the method of Lagrange multipliers, namely optimizing the Hamiltonian given by the sum of the objective function and the clique constraint over the space of all subgraphs, succeeds. In particular, we prove that Markov chains which minimize this Hamiltonian (gradient descent and a low-temperature relaxation of it) succeed at recovering planted cliques of size $k = Ω(\sqrt{n})$ if initialized from the full graph. Importantly, initialized from the empty set, the relaxation still does not help the gradient descent find sub-linear planted cliques. We also demonstrate robustness of these Markov chain approaches under a natural contamination model.
