Table of Contents
Fetching ...

Improved Robust Estimation for Erdős-Rényi Graphs: The Sparse Regime and Optimal Breakdown Point

Hongjie Chen, Jingqiu Ding, Yiding Hua, Stefan Tiegel

TL;DR

The paper tackles robust estimation of the Erdős–Rényi edge density under node corruptions, introducing the first polynomial-time algorithm that achieves near-optimal error and the optimal breakdown point of 1/2. It develops a two-track strategy: an inefficient identifiability-based method and an efficient Sum-of-Squares (SoS) based approach, with constant-degree certificates for concentration over small sets and a diagonal reweighting to handle sparse regimes. The main contribution is a practical SoS-certified estimator that, with high probability, outputs $\hat{d}$ satisfying $|\hat{d}-d^\filledcirc| \lesssim \sqrt{\frac{\log(n)\,d^\filledcirc}{n}} + \text{(η-dependent terms)}$, matching information-theoretic lower bounds up to $\log(1/η)$ factors. This yields robust edge-density estimation that remains effective in sparse graphs, broadening applicability to real-world networks with adversarial perturbations. The work also clarifies the role of small-set edge-concentration certificates and demonstrates how to convert identifiability proofs into efficient SoS-based algorithms.

Abstract

We study the problem of robustly estimating the edge density of Erdős-Rényi random graphs $G(n, d^\circ/n)$ when an adversary can arbitrarily add or remove edges incident to an $η$-fraction of the nodes. We develop the first polynomial-time algorithm for this problem that estimates $d^\circ$ up to an additive error $O([\sqrt{\log(n) / n} + η\sqrt{\log(1/η)} ] \cdot \sqrt{d^\circ} + η\log(1/η))$. Our error guarantee matches information-theoretic lower bounds up to factors of $\log(1/η)$. Moreover, our estimator works for all $d^\circ \geq Ω(1)$ and achieves optimal breakdown point $η= 1/2$. Previous algorithms [AJK+22, CDHS24], including inefficient ones, incur significantly suboptimal errors. Furthermore, even admitting suboptimal error guarantees, only inefficient algorithms achieve optimal breakdown point. Our algorithm is based on the sum-of-squares (SoS) hierarchy. A key ingredient is to construct constant-degree SoS certificates for concentration of the number of edges incident to small sets in $G(n, d^\circ/n)$. Crucially, we show that these certificates also exist in the sparse regime, when $d^\circ = o(\log n)$, a regime in which the performance of previous algorithms was significantly suboptimal.

Improved Robust Estimation for Erdős-Rényi Graphs: The Sparse Regime and Optimal Breakdown Point

TL;DR

The paper tackles robust estimation of the Erdős–Rényi edge density under node corruptions, introducing the first polynomial-time algorithm that achieves near-optimal error and the optimal breakdown point of 1/2. It develops a two-track strategy: an inefficient identifiability-based method and an efficient Sum-of-Squares (SoS) based approach, with constant-degree certificates for concentration over small sets and a diagonal reweighting to handle sparse regimes. The main contribution is a practical SoS-certified estimator that, with high probability, outputs satisfying , matching information-theoretic lower bounds up to factors. This yields robust edge-density estimation that remains effective in sparse graphs, broadening applicability to real-world networks with adversarial perturbations. The work also clarifies the role of small-set edge-concentration certificates and demonstrates how to convert identifiability proofs into efficient SoS-based algorithms.

Abstract

We study the problem of robustly estimating the edge density of Erdős-Rényi random graphs when an adversary can arbitrarily add or remove edges incident to an -fraction of the nodes. We develop the first polynomial-time algorithm for this problem that estimates up to an additive error . Our error guarantee matches information-theoretic lower bounds up to factors of . Moreover, our estimator works for all and achieves optimal breakdown point . Previous algorithms [AJK+22, CDHS24], including inefficient ones, incur significantly suboptimal errors. Furthermore, even admitting suboptimal error guarantees, only inefficient algorithms achieve optimal breakdown point. Our algorithm is based on the sum-of-squares (SoS) hierarchy. A key ingredient is to construct constant-degree SoS certificates for concentration of the number of edges incident to small sets in . Crucially, we show that these certificates also exist in the sparse regime, when , a regime in which the performance of previous algorithms was significantly suboptimal.

Paper Structure

This paper contains 32 sections, 18 theorems, 117 equations, 1 algorithm.

Key Result

Theorem 1.2

For any $0 \leqslant \eta < 1/2$ and $d^\circ \geqslant 1$, there exists a polynomial-time algorithm which, given $\eta$ and a graph that is an $\eta$-corruption of an Erdős-Rényi random graph sampled from $\varmathbb G(n, d^\circ/n)$, outputs an estimator $\hat{d}$ satisfying with probability at least $1-1/\mathop{\mathrm{poly}}\nolimits(n)$.

Theorems & Definitions (39)

  • Definition 1.1: $\eta$-corrupted Erdős-Rényi random graphs
  • Theorem 1.2: Informal restatement of \ref{['thm:main_algo']}
  • Definition 2.1: $\delta$-good graphs
  • Theorem 3.1
  • Remark 3.2
  • Lemma 3.3: Feasibility
  • Lemma 3.4: Utility
  • proof : Proof of \ref{['thm:main_algo']}
  • Lemma 3.6
  • proof
  • ...and 29 more