Table of Contents
Fetching ...

Robust Gradient Descent for Phase Retrieval

Alex Buna, Patrick Rebeschini

TL;DR

The paper studies robust phase retrieval where measurements $y=(\boldsymbol{a}^\top \boldsymbol{x^*})^2+z$ are contaminated by heavy-tailed noise and adversarial outliers. It introduces a robust gradient-descent framework built on Wirtinger Flow, using robust mean estimators in place of empirical gradients and two spectral initialization strategies (MeanEstStab and CovEst) to handle zero-mean and unknown-mean noise. Theoretical guarantees show good initialization within a strongly convex region and linear convergence of the iterative updates, with sample complexities $m_0=O(n^2\log n)$ (MeanEstStab) or $m_0=O(n)$ (CovEst) for initialization and per-iteration samples $\tilde{m}=O(n\log n)$, yielding near-optimal total complexity. For unknown mean noise, a preprocessing step reduces to the zero-mean setting while preserving guarantees. Overall, the work advances robust non-convex phase retrieval by providing practical algorithms with explicit finite-sample performance under heavy-tailed noise and arbitrary contamination.

Abstract

Recent progress in robust statistical learning has mainly tackled convex problems, like mean estimation or linear regression, with non-convex challenges receiving less attention. Phase retrieval exemplifies such a non-convex problem, requiring the recovery of a signal from only the magnitudes of its linear measurements, without phase (sign) information. While several non-convex methods, especially those involving the Wirtinger Flow algorithm, have been proposed for noiseless or mild noise settings, developing solutions for heavy-tailed noise and adversarial corruption remains an open challenge. In this paper, we investigate an approach that leverages robust gradient descent techniques to improve the Wirtinger Flow algorithm's ability to simultaneously cope with fourth moment bounded noise and adversarial contamination in both the inputs (covariates) and outputs (responses). We address two scenarios: known zero-mean noise and completely unknown noise. For the latter, we propose a preprocessing step that alters the problem into a new format that does not fit traditional phase retrieval approaches but can still be resolved with a tailored version of the algorithm for the zero-mean noise context.

Robust Gradient Descent for Phase Retrieval

TL;DR

The paper studies robust phase retrieval where measurements are contaminated by heavy-tailed noise and adversarial outliers. It introduces a robust gradient-descent framework built on Wirtinger Flow, using robust mean estimators in place of empirical gradients and two spectral initialization strategies (MeanEstStab and CovEst) to handle zero-mean and unknown-mean noise. Theoretical guarantees show good initialization within a strongly convex region and linear convergence of the iterative updates, with sample complexities (MeanEstStab) or (CovEst) for initialization and per-iteration samples , yielding near-optimal total complexity. For unknown mean noise, a preprocessing step reduces to the zero-mean setting while preserving guarantees. Overall, the work advances robust non-convex phase retrieval by providing practical algorithms with explicit finite-sample performance under heavy-tailed noise and arbitrary contamination.

Abstract

Recent progress in robust statistical learning has mainly tackled convex problems, like mean estimation or linear regression, with non-convex challenges receiving less attention. Phase retrieval exemplifies such a non-convex problem, requiring the recovery of a signal from only the magnitudes of its linear measurements, without phase (sign) information. While several non-convex methods, especially those involving the Wirtinger Flow algorithm, have been proposed for noiseless or mild noise settings, developing solutions for heavy-tailed noise and adversarial corruption remains an open challenge. In this paper, we investigate an approach that leverages robust gradient descent techniques to improve the Wirtinger Flow algorithm's ability to simultaneously cope with fourth moment bounded noise and adversarial contamination in both the inputs (covariates) and outputs (responses). We address two scenarios: known zero-mean noise and completely unknown noise. For the latter, we propose a preprocessing step that alters the problem into a new format that does not fit traditional phase retrieval approaches but can still be resolved with a tailored version of the algorithm for the zero-mean noise context.

Paper Structure

This paper contains 25 sections, 15 theorems, 57 equations, 4 figures, 1 table.

Key Result

Lemma 2.1

The population risk (population_risk) is $\alpha:=4\left\lVert\boldsymbol{x^*}\right\rVert^2$-strongly convex and $\beta:={73}\left\lVert\boldsymbol{x^*}\right\rVert^2/9$-smooth in a ball around $\boldsymbol{x^*}$ of radius $R:=\left\lVert\boldsymbol{x^*}\right\rVert/9$. That is: for all $\boldsymbol{x}\in\mathbb{R}^n$ with $\left\lVert\boldsymbol{x}-\boldsymbol{x^*}\right\rVert\leq \left\lVert\b

Theorems & Definitions (33)

  • Definition 2.0.1
  • Lemma 2.1
  • Definition 2.1.1
  • Lemma 2.2
  • Proposition 2.2.1
  • Proposition 2.2.2
  • Lemma 2.3
  • Theorem 3.1
  • Remark 3.1.1
  • Remark 3.1.2
  • ...and 23 more