Table of Contents
Fetching ...

Robust Least Squares Problems with Binary Uncertain Data

Yang Zhou, Xiaojun Chen

TL;DR

This work tackles robust least squares with binary or hypercube adversarial noise by formulating a minimax problem $\min_{x \in \mathcal{X}} \max_{y \in \mathcal{Y}} \Theta(x,y)$ where $\Theta(x,y)=\tfrac{1}{2}\|F(x)-C y\|^2$. It reveals that the geometry of the noise-propagation matrix $C$—whether its columns form acute or obtuse angles—determines whether the inner maximization is supermodular or submodular, enabling submodular optimization tools within a gradient-based minimax framework. For the supermodular linear case, a Lovász-extension-based approach relates global minimax points to saddle points, yielding an $\epsilon$-global minimax point in $O(\epsilon^{-2})$ iterations; for the nonlinear case, a randomized projected-gradient method achieves $\epsilon$-stationarity in $O(\epsilon^{-4})$. In the submodular linear case, a double greedy subsolver provides a $(\tfrac{1}{3},\epsilon)$-approximate minimax point in $O(\epsilon^{-2})$, with exactness when $C$ is column-orthogonal. Numerical experiments on health status prediction and phase retrieval demonstrate BRLS's superior robustness to structured noise compared with classical LS and LASSO, highlighting its practical impact in adversarially noisy settings.

Abstract

We propose a Binary Robust Least Squares (BRLS) model that encompasses key robust least squares formulations, such as those involving uncertain binary labels and adversarial noise constrained within a hypercube. We show that the geometric structure of the noise propagation matrix, particularly whether its columns form acute or obtuse angles, implies the supermodularity or submodularity of the inner maximization problem. This structural property enables us to integrate powerful combinatorial optimization tools into a gradient-based minimax algorithmic framework. For the robust linear least squares problem with the supermodularity, we establish the relationship between the minimax points of BRLS and saddle points of its continuous relaxation, and propose a projected gradient algorithm computing $ε$-global minimax points in $O(ε^{-2})$ iterations. For the robust nonlinear least squares problem with supermodularity, we develop a revised framework that finds $ε$-stationary points in the sense of expectation within $O(ε^{-4})$ iterations. For the robust linear least squares problem with the submodularity, we employ a double greedy algorithm as a subsolver, guaranteeing a $(\frac{1}{3}, ε)$-approximate minimax point in $O(ε^{-2})$ iterations. Numerical experiments on health status prediction and phase retrieval demonstrate that BRLS achieves superior robustness against structured noise compared to classical least squares problems and LASSO.

Robust Least Squares Problems with Binary Uncertain Data

TL;DR

This work tackles robust least squares with binary or hypercube adversarial noise by formulating a minimax problem where . It reveals that the geometry of the noise-propagation matrix —whether its columns form acute or obtuse angles—determines whether the inner maximization is supermodular or submodular, enabling submodular optimization tools within a gradient-based minimax framework. For the supermodular linear case, a Lovász-extension-based approach relates global minimax points to saddle points, yielding an -global minimax point in iterations; for the nonlinear case, a randomized projected-gradient method achieves -stationarity in . In the submodular linear case, a double greedy subsolver provides a -approximate minimax point in , with exactness when is column-orthogonal. Numerical experiments on health status prediction and phase retrieval demonstrate BRLS's superior robustness to structured noise compared with classical LS and LASSO, highlighting its practical impact in adversarially noisy settings.

Abstract

We propose a Binary Robust Least Squares (BRLS) model that encompasses key robust least squares formulations, such as those involving uncertain binary labels and adversarial noise constrained within a hypercube. We show that the geometric structure of the noise propagation matrix, particularly whether its columns form acute or obtuse angles, implies the supermodularity or submodularity of the inner maximization problem. This structural property enables us to integrate powerful combinatorial optimization tools into a gradient-based minimax algorithmic framework. For the robust linear least squares problem with the supermodularity, we establish the relationship between the minimax points of BRLS and saddle points of its continuous relaxation, and propose a projected gradient algorithm computing -global minimax points in iterations. For the robust nonlinear least squares problem with supermodularity, we develop a revised framework that finds -stationary points in the sense of expectation within iterations. For the robust linear least squares problem with the submodularity, we employ a double greedy algorithm as a subsolver, guaranteeing a -approximate minimax point in iterations. Numerical experiments on health status prediction and phase retrieval demonstrate that BRLS achieves superior robustness against structured noise compared to classical least squares problems and LASSO.

Paper Structure

This paper contains 15 sections, 11 theorems, 60 equations, 5 figures, 3 algorithms.

Key Result

Proposition 2.1

For any fixed $x \in \mathcal{X}$, it holds that $\operatorname{SOL}(\Theta(x,\cdot), \{0,1\}^n) \subseteq \operatorname{SOL}(\Theta(x,\cdot), [0,1]^n)$. If $C$ has full column rank, then the two sets are equal.

Figures (5)

  • Figure 1: The function $\Theta = (x-y)^2$ in \ref{['eq:binary_RLS_chen']} with its global minimax points (left) and $\Theta^L$ in \ref{['eq:minimax_lovasz']} with its saddle point (right).
  • Figure 2: Average accuracy of LS, LASSO, RLS(100%C), and RLS(70%C) models among 10 users as a function of the noise ratio.
  • Figure 3: Distribution of true vs predicted labels for LS, LASSO, RLS(100%C), and RLS(70%C) models of 10 users under different noise ratios.
  • Figure 4: Robustness comparison of RLS to LS and LASSO with $C$ being acute for phase retrieval.
  • Figure 5: Robustness comparison of RLS to LS and LASSO with $C$ being obtuse for phase retrieval.

Theorems & Definitions (27)

  • Definition 1.1
  • Definition 1.2
  • Proposition 2.1
  • proof
  • Corollary 2.2
  • Definition 2.3: Acute/Obtuse Matrix
  • Definition 2.4
  • Proposition 2.5
  • proof
  • Definition 3.1: Lovász Extension grotschel2012geometric
  • ...and 17 more