Max-Linear Regression by Convex Programming

Seonho Kim; Sohail Bahmani; Kiryung Lee

Max-Linear Regression by Convex Programming

Seonho Kim, Sohail Bahmani, Kiryung Lee

TL;DR

This work presents Anchored Regression (AR), a scalable convex program for estimating the parameters of a multivariate max-linear regression model under Gaussian covariates and adversarial noise. AR convexifies the typically nonconvex LAD objective by using an anchor vector, enabling nonasymptotic recovery guarantees with sample complexity that scales as $n \ge C \zeta^{-2}(4p\log^3 p\log^5 k+4\log(1/\delta)\log k)$, where $\zeta$ captures the cone-geometry of the problem. In the balanced setting where the $k$ components are equally likely to be the maximum, this reduces to $n \asymp k^4 p$ up to logarithmic factors, matching known results for AM in certain regimes under noiseless conditions. The paper provides empirical evidence of AR’s robustness to outliers and deterministic noise, demonstrates competitive performance against AM in Gaussian settings, and introduces iterative AR (IAR) to further improve estimation accuracy. Theoretical development is complemented by a detailed comparison of computational costs and a tightness result for the fundamental bound, highlighting AR’s practical viability for large-scale max-linear regression problems.

Abstract

We consider the multivariate max-linear regression problem where the model parameters $\boldsymbolβ_{1},\dotsc,\boldsymbolβ_{k}\in\mathbb{R}^{p}$ need to be estimated from $n$ independent samples of the (noisy) observations $y = \max_{1\leq j \leq k} \boldsymbolβ_{j}^{\mathsf{T}} \boldsymbol{x} + \mathrm{noise}$. The max-linear model vastly generalizes the conventional linear model, and it can approximate any convex function to an arbitrary accuracy when the number of linear models $k$ is large enough. However, the inherent nonlinearity of the max-linear model renders the estimation of the regression parameters computationally challenging. Particularly, no estimator based on convex programming is known in the literature. We formulate and analyze a scalable convex program given by anchored regression (AR) as the estimator for the max-linear regression problem. Under the standard Gaussian observation setting, we present a non-asymptotic performance guarantee showing that the convex program recovers the parameters with high probability. When the $k$ linear components are equally likely to achieve the maximum, our result shows a sufficient number of noise-free observations for exact recovery scales as {$k^{4}p$} up to a logarithmic factor. { This sample complexity coincides with that by alternating minimization (Ghosh et al., {2021}). Moreover, the same sample complexity applies when the observations are corrupted with arbitrary deterministic noise. We provide empirical results that show that our method performs as our theoretical result predicts, and is competitive with the alternating minimization algorithm particularly in presence of multiplicative Bernoulli noise. Furthermore, we also show empirically that a recursive application of AR can significantly improve the estimation accuracy.}

Max-Linear Regression by Convex Programming

TL;DR

, where

captures the cone-geometry of the problem. In the balanced setting where the

components are equally likely to be the maximum, this reduces to

up to logarithmic factors, matching known results for AM in certain regimes under noiseless conditions. The paper provides empirical evidence of AR’s robustness to outliers and deterministic noise, demonstrates competitive performance against AM in Gaussian settings, and introduces iterative AR (IAR) to further improve estimation accuracy. Theoretical development is complemented by a detailed comparison of computational costs and a tightness result for the fundamental bound, highlighting AR’s practical viability for large-scale max-linear regression problems.

Abstract

We consider the multivariate max-linear regression problem where the model parameters

need to be estimated from

independent samples of the (noisy) observations

. The max-linear model vastly generalizes the conventional linear model, and it can approximate any convex function to an arbitrary accuracy when the number of linear models

is large enough. However, the inherent nonlinearity of the max-linear model renders the estimation of the regression parameters computationally challenging. Particularly, no estimator based on convex programming is known in the literature. We formulate and analyze a scalable convex program given by anchored regression (AR) as the estimator for the max-linear regression problem. Under the standard Gaussian observation setting, we present a non-asymptotic performance guarantee showing that the convex program recovers the parameters with high probability. When the

linear components are equally likely to achieve the maximum, our result shows a sufficient number of noise-free observations for exact recovery scales as {

} up to a logarithmic factor. { This sample complexity coincides with that by alternating minimization (Ghosh et al., {2021}). Moreover, the same sample complexity applies when the observations are corrupted with arbitrary deterministic noise. We provide empirical results that show that our method performs as our theoretical result predicts, and is competitive with the alternating minimization algorithm particularly in presence of multiplicative Bernoulli noise. Furthermore, we also show empirically that a recursive application of AR can significantly improve the estimation accuracy.}

Max-Linear Regression by Convex Programming

TL;DR

Abstract

Max-Linear Regression by Convex Programming

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (11)