First-Order Sparse Convex Optimization: Better Rates with Sparse Updates

Dan Garber

First-Order Sparse Convex Optimization: Better Rates with Sparse Updates

Dan Garber

Abstract

It was recently established that for convex optimization problems with sparse optimal solutions (be it entry-wise sparsity or matrix rank-wise sparsity) it is possible to design first-order methods with linear convergence rates that depend on an improved mixed-norm condition number of the form $\frac{β_1{}s}{α_2}$, where $β_1$ is the $\ell_1$-Lipschitz continuity constant of the gradient, $α_2$ is the $\ell_2$-quadratic growth constant, and $s$ is the sparsity of optimal solutions. However, beyond the improved convergence rate, these methods are unable to leverage the sparsity of optimal solutions towards improving the runtime of each iteration as well, which may still be prohibitively high for high-dimensional problems. In this work, we establish that linear convergence rates which depend on this improved condition number can be obtained using only sparse updates, which may result in overall significantly improved running times. Moreover, our methods are considerably easier to implement.

First-Order Sparse Convex Optimization: Better Rates with Sparse Updates

Abstract

, where

is the

-Lipschitz continuity constant of the gradient,

is the

-quadratic growth constant, and

is the sparsity of optimal solutions. However, beyond the improved convergence rate, these methods are unable to leverage the sparsity of optimal solutions towards improving the runtime of each iteration as well, which may still be prohibitively high for high-dimensional problems. In this work, we establish that linear convergence rates which depend on this improved condition number can be obtained using only sparse updates, which may result in overall significantly improved running times. Moreover, our methods are considerably easier to implement.

First-Order Sparse Convex Optimization: Better Rates with Sparse Updates

Abstract

First-Order Sparse Convex Optimization: Better Rates with Sparse Updates

Abstract

Paper Structure

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (20)