Efficiency of stochastic coordinate proximal gradient methods on nonseparable composite optimization

I. Necoara; F. Chorobura

Efficiency of stochastic coordinate proximal gradient methods on nonseparable composite optimization

I. Necoara, F. Chorobura

TL;DR

A probabilistic worst case complexity analysis is presented for the stochastic coordinate proximal gradient method in convex and nonconvex settings and it is proved high-probability bounds on the number of iterations before a given optimality is achieved.

Abstract

This paper deals with composite optimization problems having the objective function formed as the sum of two terms, one has Lipschitz continuous gradient along random subspaces and may be nonconvex and the second term is simple and differentiable, but possibly nonconvex and nonseparable. Under these settings we design a stochastic coordinate proximal gradient method which takes into account the nonseparable composite form of the objective function. This algorithm achieves scalability by constructing at each iteration a local approximation model of the whole nonseparable objective function along a random subspace with user-determined dimension. We outline efficient techniques for selecting the random subspace, yielding an implementation that has low cost per-iteration while also achieving fast convergence rates. We present a probabilistic worst-case complexity analysis for our stochastic coordinate proximal gradient method in convex and nonconvex settings, in particular we prove high-probability bounds on the number of iterations before a given optimality is achieved. Extensive numerical results also confirm the efficiency of our algorithm.

Efficiency of stochastic coordinate proximal gradient methods on nonseparable composite optimization

TL;DR

Abstract

Efficiency of stochastic coordinate proximal gradient methods on nonseparable composite optimization

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (18)