Variance Reduction and Low Sample Complexity in Stochastic Optimization via Proximal Point Method

Jiaming Liang

TL;DR

The paper tackles high-confidence guarantees for stochastic convex composite optimization under bounded-variance noise by introducing the stochastic proximal point method (SPPM). It combines a variance-reducing proximal subproblem solver (PSS) with a probability booster (PB) to achieve high-probability convergence using a constant prox-stepsize λ, avoiding decaying step sizes. The main theoretical contributions are a high-probability convergence result and a low-sample-complexity bound that scales with log(1/p) for the confidence parameter, with an overall gradient-sample complexity of O(max{κ log(κ/ε), κσ^2/(μ ε)} log(1/p) log(1/ε)). The framework achieves variance reduction without mini-batching and provides a practical, adaptive approach to stochastic optimization that relies only on bounded-variance noise assumptions. These results have implications for reliable stochastic optimization in settings where sub-Gaussian noise cannot be guaranteed, offering a principled, proximal-based pathway to robust convergence.
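To make the scaling of the stated bound concrete, the following is a minimal sketch that simply evaluates the expression max{κ log(κ/ε), κσ^2/(μ ε)} log(1/p) log(1/ε). The symbols are used exactly as they appear in the bound above. The function name `sample_complexity_bound`, the choice of natural logarithms, and setting the constant hidden in O(·) to 1 are assumptions made only for illustration.

```python
import math


def sample_complexity_bound(kappa, sigma, mu, eps, p):
    """Evaluate the expression inside O(.) from the summary above.

    Assumption: the hidden constant is taken to be 1 and logs are natural,
    so only ratios between two calls are meaningful, not absolute values.
    """
    leading = max(
        kappa * math.log(kappa / eps),        # kappa * log(kappa / eps)
        kappa * sigma ** 2 / (mu * eps),      # kappa * sigma^2 / (mu * eps)
    )
    return leading * math.log(1.0 / p) * math.log(1.0 / eps)


# Tightening the confidence parameter from p = 1e-2 to p = 1e-4 only
# doubles the expression, because the dependence on p is log(1/p).
loose = sample_complexity_bound(kappa=10.0, sigma=1.0, mu=1.0, eps=1e-3, p=1e-2)
tight = sample_complexity_bound(kappa=10.0, sigma=1.0, mu=1.0, eps=1e-3, p=1e-4)
ratio = tight / loose
```

The ratio depends only on the two values of p, which is the sense in which the bound "scales with log(1/p)".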

Abstract

High-probability guarantees in stochastic optimization are often obtained only under strong noise assumptions such as sub-Gaussian tails. We show that such guarantees can also be achieved under the weaker assumption of bounded variance by developing a stochastic proximal point method. This method combines a proximal subproblem solver, which inherently reduces variance, with a probability booster that amplifies per-iteration reliability into high-confidence results. The analysis demonstrates convergence with low sample complexity, without restrictive noise assumptions or reliance on mini-batching.
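The three ingredients named above (a proximal point outer loop with a constant stepsize λ, an inner subproblem solver, and a probability booster) can be sketched roughly as follows. This is not the paper's algorithm: the summary does not spell out PSS or PB, so the inner solver here is plain SGD with iterate averaging, and the booster is a generic "pick the candidate with the smallest median distance to the others" rule. The quadratic test objective, the Student-t noise (finite variance, not sub-Gaussian), and all function names and default parameters are assumptions chosen for illustration.

```python
import numpy as np


def noisy_grad(x, x_star, rng, mu=1.0):
    # Gradient of 0.5 * mu * ||x - x_star||^2 plus heavy-tailed noise.
    # Student-t with df=3 has finite variance but is not sub-Gaussian.
    return mu * (x - x_star) + rng.standard_t(df=3, size=x.shape)


def prox_subproblem_solver(center, x_star, lam, rng, inner_iters=200, mu=1.0):
    # Stand-in inner solver: averaged SGD on
    #   f(u) + (1 / (2 * lam)) * ||u - center||^2,
    # which is strongly convex with modulus rho = mu + 1 / lam.
    rho = mu + 1.0 / lam
    u = center.copy()
    avg = np.zeros_like(center)
    for t in range(inner_iters):
        g = noisy_grad(u, x_star, rng, mu) + (u - center) / lam
        u = u - g / (rho * (t + 1))
        avg += (u - avg) / (t + 1)  # running average of the inner iterates
    return avg


def probability_booster(candidates):
    # Stand-in booster: choose the candidate whose median distance to all
    # candidates is smallest, so a single bad trial is unlikely to win.
    C = np.stack(candidates)
    dists = np.linalg.norm(C[:, None, :] - C[None, :, :], axis=2)
    return C[np.argmin(np.median(dists, axis=1))]


def sppm_sketch(x0, x_star, lam=1.0, outer_iters=20, trials=5, seed=0):
    # Outer proximal point loop with a constant prox-stepsize lam.
    rng = np.random.default_rng(seed)
    x = x0.copy()
    for _ in range(outer_iters):
        candidates = [
            prox_subproblem_solver(x, x_star, lam, rng) for _ in range(trials)
        ]
        x = probability_booster(candidates)
    return x


x0 = np.full(5, 10.0)
x_star = np.zeros(5)
x_final = sppm_sketch(x0, x_star)
```

Note that each stochastic gradient call uses a single sample, in keeping with the "no mini-batching" claim; the repeated independent trials in the outer loop are what the booster consumes.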

Paper Structure (18 sections, 19 theorems, 133 equations, 5 algorithms)

Introduction
SPPM in a nutshell.
Contributions.
Organization of the paper.
Basic notation and definitions
Stochastic Proximal Point Method and Main Results
Proximal Subproblem Solver
Analysis of Proximal Point Method
Probability Booster
Now, we are ready to analyze Steps 1 and 2 of Algorithm \ref{alg:PB}.
Next, we analyze Step 3 of Algorithm \ref{alg:PB}.
Now, we are ready to analyze Step 4 of Algorithm \ref{alg:PB}.
To this end, we are in a position to prove \ref{ineq:phi}.
High-Probability Result and Low Sample Complexity
Conclusions
...and 3 more sections

Key Result

Proposition 3.1

Assuming then we have

Theorems & Definitions (36)

Proposition 3.1
Lemma 3.2
proof
Lemma 3.3
proof
Lemma 3.4
proof
Proposition 4.1
Lemma 4.2
proof
...and 26 more
