Accelerated Over-Relaxation Heavy-Ball Method: Achieving Global Accelerated Convergence with Broad Generalization

Jingrong Wei; Long Chen

Accelerated Over-Relaxation Heavy-Ball Method: Achieving Global Accelerated Convergence with Broad Generalization

Jingrong Wei, Long Chen

TL;DR

The paper addresses achieving global accelerated convergence for smooth strongly convex optimization and extends this capability to composite and saddle-point problems. It introduces the Accelerated Over-Relaxation Heavy-Ball (AOR-HB) method, derived from discretizing a two-dimensional accelerated flow with an over-relaxation in the gradient term, and proves a strong Lyapunov property that yields global accelerated linear convergence with optimal iteration complexity $\mathcal{O}(\sqrt{\kappa}\,|\log\varepsilon|)$. The authors further extend AOR-HB to composite convex minimization and to a class of bilinear saddle-point problems via AOR-HB-saddle, providing convergence guarantees and practical single-loop implementations. Numerical results across smooth, composite, non-convex, and saddle-point problems illustrate robust acceleration and competitive performance relative to established first-order methods. Overall, AOR-HB offers broad generalization, conceptual clarity, and a pathway to generalize acceleration techniques beyond traditional convex settings.

Abstract

The heavy-ball momentum method accelerates gradient descent with a momentum term but lacks accelerated convergence for general smooth strongly convex problems. This work introduces the Accelerated Over-Relaxation Heavy-Ball (AOR-HB) method, the first variant with provable global and accelerated convergence for such problems. AOR-HB closes a long-standing theoretical gap, extends to composite convex optimization and min-max problems, and achieves optimal complexity bounds. It offers three key advantages: (1) broad generalization ability, (2) potential to reshape acceleration techniques, and (3) conceptual clarity and elegance compared to existing methods.

Accelerated Over-Relaxation Heavy-Ball Method: Achieving Global Accelerated Convergence with Broad Generalization

TL;DR

. The authors further extend AOR-HB to composite convex minimization and to a class of bilinear saddle-point problems via AOR-HB-saddle, providing convergence guarantees and practical single-loop implementations. Numerical results across smooth, composite, non-convex, and saddle-point problems illustrate robust acceleration and competitive performance relative to established first-order methods. Overall, AOR-HB offers broad generalization, conceptual clarity, and a pathway to generalize acceleration techniques beyond traditional convex settings.

Abstract

Paper Structure (43 sections, 15 theorems, 142 equations, 5 figures, 4 algorithms)

This paper contains 43 sections, 15 theorems, 142 equations, 5 figures, 4 algorithms.

Introduction
Notation.
Heavy-ball methods and flow.
Non-convergence and non-acceleration of the HB method.
Accelerated first-order methods.
Main contributions for smooth strongly convex optimization.
Remark on non-strongly convex optimization.
Relation to other acceleration methods.
Extension to composite convex minimization.
Extension to a class of saddle point problems.
Extension to a class of monotone operator equations.
AOR-HB Method for Convex Optimization
Strong Lyapunov property.
AOR-HB method for smooth strongly convex minimization problems.
Preliminaries for convergence analysis.
...and 28 more sections

Key Result

Theorem 1.1

Suppose $f$ is $\mu$-strongly convex and $L$-smooth. Let $(x_k,y_k)$ be generated by scheme (AOR-HB) with initial value $(x_0, y_0)$ and step size $\alpha = \sqrt{\mu/L}$. Then there exists a constant $C_0 = C_0(x_0, y_0,\mu, L)$ so that we have the accelerated linear convergence

Figures (5)

Figure 1: Simulation results for the objective function (\ref{['eq:fex2']}).
Figure 2: Simulation results for the logistic regression problem (\ref{['eq:log reg']}).
Figure 3: Comparison of $\ell_1$ minimization methods.
Figure 4: Comparison of $\ell_1 - \ell_2$ minimization methods.
Figure 5: Simulation results for MSPBE (\ref{['eq:MSPBE']}).

Theorems & Definitions (29)

Theorem 1.1: Convergence of AOR-HB method
Theorem 1.2: Convergence of AOR-HB-saddle method
Lemma 2.1
proof
Remark 3.1
Lemma A.1: Section 2.1 in nesterov2018lectures
Lemma A.2: Bregman divergence identity chen1993convergence
proof
Lemma C.1: Strong Lyapunov property luo2022differential
proof
...and 19 more

Accelerated Over-Relaxation Heavy-Ball Method: Achieving Global Accelerated Convergence with Broad Generalization

TL;DR

Abstract

Accelerated Over-Relaxation Heavy-Ball Method: Achieving Global Accelerated Convergence with Broad Generalization

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (29)