An accelerated gradient method with adaptive restart for convex multiobjective optimization problems

Hao Luo; Liping Tang; Xinmin Yang

An accelerated gradient method with adaptive restart for convex multiobjective optimization problems

Hao Luo, Liping Tang, Xinmin Yang

TL;DR

The paper develops an accelerated gradient framework for convex multiobjective optimization by deriving a rigorous continuous-time limit of multiobjective APG and introducing an Accelerated Multiobjective Gradient (AMG) flow with adaptive time scaling. An IMEX discretization yields an accelerated method with rates $\min\{L/k^2,(1-\sqrt{\mu/L})^k\}$, accompanied by a quadratic programming subproblem on the convex hull of gradients. A Lyapunov-based analysis proves exponential decay of a merit function, and practical gains are achieved via a residual-based adaptive restart to mitigate oscillations. The approach is validated numerically against state-of-the-art methods, showing competitive convergence in convex cases and linear-like behavior under strong convexity, with restart strategies significantly enhancing robustness and speed.

Abstract

In this work, based on the continuous time approach, we propose an accelerated gradient method with adaptive residual restart for convex multiobjective optimization problems. For the first, we derive rigorously the continuous limit of the multiobjective accelerated proximal gradient method by Tanabe et al. [Comput. Optim. Appl., 2023]. It is a second-order ordinary differential equation (ODE) that involves a special projection operator and can be viewed as an extension of the ODE by Su et al. [J. Mach. Learn. Res., 2016] for Nesterov acceleration. Then, we introduce a novel accelerated multiobjective gradient (AMG) flow with tailored time scaling that adapts automatically to the convex case and the strongly convex case, and the exponential decay rate of a merit function along with the solution trajectory of AMG flow is established via the Lyapunov analysis. After that, we consider an implicit-explicit time discretization and obtain an accelerated multiobjective gradient method with a convex quadratic programming subproblem. The fast sublinear rate and linear rate are proved respectively for convex and strongly convex problems. In addition, we present an efficient residual based adaptive restart technique to overcome the oscillation issue and improve the convergence significantly. Numerical results are provided to validate the practical performance of the proposed method.

An accelerated gradient method with adaptive restart for convex multiobjective optimization problems

TL;DR

, accompanied by a quadratic programming subproblem on the convex hull of gradients. A Lyapunov-based analysis proves exponential decay of a merit function, and practical gains are achieved via a residual-based adaptive restart to mitigate oscillations. The approach is validated numerically against state-of-the-art methods, showing competitive convergence in convex cases and linear-like behavior under strong convexity, with restart strategies significantly enhancing robustness and speed.

Abstract

Paper Structure (30 sections, 16 theorems, 128 equations, 8 figures, 3 algorithms)

This paper contains 30 sections, 16 theorems, 128 equations, 8 figures, 3 algorithms.

Introduction
Multiobjective gradient methods
Multiobjective gradient flow
Main contributions
Organization
Preliminary
Pareto optimality
Functional class
Projection operator
First-order necessary condition
Merit function
Deriving the Continuous Model
Auxiliary estimates
Proof of \ref{['thm:ode-apg-mop']}
Accelerated Multiobjective Gradient Flow
...and 15 more sections

Key Result

Lemma 2.1

Assume $f\in\mathcal{S}_{\mu}^{1}(\,{\mathbb R}^n)$ and $(x,y,M)\in\,{\mathbb R}^n\times\,{\mathbb R}^n\times\,{\mathbb R}_+$ satisfies then it holds that

Figures (8)

Figure 1: The approximate Pareto front at the iteration $k = 25$, with $N=100$ initial sample points in $[-2,2]^n$ for the first example \ref{['eq:ex1']}.
Figure 2: The KKT residual $\| {\bf proj}_{C(x_k)}(0)\|$ and the iterate gap $\left\lVert {x_{k+1}-x_k} \right\rVert$$v.s.$ the iteration step $k$, with one initial sample point $x_0\in[-2,2]^n$ for the first example \ref{['eq:ex1']}.
Figure 3: The KKT residual $\| {\bf proj}_{C(x_k)}(0)\|$ and the iterate gap $\left\lVert {x_{k+1}-x_k} \right\rVert$$v.s.$ the iteration time $s$, with one initial sample point $x_0\in[-2,2]^n$ for the first example \ref{['eq:ex1']}.
Figure 4: The KKT residual $\| {\bf proj}_{C(x_k)}(0)\|$ and the iterate gap $\left\lVert {x_{k+1}-x_k} \right\rVert$$v.s.$ the iteration step $k$, with one initial sample point $x_0\in[-2,2]^n$ for the second example \ref{['eq:ex2']}.
Figure 5: The KKT residual $\| {\bf proj}_{C(x_k)}(0)\|$ and the iterate gap $\left\lVert {x_{k+1}-x_k} \right\rVert$$v.s.$ the iteration time $s$, with one initial sample point $x_0\in[-2,2]^n$ for the second example \ref{['eq:ex2']}.
...and 3 more figures

Theorems & Definitions (27)

Lemma 2.1
proof
Theorem 2.1: Tanabe2024
Lemma 2.2
proof
Theorem 3.1
Lemma 3.1
proof
Lemma 3.2
proof
...and 17 more

An accelerated gradient method with adaptive restart for convex multiobjective optimization problems

TL;DR

Abstract

An accelerated gradient method with adaptive restart for convex multiobjective optimization problems

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (27)