Accelerated Gradient Descent by Concatenation of Stepsize Schedules

Zehao Zhang; Rujun Jiang

Accelerated Gradient Descent by Concatenation of Stepsize Schedules

Zehao Zhang, Rujun Jiang

TL;DR

This work introduces two new families of stepsize schedules, achieving a convergence rate of O(n-\log_2(\sqrt 2+1)$ with state-of-the-art constants for the objective value and gradient norm of the last iterate, respectively.

Abstract

This work considers stepsize schedules for gradient descent on smooth convex objectives. We extend the existing literature and propose a unified technique for constructing stepsizes with analytic bounds for an arbitrary number of iterations. This technique constructs new stepsize schedules by concatenating two stepsize schedules with fewer steps. Using this approach, we introduce two new families of stepsize schedules, achieving a convergence rate of $O(n^{-\log_2(\sqrt 2+1)})$ with state-of-the-art constants for the objective value and gradient norm of the last iterate, respectively. Furthermore, our analytically derived stepsize schedules either match or surpass the existing best numerically computed stepsize schedules.

Accelerated Gradient Descent by Concatenation of Stepsize Schedules

TL;DR

Abstract

Accelerated Gradient Descent by Concatenation of Stepsize Schedules

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (74)