Optimistic Online-to-Batch Conversions for Accelerated Convergence and Universality

Yu-Hu Yan; Peng Zhao; Zhi-Hua Zhou

Optimistic Online-to-Batch Conversions for Accelerated Convergence and Universality

Yu-Hu Yan, Peng Zhao, Zhi-Hua Zhou

TL;DR

This paper links accelerated offline convex optimization with online-to-batch conversions by introducing optimistic O2B conversions that embed look-ahead information into the analysis. The approach yields accelerated convergence for convex smooth objectives, extends to strongly convex objectives with optimal rates, and develops universal variants that adapt to smooth and non-smooth settings while using only one gradient query per iteration. It clarifies the connection to Nesterov's Accelerated Gradient and Polyak's Heavy-Ball, and demonstrates competitive empirical performance on standard convex problems. The work broadens the online-learning lens on acceleration and offers practical, horizon-efficient algorithms for a broad class of convex problems.

Abstract

In this work, we study offline convex optimization with smooth objectives, where the classical Nesterov's Accelerated Gradient (NAG) method achieves the optimal accelerated convergence. Extensive research has aimed to understand NAG from various perspectives, and a recent line of work approaches this from the viewpoint of online learning and online-to-batch conversion, emphasizing the role of optimistic online algorithms for acceleration. In this work, we contribute to this perspective by proposing novel optimistic online-to-batch conversions that incorporate optimism theoretically into the analysis, thereby significantly simplifying the online algorithm design while preserving the optimal convergence rates. Specifically, we demonstrate the effectiveness of our conversions through the following results: (i) when combined with simple online gradient descent, our optimistic conversion achieves the optimal accelerated convergence; (ii) our conversion also applies to strongly convex objectives, and by leveraging both optimistic online-to-batch conversion and optimistic online algorithms, we achieve the optimal accelerated convergence rate for strongly convex and smooth objectives, for the first time through the lens of online-to-batch conversion; (iii) our optimistic conversion can achieve universality to smoothness -- applicable to both smooth and non-smooth objectives without requiring knowledge of the smoothness coefficient -- and remains efficient as non-universal methods by using only one gradient query in each iteration. Finally, we highlight the effectiveness of our optimistic online-to-batch conversions by a precise correspondence with NAG.

Optimistic Online-to-Batch Conversions for Accelerated Convergence and Universality

TL;DR

Abstract

Optimistic Online-to-Batch Conversions for Accelerated Convergence and Universality

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (29)