Accelerated Parameter-Free Stochastic Optimization
Itai Kreisler, Maor Ivgi, Oliver Hinder, Yair Carmon
TL;DR
This work addresses the challenge of accelerated stochastic optimization in the smooth convex setting without requiring exact problem parameters. It introduces U-DoG, a parameter-free accelerated method that combines UniXGrad and DoG with iterate stabilization, using the evolving drift $\bar{r}_t$ to adapt momentum and ensure stability, and proving near-optimal high-probability rates under sub-Gaussian noise. The analysis covers both noiseless and stochastic scenarios, providing general suboptimality bounds, stability guarantees, and extensions to bounded and sub-Gaussian noise, plus a mini-batch corollary and a discussion of the parameter-free nature. Empirically, U-DoG (and the variant A-DoG) improves over DoG on convex problems and is competitive with carefully tuned SGD, while neural network experiments show more mixed results, highlighting the method’s strength in parameter-free acceleration for convex stochastic optimization and its current limitations in non-convex deep learning settings.
Abstract
We propose a method that achieves near-optimal rates for smooth stochastic convex optimization and requires essentially no prior knowledge of problem parameters. This improves on prior work which requires knowing at least the initial distance to optimality d0. Our method, U-DoG, combines UniXGrad (Kavis et al., 2019) and DoG (Ivgi et al., 2023) with novel iterate stabilization techniques. It requires only loose bounds on d0 and the noise magnitude, provides high probability guarantees under sub-Gaussian noise, and is also near-optimal in the non-smooth case. Our experiments show consistent, strong performance on convex problems and mixed results on neural network training.
