Learning-Rate-Free Stochastic Optimization over Riemannian Manifolds
Daniel Dodd, Louis Sharrock, Christopher Nemeth
TL;DR
This work addresses the sensitivity of stochastic optimization on Riemannian manifolds to learning-rate choices by introducing learning-rate-free algorithms. The three main methods—RDoG, NRDoG, and RDoWG—leverage Distance over Gradients and Distance over Weighted Gradients to adapt step sizes without prior knowledge of the optimum distance, achieving high-probability convergence rates that are optimal up to logarithmic factors. The framework supports bounded-iterate guarantees and includes curvature-aware and curvature-omitting variants, with theoretical results complemented by experiments on Rayleigh quotient on the sphere, Grassmann PCA, and Poincaré ball embeddings showing robustness against initialization and hyperparameter tuning. Overall, these LR-free approaches offer robust, scalable alternatives for geodesically convex stochastic optimization with practical impact across manifold-structured learning tasks.
Abstract
In recent years, interest in gradient-based optimization over Riemannian manifolds has surged. However, a significant challenge lies in the reliance on hyperparameters, especially the learning rate, which requires meticulous tuning by practitioners to ensure convergence at a suitable rate. In this work, we introduce innovative learning-rate-free algorithms for stochastic optimization over Riemannian manifolds, eliminating the need for hand-tuning and providing a more robust and user-friendly approach. We establish high probability convergence guarantees that are optimal, up to logarithmic factors, compared to the best-known optimally tuned rate in the deterministic setting. Our approach is validated through numerical experiments, demonstrating competitive performance against learning-rate-dependent algorithms.
