Revisiting the Geometrically Decaying Step Size: Linear Convergence for Smooth or Non-Smooth Functions
Jihun Kim
TL;DR
The paper introduces a positive inverse condition number, a weaker geometric criterion, to guarantee linear convergence of subgradient descent with geometrically decaying step sizes for both smooth and non-smooth locally Lipschitz functions, without requiring convexity or sharpness. It formalizes the condition using Clarke subdifferentials and a defined set S, provides practical sufficient conditions and broad nonconvex examples where the condition holds, and presents two algorithms: one requiring the distance to a minimizer and a second that replaces it with a known small constant, both achieving linear convergence with rates depending on problem-constants. This work expands the applicability of fast convergence guarantees to a wider class of nonconvex problems and lays groundwork for extensions to stochastic, robust, and distributed settings. Overall, the results show that geometry captured by ¯μ>0 suffices to drive linear convergence under minimally informative step-size schemes.
Abstract
We revisit the geometrically decaying step size given a positive inverse condition number, under which a locally Lipschitz function shows linear convergence. The positivity does not require the function to satisfy convexity, weak convexity, quasar convexity, or sharpness, but instead amounts to a property strictly weaker than the assumptions used in existing works (e.g., weak convexity + sharpness). We propose a clean and simple subgradient descent algorithm that requires minimal knowledge of problem constants, applicable to either smooth or non-smooth functions.
