Revisiting the Geometrically Decaying Step Size: Linear Convergence for Smooth or Non-Smooth Functions

Jihun Kim

Revisiting the Geometrically Decaying Step Size: Linear Convergence for Smooth or Non-Smooth Functions

Jihun Kim

TL;DR

The paper introduces a positive inverse condition number, a weaker geometric criterion, to guarantee linear convergence of subgradient descent with geometrically decaying step sizes for both smooth and non-smooth locally Lipschitz functions, without requiring convexity or sharpness. It formalizes the condition using Clarke subdifferentials and a defined set S, provides practical sufficient conditions and broad nonconvex examples where the condition holds, and presents two algorithms: one requiring the distance to a minimizer and a second that replaces it with a known small constant, both achieving linear convergence with rates depending on problem-constants. This work expands the applicability of fast convergence guarantees to a wider class of nonconvex problems and lays groundwork for extensions to stochastic, robust, and distributed settings. Overall, the results show that geometry captured by ¯μ>0 suffices to drive linear convergence under minimally informative step-size schemes.

Abstract

We revisit the geometrically decaying step size given a positive inverse condition number, under which a locally Lipschitz function shows linear convergence. The positivity does not require the function to satisfy convexity, weak convexity, quasar convexity, or sharpness, but instead amounts to a property strictly weaker than the assumptions used in existing works (e.g., weak convexity + sharpness). We propose a clean and simple subgradient descent algorithm that requires minimal knowledge of problem constants, applicable to either smooth or non-smooth functions.

Revisiting the Geometrically Decaying Step Size: Linear Convergence for Smooth or Non-Smooth Functions

TL;DR

Abstract

Revisiting the Geometrically Decaying Step Size: Linear Convergence for Smooth or Non-Smooth Functions

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (13)