Inexact subgradient methods for semialgebraic functions
Jérôme Bolte, Tam Le, Éric Moulines, Edouard Pauwels
TL;DR
This work analyzes inexact (biased) subgradient methods for locally Lipschitz, semialgebraic functions under persistent additive errors. By leveraging a continuous-time differential inclusion framework together with the Kurdyka–Łojasiewicz inequality and metric subregularity, it shows that biased iterates with vanishing or small constant steps approach the epsilon-critical region and remain within $O(\epsilon^{\rho})$ of the true critical set, with $\rho$ reflecting geometric properties. In the convex case, a universal error bound yields explicit complexity guarantees for averaged iterates, extending classical results to inexact settings without requiring compactness. The paper also develops descent-type lemmas and an invariance principle linking discrete-time updates to continuous-time dynamics, with extensions to o-minimal definable settings and potential stochastic generalizations. These results provide rigorous performance guarantees for optimization routines that rely on inexact subgradient information, which is common in large-scale machine learning and nonconvex nonsmooth optimization contexts.
Abstract
Motivated by the extensive application of approximate gradients in machine learning and optimization, we investigate inexact subgradient methods subject to persistent additive errors. Within a nonconvex semialgebraic framework, assuming boundedness or coercivity, we establish that the method yields iterates that eventually fluctuate near the critical set at a proximity characterized by an $O(ε^ρ)$ distance, where $ε$ denotes the magnitude of subgradient evaluation errors, and $ρ$ encapsulates geometric characteristics of the underlying problem. Our analysis comprehensively addresses both vanishing and constant step-size regimes. Notably, the latter regime inherently enlarges the fluctuation region, yet this enlargement remains on the order of $ε^ρ$. In the convex scenario, employing a universal error bound applicable to coercive semialgebraic functions, we derive novel complexity results concerning averaged iterates. Additionally, our study produces auxiliary results of independent interest, including descent-type lemmas for nonsmooth nonconvex functions and an invariance principle governing the behavior of algorithmic sequences under small-step limits.
