Table of Contents
Fetching ...

Counter-examples in first-order optimization: a constructive approach

Baptiste Goujaud, Aymeric Dieuleveut, Adrien Taylor

TL;DR

This work tackles gaps in worst-case analysis for first-order optimization by automating the search for cyclic trajectories that witness non-convergence. It builds a cycle-detection framework using performance estimation problems (PEP) and SDP lifting to search over a function class $\mathcal{F}$ and algorithm parameters. The authors apply the method to four stationary first-order methods—Heavy-ball, Nesterov accelerated gradient, inexact gradient descent, and three-operator splitting—and show cycles for parameter regions lacking a Lyapunov certificate, thereby complementing traditional convergence proofs. Overall, the approach provides a constructive, scalable tool for certifying the non-existence of universal convergence guarantees and informs the design of robust optimization algorithms.

Abstract

While many approaches were developed for obtaining worst-case complexity bounds for first-order optimization methods in the last years, there remain theoretical gaps in cases where no such bound can be found. In such cases, it is often unclear whether no such bound exists (e.g., because the algorithm might fail to systematically converge) or simply if the current techniques do not allow finding them. In this work, we propose an approach to automate the search for cyclic trajectories generated by first-order methods. This provides a constructive approach to show that no appropriate complexity bound exists, thereby complementing the approaches providing sufficient conditions for convergence. Using this tool, we provide ranges of parameters for which some of the famous heavy-ball, Nesterov accelerated gradient, inexact gradient descent, and three-operator splitting algorithms fail to systematically converge, and show that it nicely complements existing tools searching for Lyapunov functions.

Counter-examples in first-order optimization: a constructive approach

TL;DR

This work tackles gaps in worst-case analysis for first-order optimization by automating the search for cyclic trajectories that witness non-convergence. It builds a cycle-detection framework using performance estimation problems (PEP) and SDP lifting to search over a function class and algorithm parameters. The authors apply the method to four stationary first-order methods—Heavy-ball, Nesterov accelerated gradient, inexact gradient descent, and three-operator splitting—and show cycles for parameter regions lacking a Lyapunov certificate, thereby complementing traditional convergence proofs. Overall, the approach provides a constructive, scalable tool for certifying the non-existence of universal convergence guarantees and informs the design of robust optimization algorithms.

Abstract

While many approaches were developed for obtaining worst-case complexity bounds for first-order optimization methods in the last years, there remain theoretical gaps in cases where no such bound can be found. In such cases, it is often unclear whether no such bound exists (e.g., because the algorithm might fail to systematically converge) or simply if the current techniques do not allow finding them. In this work, we propose an approach to automate the search for cyclic trajectories generated by first-order methods. This provides a constructive approach to show that no appropriate complexity bound exists, thereby complementing the approaches providing sufficient conditions for convergence. Using this tool, we provide ranges of parameters for which some of the famous heavy-ball, Nesterov accelerated gradient, inexact gradient descent, and three-operator splitting algorithms fail to systematically converge, and show that it nicely complements existing tools searching for Lyapunov functions.
Paper Structure (11 sections, 2 theorems, 15 equations, 4 figures, 1 table)

This paper contains 11 sections, 2 theorems, 15 equations, 4 figures, 1 table.

Key Result

Proposition 3.1

Let $\mathcal{A}$ be a order-$\ell$eq:sfom, and $(x_t)_{t\in\mathbb{N}}$ be any sequence generated by $\mathcal{A}$. Then the sequence $(x_t)_{t\in\mathbb{N}}$ is cyclic if and only if there exists $K\geq2$ such that $\forall t \in \llbracket 0, \ell-1 \rrbracket, x_t = x_{t+K}$.

Figures (4)

  • Figure 1: Heavy-ball \ref{['eq:hb']}. Green area: set of parameters $(\gamma, \beta)\in \Omega_{\mathrm{HB}}$ for which a Lyapunov function exists; Red area: set of parameters $(\gamma, \beta) \in \Omega_{\mathrm{HB}}$ for which \ref{['eq:hb']} cycles on at least one function in $\mathcal{F}_{0,L}$.
  • Figure 2: Nesterov Accelerated gradient \ref{['eq:nag']}. Green area: set of parameters $(\gamma,\beta)\in \Omega_{\mathrm{NAG}}$ for which a Lyapunov function exists; Red area: set of $( \gamma, \beta) \in \Omega_{\mathrm{NAG}}$ for which \ref{['eq:nag']} cycles on at least one function in $\mathcal{F}_{0,L}$.
  • Figure 3: Inexact GD \ref{['eq:igd']}. Green area: set of parameters $(\gamma,\varepsilon)\in \Omega_{\mathrm{IGD}}$ for which a Lyapunov function exists; Red area: set of $(\gamma,\varepsilon)\in \Omega_{\mathrm{IGD}}$ for which \ref{['eq:igd']} cycles on at least one function in $\mathcal{F}_{0,L}$.
  • Figure 4: Three-operator splitting \ref{['eq:tos']}. Green area: set of parameters $(\gamma,\beta)\in \Omega_{\mathrm{TOS}}$ for which a Lyapunov function exists; Red area: set of $(\gamma,\beta)\in \Omega_{\mathrm{TOS}}$ for which \ref{['eq:tos']} cycles on at least a triplet of operators.

Theorems & Definitions (6)

  • Definition 2.1: Stationary first-order method (SFOM)
  • Definition 2.2: Cycle
  • Example 2.3
  • Proposition 3.1
  • Example 3.2: $L$-smooth convex functions
  • Theorem 3.3