Algebraic characterization of equivalence between optimization algorithms
Laurent Lessard, Madeleine Udell
TL;DR
This paper presents a principled framework for determining when iterative optimization algorithms are equivalent by modeling them as linear dynamical systems in feedback with nonlinear oracles. It introduces three notions—oracle equivalence, shift equivalence, and LFT equivalence—each enabling increasingly general forms of equivalence through transfer-function analysis. The framework unifies and explains known equivalences among classical methods (e.g., gradient methods, ADMM, Douglas–Rachford, Chambolle–Pock, proximal methods) and extends to accelerated, distributed, and operator-splitting schemes, including cases with related oracles via Moreau identities. By providing practical procedures for converting algorithms to transfer functions and checking equivalence (including via multi-shifts and LFTs), the work offers a principled toolkit for algorithm design, comparison, and discovery with potential impact on robust convergence and implementation choices.
Abstract
When are two algorithms the same? How can we be sure a recently proposed algorithm is novel, and not a minor twist on an existing method? In this paper, we present a framework for reasoning about equivalence between a broad class of iterative algorithms, with a focus on algorithms designed for convex optimization. We propose several notions of what it means for two algorithms to be equivalent, and provide computationally tractable means to detect equivalence. Our main definition, oracle equivalence, states that two algorithms are equivalent if they result in the same sequence of calls to the function oracles (for suitable initialization). Borrowing from control theory, we use state-space realizations to represent algorithms and characterize algorithm equivalence via transfer functions. Our framework can also identify and characterize equivalence between algorithms that use different oracles that are related via a linear fractional transformation. Prominent examples include linear transformations and function conjugation.
