Table of Contents
Fetching ...

Dictionary-Restricted First-Order Descent Methods: Bounds and Convergence Rates

Miguel Berasategui, Pablo M. Berná, Antonio Falcó

Abstract

This paper develops a general theory for first-order descent methods whose search directions are restricted to a prescribed dictionary in a reflexive Banach space. Instead of assuming that the linear span of the dictionary is dense, as in the classical Proper Generalized Decomposition framework of Falcó and Nouy or in the universality approach of Berná and Falcó, we introduce a geometric condition based on norming sets that guarantees density through a duality argument. This makes it possible to treat dictionaries arising from tensor formats, neural network units, and other nonlinear or parameterized approximation families within a unified setting. On the algorithmic side, we analyze a simple greedy update rule in which each iterate is obtained by minimizing the energy functional along one direction from the dictionary. Under mild differentiability, Lipschitz continuity, and ellipticity assumptions on the objective, we derive explicit quantitative descent bounds and sharp convergence rates. These include algebraic rates that improve those of classical steepest-descent schemes in Banach spaces, as well as arbitrarily high polynomial rates and exponential convergence in a critical regime. The results apply broadly to convex variational problems, high-dimensional approximation, and structured optimization methods that rely on restricted or compressed search directions.

Dictionary-Restricted First-Order Descent Methods: Bounds and Convergence Rates

Abstract

This paper develops a general theory for first-order descent methods whose search directions are restricted to a prescribed dictionary in a reflexive Banach space. Instead of assuming that the linear span of the dictionary is dense, as in the classical Proper Generalized Decomposition framework of Falcó and Nouy or in the universality approach of Berná and Falcó, we introduce a geometric condition based on norming sets that guarantees density through a duality argument. This makes it possible to treat dictionaries arising from tensor formats, neural network units, and other nonlinear or parameterized approximation families within a unified setting. On the algorithmic side, we analyze a simple greedy update rule in which each iterate is obtained by minimizing the energy functional along one direction from the dictionary. Under mild differentiability, Lipschitz continuity, and ellipticity assumptions on the objective, we derive explicit quantitative descent bounds and sharp convergence rates. These include algebraic rates that improve those of classical steepest-descent schemes in Banach spaces, as well as arbitrarily high polynomial rates and exponential convergence in a critical regime. The results apply broadly to convex variational problems, high-dimensional approximation, and structured optimization methods that rely on restricted or compressed search directions.
Paper Structure (24 sections, 21 theorems, 144 equations)

This paper contains 24 sections, 21 theorems, 144 equations.

Key Result

Lemma 2.2

BF2025, Canuto2005, FN2012, Zeidler1985. Let $\mathcal{E}$ be a Fréchet differentiable functional on a reflexive Banach space $\mathcal{X}$. Suppose that Assumption (B) holds and that $\mathcal{E}'$ is continuous. Then:

Theorems & Definitions (49)

  • Remark 2.1
  • Lemma 2.2
  • proof
  • Lemma 2.3
  • proof
  • Remark 2.4
  • Lemma 2.5
  • proof
  • Lemma 2.6
  • proof
  • ...and 39 more