Table of Contents
Fetching ...

An optimally fast objective-function-free minimization algorithm using random subspaces

S. Bellavia, S. Gratton, B. Morini, Ph. L. Toint

TL;DR

This work introduces SKOFFAR$p$, an objective-function-free adaptive-regularisation method that minimizes nonconvex functions in randomly chosen subspaces. By sketching the $p$-th order regularised model and reconstructing full steps, it preserves convergence guarantees and attains the optimal evaluation complexity $O\left(\epsilon^{-{(p+1)/p}}\right)$ for first-order critical points, with a second-order variant achieving $O\left(\epsilon^{-2}\right)$. The analysis combines random-subspace embeddings with adaptive regularisation, showing that randomness can reduce per-iteration cost without compromising global guarantees, particularly for problems with low-rank Hessians. Numerical experiments on CUTEst problems illustrate substantial practical gains when using second-order models in low-rank settings and highlight competitive performance against established first-order OFFO methods.

Abstract

An algorithm for unconstrained non-convex optimization is described, which does not evaluate the objective function and in which minimization is carried out, at each iteration, within a randomly selected subspace. It is shown that this random approximation technique does not affect the method's convergence nor its evaluation complexity for the search of an $ε$-approximate first-order critical point, which is $\mathcal{O}(ε^{-(p+1)/p})$, where $p$ is the order of derivatives used. A variant of the algorithm using approximate Hessian matrices is also analysed and shown to require at most $\mathcal{O}(ε^{-2})$ evaluations. Preliminary numerical tests show that the random-subspace technique can significantly improve performance when used with $p=2$ in the correct context, making it very competitive when compared to standard first-order algorithms.

An optimally fast objective-function-free minimization algorithm using random subspaces

TL;DR

This work introduces SKOFFAR, an objective-function-free adaptive-regularisation method that minimizes nonconvex functions in randomly chosen subspaces. By sketching the -th order regularised model and reconstructing full steps, it preserves convergence guarantees and attains the optimal evaluation complexity for first-order critical points, with a second-order variant achieving . The analysis combines random-subspace embeddings with adaptive regularisation, showing that randomness can reduce per-iteration cost without compromising global guarantees, particularly for problems with low-rank Hessians. Numerical experiments on CUTEst problems illustrate substantial practical gains when using second-order models in low-rank settings and highlight competitive performance against established first-order OFFO methods.

Abstract

An algorithm for unconstrained non-convex optimization is described, which does not evaluate the objective function and in which minimization is carried out, at each iteration, within a randomly selected subspace. It is shown that this random approximation technique does not affect the method's convergence nor its evaluation complexity for the search of an -approximate first-order critical point, which is , where is the order of derivatives used. A variant of the algorithm using approximate Hessian matrices is also analysed and shown to require at most evaluations. Preliminary numerical tests show that the random-subspace technique can significantly improve performance when used with in the correct context, making it very competitive when compared to standard first-order algorithms.
Paper Structure (8 sections, 14 theorems, 51 equations, 1 figure, 2 tables)

This paper contains 8 sections, 14 theorems, 51 equations, 1 figure, 2 tables.

Key Result

Lemma 3.1

Suppose that AS.1 and AS.3 hold. Then and

Figures (1)

  • Figure 1: The behaviour of $f(x)$ when ADAM-N, ADAG-N and SKOFFAR$2$ are run on rosenbr, as a function of $w_1$-weighted iteration numbers, where SKOFFAR$2$ uses $\tau =10^{-1}, 5 \cdot 10^{-2}, 2.5\cdot 10^{-2}, 10^{-2}, 5.10^{-3}$ and $10^{-3}$ (from right to left)

Theorems & Definitions (15)

  • Lemma 3.1
  • Lemma 3.2
  • Lemma 3.3
  • Lemma 3.4
  • Lemma 3.5
  • Lemma 3.6
  • Lemma 3.7
  • Lemma 3.8
  • Definition 3.9
  • Lemma 3.10
  • ...and 5 more