An optimally fast objective-function-free minimization algorithm using random subspaces

S. Bellavia; S. Gratton; B. Morini; Ph. L. Toint

An optimally fast objective-function-free minimization algorithm using random subspaces

S. Bellavia, S. Gratton, B. Morini, Ph. L. Toint

TL;DR

This work introduces SKOFFAR$p$, an objective-function-free adaptive-regularisation method that minimizes nonconvex functions in randomly chosen subspaces. By sketching the $p$-th order regularised model and reconstructing full steps, it preserves convergence guarantees and attains the optimal evaluation complexity $O\left(\epsilon^{-{(p+1)/p}}\right)$ for first-order critical points, with a second-order variant achieving $O\left(\epsilon^{-2}\right)$. The analysis combines random-subspace embeddings with adaptive regularisation, showing that randomness can reduce per-iteration cost without compromising global guarantees, particularly for problems with low-rank Hessians. Numerical experiments on CUTEst problems illustrate substantial practical gains when using second-order models in low-rank settings and highlight competitive performance against established first-order OFFO methods.

Abstract

An algorithm for unconstrained non-convex optimization is described, which does not evaluate the objective function and in which minimization is carried out, at each iteration, within a randomly selected subspace. It is shown that this random approximation technique does not affect the method's convergence nor its evaluation complexity for the search of an $ε$-approximate first-order critical point, which is $\mathcal{O}(ε^{-(p+1)/p})$, where $p$ is the order of derivatives used. A variant of the algorithm using approximate Hessian matrices is also analysed and shown to require at most $\mathcal{O}(ε^{-2})$ evaluations. Preliminary numerical tests show that the random-subspace technique can significantly improve performance when used with $p=2$ in the correct context, making it very competitive when compared to standard first-order algorithms.

An optimally fast objective-function-free minimization algorithm using random subspaces

TL;DR

This work introduces SKOFFAR

, an objective-function-free adaptive-regularisation method that minimizes nonconvex functions in randomly chosen subspaces. By sketching the

-th order regularised model and reconstructing full steps, it preserves convergence guarantees and attains the optimal evaluation complexity

for first-order critical points, with a second-order variant achieving

. The analysis combines random-subspace embeddings with adaptive regularisation, showing that randomness can reduce per-iteration cost without compromising global guarantees, particularly for problems with low-rank Hessians. Numerical experiments on CUTEst problems illustrate substantial practical gains when using second-order models in low-rank settings and highlight competitive performance against established first-order OFFO methods.

Abstract

-approximate first-order critical point, which is

, where

is the order of derivatives used. A variant of the algorithm using approximate Hessian matrices is also analysed and shown to require at most

evaluations. Preliminary numerical tests show that the random-subspace technique can significantly improve performance when used with

in the correct context, making it very competitive when compared to standard first-order algorithms.

Paper Structure (8 sections, 14 theorems, 51 equations, 1 figure, 2 tables)

This paper contains 8 sections, 14 theorems, 51 equations, 1 figure, 2 tables.

Introduction
An OFFO adaptive regularisation algorithm using random subspaces
The SKOFFAR$p$ algorithm
Evaluation complexity for the SKOFFAR$p$ algorithm
Selecting random subspaces
Numerical illustration
Conclusions and perspectives
Quadratic regularisation for approximate second-order models

Key Result

Lemma 3.1

Suppose that AS.1 and AS.3 hold. Then and

Figures (1)

Figure 1: The behaviour of $f(x)$ when ADAM-N, ADAG-N and SKOFFAR$2$ are run on rosenbr, as a function of $w_1$-weighted iteration numbers, where SKOFFAR$2$ uses $\tau =10^{-1}, 5 \cdot 10^{-2}, 2.5\cdot 10^{-2}, 10^{-2}, 5.10^{-3}$ and $10^{-3}$ (from right to left)

Theorems & Definitions (15)

Lemma 3.1
Lemma 3.2
Lemma 3.3
Lemma 3.4
Lemma 3.5
Lemma 3.6
Lemma 3.7
Lemma 3.8
Definition 3.9
Lemma 3.10
...and 5 more

An optimally fast objective-function-free minimization algorithm using random subspaces

TL;DR

Abstract

An optimally fast objective-function-free minimization algorithm using random subspaces

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (15)