Enhancing finite-difference based derivative-free optimization methods with machine learning

Timothé Taminiau; Estelle Massart; Geovani Nunes Grapiglia

TL;DR

This work tackles black-box, possibly nonconvex optimization by augmenting finite-difference derivative-free methods with a surrogate-based heuristic. A surrogate model is trained on the accumulated iterates, function values, and (under Sobolev learning) approximate gradients. The method then takes gradient steps on the surrogate, accepting each step only if it passes an Armijo-type sufficient-decrease check against the true objective; once a step fails, control reverts to the base method and the surrogate is updated with the new data. The authors provide a worst-case complexity bound of $O(n\epsilon^{-2})$ for finding an $\epsilon$-approximate stationary point, and the bound improves through the surrogate gain $\eta(S(T))$ when surrogate steps succeed often. Numerical experiments on a subset of CUTEst problems show substantial performance gains, especially when Sobolev learning exploits the approximate gradients; neural surrogates with SoftPlus activations and Gaussian RBF models deliver the strongest improvements, and the behavior is robust across surrogate models. The framework is a practical, general enhancement for a wide class of finite-difference-based DFO methods and may reduce the number of expensive function evaluations in simulation- or experiment-driven optimization tasks.
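To make the Sobolev-learning idea concrete, the sketch below evaluates a loss that penalizes mismatch in both function values and gradients at the stored points. The names `sobolev_loss`, `model`, `model_grad`, and the weight `lam` are illustrative assumptions, not the authors' notation; the exact loss and weighting used in the paper may differ.

```python
def sobolev_loss(model, model_grad, xs, fs, gs, lam=1.0):
    """Average squared value error plus lam times squared gradient error.

    xs: list of points (lists of floats); fs: function values at xs;
    gs: approximate gradients at xs (e.g., from finite differences).
    lam is a hypothetical weight balancing the two terms.
    """
    total = 0.0
    for x, f_val, g_val in zip(xs, fs, gs):
        value_err = (model(x) - f_val) ** 2
        grad_err = sum((a - b) ** 2 for a, b in zip(model_grad(x), g_val))
        total += value_err + lam * grad_err
    return total / len(xs)


# Toy usage: a quadratic model checked against a single data point.
model = lambda x: sum(xi * xi for xi in x)
model_grad = lambda x: [2.0 * xi for xi in x]
loss = sobolev_loss(model, model_grad, [[1.0, 2.0]], [5.0], [[2.0, 3.0]], lam=0.5)
print(loss)  # value term is 0, gradient term is (4 - 3)^2 = 1, so loss = 0.5
```

Setting `lam=0` recovers an ordinary least-squares fit on function values alone, which corresponds to training the surrogate without gradient information.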

Abstract

Derivative-Free Optimization (DFO) involves methods that rely solely on evaluations of the objective function. One of the earliest strategies for designing DFO methods is to adapt first-order methods by replacing gradients with finite-difference approximations. The execution of such methods generates a rich dataset about the objective function, including iterate points, function values, approximate gradients, and successful step sizes. In this work, we propose a simple auxiliary procedure to leverage this dataset and enhance the performance of finite-difference-based DFO methods. Specifically, our procedure trains a surrogate model using the available data and applies the gradient method with Armijo line search to the surrogate until it fails to ensure sufficient decrease in the true objective function. When that happens, we revert to the original algorithm and improve our surrogate based on the newly available information. As a proof of concept, we integrate this procedure with the derivative-free method proposed in (Optim. Lett. 18: 195–213, 2024). Numerical results demonstrate significant performance improvements, particularly when the approximate gradients are also used to train the surrogates.
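The procedure described in the abstract can be sketched in a few lines. The code below is a minimal illustration under stated assumptions, not the authors' algorithm: the function names (`fd_gradient`, `armijo_on_surrogate`, `surrogate_phase`), the forward-difference scheme, the initial step size of 1, the halving factor, and the constant `c` are all hypothetical choices. It shows a finite-difference gradient estimate, an Armijo backtracking search carried out on the surrogate `m` alone, and an acceptance test that spends one true function evaluation per surrogate step.

```python
def fd_gradient(f, x, h=1e-6):
    """Forward-difference gradient estimate; costs len(x) + 1 evaluations of f."""
    fx = f(x)
    grad = []
    for i in range(len(x)):
        shifted = list(x)
        shifted[i] += h
        grad.append((f(shifted) - fx) / h)
    return grad


def armijo_on_surrogate(m, x, g, gnorm2, c, min_alpha=1e-12):
    """Backtrack on the surrogate m only; returns (trial, alpha) or None."""
    alpha, mx = 1.0, m(x)
    while alpha >= min_alpha:
        trial = [xi - alpha * gi for xi, gi in zip(x, g)]
        if m(trial) <= mx - c * alpha * gnorm2:
            return trial, alpha
        alpha *= 0.5
    return None


def surrogate_phase(f, m, m_grad, x, c=1e-4, max_steps=50):
    """Follow surrogate gradient steps while the true objective f decreases enough."""
    fx = f(x)
    for _ in range(max_steps):
        g = m_grad(x)
        gnorm2 = sum(gi * gi for gi in g)
        if gnorm2 == 0.0:
            break
        step = armijo_on_surrogate(m, x, g, gnorm2, c)
        if step is None:
            break
        trial, alpha = step
        f_trial = f(trial)  # one true evaluation per surrogate step
        if f_trial <= fx - c * alpha * gnorm2:
            x, fx = trial, f_trial
        else:
            break  # surrogate no longer trustworthy: revert to the base method
    return x, fx


# Toy usage: an exact surrogate makes progress, a misleading one is rejected.
f = lambda x: sum(xi * xi for xi in x)
f_grad = lambda x: [2.0 * xi for xi in x]
x_good, f_good = surrogate_phase(f, f, f_grad, [1.0, -2.0])
bad_m = lambda x: -f(x)
bad_grad = lambda x: [-2.0 * xi for xi in x]
x_bad, f_bad = surrogate_phase(f, bad_m, bad_grad, [1.0, -2.0])
g_est = fd_gradient(f, [1.0, -2.0])
```

In this sketch a rejected step costs a single true evaluation, whereas a forward-difference gradient of the true objective costs one evaluation per coordinate; that difference is the intuition behind letting the surrogate drive steps for as long as it keeps passing the check.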
