Dynamic Anisotropic Smoothing for Noisy Derivative-Free Optimization
Sam Reifenstein, Timothee Leleu, Yoshihisa Yamamoto
TL;DR
Derivative-free optimization under noisy evaluations is improved by dynamic anisotropic smoothing (DAS), which adapts the sampling kernel to heterogeneous curvature by converging toward the local Hessian $\nabla^2 f$ near optima. The framework comprises DIS (dynamic isotropic smoothing) and DAS, where DAS uses a matrix $L$ to shape the window and stochastic-dynamics for $x$ and $L$, yielding Hessian-aligned gradient estimates. The authors show that the gradient-estimation error is minimized when the kernel aligns with the Hessian eigenbasis, and demonstrate superior performance over derivative-free and Bayesian tuners on artificial benchmarks and NP-hard combinatorial solvers (e.g., SAT, Ising) under noise. These results suggest robust, curvature-aware optimization in high-noise settings and broader applicability to hyperparameter tuning and neural-network training with heterogeneous sensitivity across directions.
Abstract
We propose a novel algorithm that extends the methods of ball smoothing and Gaussian smoothing for noisy derivative-free optimization by accounting for the heterogeneous curvature of the objective function. The algorithm dynamically adapts the shape of the smoothing kernel to approximate the Hessian of the objective function around a local optimum. This approach significantly reduces the error in estimating the gradient from noisy evaluations through sampling. We demonstrate the efficacy of our method through numerical experiments on artificial problems. Additionally, we show improved performance when tuning NP-hard combinatorial optimization solvers compared to existing state-of-the-art heuristic derivative-free and Bayesian optimization methods.
