Locality-aware Surrogates for Gradient-based Black-box Optimization
Ali Momeni, Stefan Uhlich, Arun Venkitaraman, Chia-Yu Hsieh, Andrea Bonetti, Ryoga Matsuo, Eisaku Ohbuchi, Lorenzo Servadei
TL;DR
This work introduces GradPIE, a locality-aware surrogate training objective derived from the Gradient Theorem, to align a surrogate’s gradients with those of a non-differentiable black-box. By training surrogates with GradPIE using either offline data or online updates, the method improves gradient estimation and accelerates gradient-based optimization under limited query budgets. Empirical results on CNON, OpAmp, and OWMS show substantial gains in gradient accuracy and optimization efficiency, with notable query-budget reductions and robustness in high-dimensional settings. The approach provides a principled pathway to more reliable gradient estimates for black-box simulators and physical systems, with potential extensions to analog neural networks and reinforcement learning.
Abstract
In physics and engineering, many processes are modeled using non-differentiable black-box simulators, making the optimization of such functions particularly challenging. To address such cases, inspired by the Gradient Theorem, we propose locality-aware surrogate models for active model-based black-box optimization. We first establish a theoretical connection between gradient alignment and the minimization of a Gradient Path Integral Equation (GradPIE) loss, which enforces consistency of the surrogate's gradients in local regions of the design space. Leveraging this theoretical insight, we develop a scalable training algorithm that minimizes the GradPIE loss, enabling both offline and online learning while maintaining computational efficiency. We evaluate our approach on three real-world tasks - spanning automated in silico experiments such as coupled nonlinear oscillators, analog circuits, and optical systems - and demonstrate consistent improvements in optimization efficiency under limited query budgets. Our results offer dependable solutions for both offline and online optimization tasks where reliable gradient estimation is needed.
