Offline Stochastic Optimization of Black-Box Objective Functions
Juncheng Dong, Zihao Wu, Hamid Jafarkhani, Ali Pezeshki, Vahid Tarokh
TL;DR
This work introduces Stochastic Offline BBO (SOBBO), which optimizes a black-box objective under uncontrollable stochasticity using only offline data. It presents two regime-specific solutions: Estimate-Then-Differentiate (ETD) for large-data settings, which learns a differentiable surrogate and uses its gradient to perform standard gradient-based optimization with a consistency guarantee; and Deep Gradient Interpolation (DGI) for scarce-data settings, which directly learns a conservative gradient field enforcing balance, reconstruction, and path-independence to yield robust gradient estimates. Theoretical results establish the consistency of ETD's gradient estimates, and extensive experiments on synthetic benchmarks and real-world tasks show that both ETD and DGI outperform baselines, with DGI delivering particularly strong gradient accuracy under data scarcity. Overall, the approach bridges offline data-driven learning with stochastic optimization, enabling efficient, reliable design optimization when function evaluations are expensive and randomness is pervasive.
Abstract
Many challenges in science and engineering, such as drug discovery and communication network design, involve optimizing complex and expensive black-box functions across vast search spaces. Thus, it is essential to leverage existing data to avoid costly active queries of these black-box functions. To this end, while Offline Black-Box Optimization (BBO) is effective for deterministic problems, it may fall short in capturing the stochasticity of real-world scenarios. To address this, we introduce Stochastic Offline BBO (SOBBO), which tackles both black-box objectives and uncontrolled uncertainties. We propose two solutions: for large-data regimes, a differentiable surrogate allows for gradient-based optimization, while for scarce-data regimes, we directly estimate gradients under conservative field constraints, improving robustness, convergence, and data efficiency. Numerical experiments demonstrate the effectiveness of our approach on both synthetic and real-world tasks.
