Provably Efficient Bayesian Optimization with Unknown Gaussian Process Hyperparameter Estimation
Huong Ha, Vu Nguyen, Hung Tran-The, Hongyu Zhang, Xiuzhen Zhang, Anton van den Hengel
TL;DR
The paper tackles Bayesian optimization when GP hyperparameters are unknown and must be learned from biased, non-i.i.d. data. It introduces Unknown Hyperparameter Estimation for Bayesian Optimization (UHE), combining an EXP3-based iid-sampling mechanism with a consistent GP hyperparameter loss to ensure reliable estimation. Theoretical results establish high-probability sub-linear regret and convergence of the hyperparameters to the true value, while empirical evaluations on synthetic and real-world tasks show UHE outperforms existing approaches. This work enhances BO robustness in practical settings where hyperparameters are not known a priori and data collection is guided by acquisition strategies.
Abstract
Gaussian process (GP) based Bayesian optimization (BO) is a powerful method for optimizing black-box functions efficiently. The practical performance and theoretical guarantees of this approach depend on having the correct GP hyperparameter values, which are usually unknown in advance and need to be estimated from the observed data. However, in practice, these estimations could be incorrect due to biased data sampling strategies used in BO. This can lead to degraded performance and break the sub-linear global convergence guarantee of BO. To address this issue, we propose a new BO method that can sub-linearly converge to the objective function's global optimum even when the true GP hyperparameters are unknown in advance and need to be estimated from the observed data. Our method uses a multi-armed bandit technique (EXP3) to add random data points to the BO process, and employs a novel training loss function for the GP hyperparameter estimation process that ensures consistent estimation. We further provide theoretical analysis of our proposed method. Finally, we demonstrate empirically that our method outperforms existing approaches on various synthetic and real-world problems.
