Scalable Differentially Private Bayesian Optimization
Getoar Sopa, Juraj Marusic, Marco Avella-Medina, John P. Cunningham
TL;DR
This work introduces DP-GIBO, a scalable private optimization method for high-dimensional continuous hyperparameters that combines local Bayesian optimization with gradient information obtained from a Gaussian Process surrogate. By clipping gradient estimates, adaptively selecting evaluation points, and adding Gaussian privacy noise, the algorithm provides a formal Gaussian Differential Privacy guarantee while converging to a local minimum in the noiseless case or to a privacy-affected neighborhood when observations are noisy. On the theory side, the work proves exponential convergence in the noiseless setting, with linear scaling in the dimension, and dimension-dependent convergence bounds in the noisy case, with polynomial scaling in the dimension under privacy and observation noise. Empirically, DP-GIBO outperforms existing private hyperparameter tuning methods in high-dimensional spaces and remains competitive with non-private GIBO, demonstrating its potential for privacy-preserving tuning of large-scale models.
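The clip-then-add-noise step described above can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the `sensitivity` argument, and the plain gradient-descent update are all assumptions made for illustration, and the Gaussian Process surrogate and the adaptive choice of evaluation points are not modeled. The sketch relies only on two standard facts about Gaussian Differential Privacy: a Gaussian mechanism with L2 sensitivity Δ and noise standard deviation σ is μ-GDP with μ = Δ/σ, and composing T mechanisms that are each μ-GDP gives √T·μ-GDP.

```python
import numpy as np


def clip_by_norm(g, clip_norm):
    """Rescale g so that its L2 norm is at most clip_norm."""
    norm = np.linalg.norm(g)
    if norm == 0.0:
        return g
    return g * min(1.0, clip_norm / norm)


def per_step_mu(mu_total, num_steps):
    """Per-step GDP budget so that num_steps releases compose to mu_total.

    Uses the GDP composition rule: T releases at level mu give sqrt(T) * mu.
    """
    return mu_total / np.sqrt(num_steps)


def private_gradient_step(x, grad_estimate, step_size, clip_norm,
                          sensitivity, mu_step, rng):
    """One clipped, noised descent step (illustrative, hypothetical helper).

    ASSUMPTION: `sensitivity` is the L2 sensitivity of the clipped gradient
    estimate with respect to the private data. It is supplied by the caller;
    this sketch does not derive it.
    """
    clipped = clip_by_norm(grad_estimate, clip_norm)
    # Gaussian mechanism: sigma = sensitivity / mu makes this release mu-GDP.
    sigma = sensitivity / mu_step
    noisy = clipped + rng.normal(0.0, sigma, size=clipped.shape)
    return x - step_size * noisy


def run_demo(dim=5, num_steps=50, mu_total=1.0, seed=0):
    """Toy run on f(x) = 0.5 * ||x||^2, whose exact gradient is x.

    The exact gradient stands in for a surrogate-based gradient estimate,
    and the sensitivity value below is a placeholder, not a derived bound.
    """
    rng = np.random.default_rng(seed)
    x = np.ones(dim)
    mu_step = per_step_mu(mu_total, num_steps)
    for _ in range(num_steps):
        x = private_gradient_step(x, grad_estimate=x, step_size=0.1,
                                  clip_norm=1.0, sensitivity=0.01,
                                  mu_step=mu_step, rng=rng)
    return x
```

Calling `run_demo()` returns the final iterate of the toy problem. Tightening the budget `mu_total` increases the noise scale, so the iterates settle in a wider neighborhood of the minimizer, which is the qualitative behavior the summary calls a privacy-affected neighborhood.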
Abstract
In recent years, there has been much work on scaling Bayesian Optimization to high-dimensional problems, for example hyperparameter tuning in large machine learning models. These scalable methods have been successful, finding high objective values much more quickly than traditional global Bayesian Optimization or random search-based methods. At the same time, these large models often use sensitive data, but preservation of Differential Privacy has not scaled alongside these modern Bayesian Optimization procedures. Here we develop a method to privately optimize potentially high-dimensional parameter spaces using privatized Gradient Informative Bayesian Optimization. Our theoretical results show that under suitable conditions, our method converges exponentially fast to a locally optimal parameter configuration, up to a natural privacy error. Moreover, regardless of whether the assumptions are satisfied, we prove that our algorithm maintains privacy, and we show empirically that it outperforms existing methods in the high-dimensional hyperparameter setting.
