Collaborative Bayesian Optimization via Wasserstein Barycenters
Donglin Zhan, Haoting Zhang, Rhonda Righter, Zeyu Zheng, James Anderson
TL;DR
This work addresses black-box optimization under data privacy by enabling collaboration among $N$ agents who share GP surrogates with a central server. The central model is constructed as a Wasserstein barycenter of local GPs, preserving privacy while providing explicit uncertainty and enabling a collaborative acquisition called Collaborative Knowledge Gradient (Co-KG) that blends central and local guidance. The authors prove asymptotic consistency of Co-KG and its MC-based approximation, and demonstrate through experiments that Co-KG outperforms non-collaborative baselines and is competitive with privacy-unrestricted centralized approaches. The framework shows practical promise for privacy-conscious distributed optimization in engineering and ML settings, with guidance on hyperparameters and discretization trade-offs.
Abstract
Motivated by the growing need for black-box optimization and data privacy, we introduce a collaborative Bayesian optimization (BO) framework that addresses both of these challenges. In this framework agents work collaboratively to optimize a function they only have oracle access to. In order to mitigate against communication and privacy constraints, agents are not allowed to share their data but can share their Gaussian process (GP) surrogate models. To enable collaboration under these constraints, we construct a central model to approximate the objective function by leveraging the concept of Wasserstein barycenters of GPs. This central model integrates the shared models without accessing the underlying data. A key aspect of our approach is a collaborative acquisition function that balances exploration and exploitation, allowing for the optimization of decision variables collaboratively in each iteration. We prove that our proposed algorithm is asymptotically consistent and that its implementation via Monte Carlo methods is numerically accurate. Through numerical experiments, we demonstrate that our approach outperforms other baseline collaborative frameworks and is competitive with centralized approaches that do not consider data privacy.
