Black-Box Optimization with Implicit Constraints for Public Policy
Wenqian Xing, JungHo Lee, Chong Liu, Shixiang Zhu
TL;DR
The paper tackles implicit constraints in black-box optimization for public policy and introduces CageBO, which learns a constraint-free latent space $\mathcal{Z}$ via a conditional variational autoencoder and performs Bayesian optimization there while decoding to feasible decisions in $\mathcal{X}$. It provides a post-decoding projection and proves a no-regret upper bound $\mathbb{E}[R_T] = \widetilde{O}(\sqrt{T \gamma_T} + \sqrt{d} (n+T)^{d/(d+1)})$ under standard GP-Bandit assumptions. The method is validated on synthetic benchmarks and a large-scale Atlanta police redistricting case, showing superior performance and efficiency over baselines. This approach enables efficient exploration of high-dimensional, implicitly constrained policy spaces with practical impact for policymaking, while acknowledging fairness considerations and the need for debiasing techniques.
Abstract
Black-box optimization (BBO) has become increasingly relevant for tackling complex decision-making problems, especially in public policy domains such as police redistricting. However, its broader application in public policymaking is hindered by the complexity of defining feasible regions and the high-dimensionality of decisions. This paper introduces a novel BBO framework, termed as the Conditional And Generative Black-box Optimization (CageBO). This approach leverages a conditional variational autoencoder to learn the distribution of feasible decisions, enabling a two-way mapping between the original decision space and a simplified, constraint-free latent space. The CageBO efficiently handles the implicit constraints often found in public policy applications, allowing for optimization in the latent space while evaluating objectives in the original space. We validate our method through a case study on large-scale police redistricting problems in Atlanta, Georgia. Our results reveal that our CageBO offers notable improvements in performance and efficiency compared to the baselines.
