High-dimensional Level Set Estimation with Trust Regions and Double Acquisition Functions
Giang Ngo, Dat Phan Trong, Dang Nguyen, Sunil Gupta
TL;DR
This work tackles high-dimensional level-set estimation for expensive black-box functions by introducing TRLSE, a multi-trust-region framework that jointly leverages a global acquisition function to locate the threshold boundary and local acquisition functions to refine it within regions. The method provides theoretical guarantees for classification accuracy outside the trust regions and demonstrates superior sample efficiency over baselines on synthetic and real-world problems up to 1000 dimensions. Empirically, TRLSE achieves competitive or superior F1-scores while maintaining reasonable runtimes, and ablation studies highlight the importance of boundary-centered TR updates, local GP modeling, and informed region reinitialization. The approach offers a scalable, theoretically grounded solution for HDLSE with practical impact on applications requiring accurate level-set delineation under costly evaluations.
Abstract
Level set estimation (LSE) classifies whether an unknown function's value exceeds a specified threshold for given inputs, a fundamental problem in many real-world applications. In active learning settings with limited initial data, we aim to iteratively acquire informative points to construct an accurate classifier for this task. In high-dimensional spaces, this becomes challenging where the search volume grows exponentially with increasing dimensionality. We propose TRLSE, an algorithm for high-dimensional LSE, which identifies and refines regions near the threshold boundary with dual acquisition functions operating at both global and local levels. We provide a theoretical analysis of TRLSE's accuracy and show its superior sample efficiency against existing methods through extensive evaluations on multiple synthetic and real-world LSE problems.
