Sample-Efficient "Clustering and Conquer" Procedures for Parallel Large-Scale Ranking and Selection
Zishi Zhang, Yijie Peng
TL;DR
This work tackles the inefficiency of large-scale ranking and selection (R&S) when many alternatives are evaluated in parallel. It introduces Parallel Correlation Clustering and Conquer (P3C), a framework that clusters correlated alternatives and assigns clusters to processors to improve sample efficiency; it also develops a gradient-based analysis of mean-covariance interactions and a few-shot clustering method AC^+ for scalability. It proves that, under a symmetric benchmark, P3C can achieve an $O(p)$ reduction in total samples for sample-optimal R&S procedures, and provides PCC-based guarantees for clustering. It demonstrates substantial empirical gains in fixed-precision drug discovery and fixed-budget neural architecture search (NAS), with P3C reducing required samples and lowering wall-clock time while incurring modest clustering overhead.
Abstract
This work aims to improve the sample efficiency of parallel large-scale ranking and selection (R&S) problems by leveraging correlation information. We modify the commonly used "divide and conquer" framework in parallel computing by adding a correlation-based clustering step, transforming it into "clustering and conquer". Analytical results under a symmetric benchmark scenario show that this seemingly simple modification yields an $\mathcal{O}(p)$ reduction in sample complexity for a widely used class of sample-optimal R&S procedures. Our approach enjoys two key advantages: 1) it does not require highly accurate correlation estimation or precise clustering, and 2) it allows for seamless integration with various existing R&S procedures, while achieving optimal sample complexity. Theoretically, we develop a novel gradient analysis framework to analyze sample efficiency and guide the design of large-scale R&S procedures. We also introduce a new parallel clustering algorithm tailored for large-scale scenarios. Finally, in large-scale AI applications such as neural architecture search, our methods demonstrate superior performance.
