Multi-Condition Conformal Selection
Qingyang Hao, Wenbo Liao, Bingyi Jing, Hongxin Wei
TL;DR
This work extends conformal selection to multi-condition settings, enabling FDR-controlled candidate selection under conjunctive and disjunctive constraints by introducing regionally monotone nonconformity scores for intervals and a global BH-based testing scheme. The proposed MCCS framework guarantees finite-sample FDR control across diverse target structures, including multi-interval and multivariate targets, and is validated through extensive simulations and real-data experiments spanning NLP, computer vision, multimodal tasks, and multi-class problems. Key contributions include a conjunctive-specific nonconformity construction, a global BH procedure for disjunctive targets, theoretical FDR guarantees, and extensions to more complex combinations and multivariate responses. The approach demonstrates robust performance, preserving FDR control while achieving competitive power across modalities and task types, with practical implications for drug discovery, precision medicine, and LLM alignment where resource constraints and multiple selection criteria are common.
Abstract
Selecting high-quality candidates from large-scale datasets is critically important in resource-constrained applications such as drug discovery, precision medicine, and the alignment of large language models. While conformal selection methods offer a rigorous solution with False Discovery Rate (FDR) control, their applicability is confined to single-threshold scenarios (i.e., y > c) and overlooks practical needs for multi-condition selection, such as conjunctive or disjunctive conditions. In this work, we propose the Multi-Condition Conformal Selection (MCCS) algorithm, which extends conformal selection to scenarios with multiple conditions. In particular, we introduce a novel nonconformity score with regional monotonicity for conjunctive conditions and a global Benjamini-Hochberg (BH) procedure for disjunctive conditions, thereby establishing finite-sample FDR control with theoretical guarantees. The integration of these components enables the proposed method to achieve rigorous FDR-controlled selection in various multi-condition environments. Extensive experiments validate the superiority of MCCS over baselines, its generalizability across diverse condition combinations, different real-world modalities, and multi-task scalability.
