COLEP: Certifiably Robust Learning-Reasoning Conformal Prediction via Probabilistic Circuits
Mintong Kang, Nezihe Merve Gürel, Linyi Li, Bo Li
TL;DR
This work tackles the challenge of maintaining conformal prediction guarantees under test-time adversarial perturbations by integrating knowledge-enabled reasoning with learning. It introduces COLEP, a learning-reasoning pipeline where a main model and multiple knowledge models feed a PC-based reasoning module to produce corrected class probabilities, which are then used in a split conformal predictor. The authors establish end-to-end robustness certificates by propagating perturbation bounds from the learning component through the reasoning PCs, plus worst-case and finite-sample analyses, and prove that COLEP can outperform a single model in both coverage and accuracy when knowledge utilities are non-trivial. Empirically, COLEP yields substantial gains in certified coverage (up to 12% on GTSRB, 9% on CIFAR-10, 14% on AwA2) and maintains competitive runtimes, validating the approach on diverse benchmarks and illustrating the value of knowledge-guided reasoning for robust uncertainty quantification in safety-critical settings.
Abstract
Conformal prediction has shown spurring performance in constructing statistically rigorous prediction sets for arbitrary black-box machine learning models, assuming the data is exchangeable. However, even small adversarial perturbations during the inference can violate the exchangeability assumption, challenge the coverage guarantees, and result in a subsequent decline in empirical coverage. In this work, we propose a certifiably robust learning-reasoning conformal prediction framework (COLEP) via probabilistic circuits, which comprise a data-driven learning component that trains statistical models to learn different semantic concepts, and a reasoning component that encodes knowledge and characterizes the relationships among the trained models for logic reasoning. To achieve exact and efficient reasoning, we employ probabilistic circuits (PCs) within the reasoning component. Theoretically, we provide end-to-end certification of prediction coverage for COLEP in the presence of bounded adversarial perturbations. We also provide certified coverage considering the finite size of the calibration set. Furthermore, we prove that COLEP achieves higher prediction coverage and accuracy over a single model as long as the utilities of knowledge models are non-trivial. Empirically, we show the validity and tightness of our certified coverage, demonstrating the robust conformal prediction of COLEP on various datasets, including GTSRB, CIFAR10, and AwA2. We show that COLEP achieves up to 12% improvement in certified coverage on GTSRB, 9% on CIFAR-10, and 14% on AwA2.
