Classification under Nuisance Parameters and Generalized Label Shift in Likelihood-Free Inference
Luca Masserano, Alex Shen, Michele Doro, Tommaso Dorigo, Rafael Izbicki, Ann B. Lee
TL;DR
The paper addresses reliable classification under generalized label shift (GLS) in likelihood-free inference by introducing a nuisance-parameter aware framework. It recasts classification as hypothesis testing and derives rejection probabilities across the entire nuisance space, enabling ROC-based calibration that is invariant to GLS. The authors construct nuisance-aware prediction sets (NAPS) with guaranteed (1−α) coverage conditional on both the class and nuisance parameters, and further improve power by restricting nuisance parameters to data-dependent confidence sets (gamma). They validate the approach on synthetic data and two scientific simulators (single-cell RNA sequencing and atmospheric cosmic-ray showers), showing that NAPS provide robust uncertainty quantification under GLS while maintaining high predictive power, and demonstrate practical gains by conditioning on nuisance information when available. This method offers a principled, domain-adaptive way to obtain reliable predictions in heavy-modeling regimes typical of scientific inference.}
Abstract
An open scientific challenge is how to classify events with reliable measures of uncertainty, when we have a mechanistic model of the data-generating process but the distribution over both labels and latent nuisance parameters is different between train and target data. We refer to this type of distributional shift as generalized label shift (GLS). Direct classification using observed data $\mathbf{X}$ as covariates leads to biased predictions and invalid uncertainty estimates of labels $Y$. We overcome these biases by proposing a new method for robust uncertainty quantification that casts classification as a hypothesis testing problem under nuisance parameters. The key idea is to estimate the classifier's receiver operating characteristic (ROC) across the entire nuisance parameter space, which allows us to devise cutoffs that are invariant under GLS. Our method effectively endows a pre-trained classifier with domain adaptation capabilities and returns valid prediction sets while maintaining high power. We demonstrate its performance on two challenging scientific problems in biology and astroparticle physics with data from realistic mechanistic models.
