Finite-Sample Valid Rank Confidence Sets for a Broad Class of Statistical and Machine Learning Models
Onrina Chandra, Min-ge Xie
TL;DR
This work develops a finite-sample, nonparametric framework for inferring ranks across K populations by extending the Repro-Samples approach to discrete rank parameters. It constructs level-$1-\alpha$ rank confidence sets via inversion of neighborhood constraints and introduces a data-adaptive, discordance-based candidate set to keep computation tractable while preserving coverage. The method is validated through case studies and extensive simulations across unknown distributions, regression-based rankings, and Plackett–Luce top-choice data, showing robust finite-sample coverage and sharper, interpretable rank intervals than bootstrap or asymptotic alternatives. The framework unifies multiple ranking settings and is demonstrated to be practically applicable to wealth quantiles, sports standings, joke rankings, and hospital performance with heteroscedastic data.
Abstract
Ranking populations such as institutions based on certain characteristics is often of interest, and these ranks are typically estimated using samples drawn from the populations. Due to sample randomness, it is important to quantify the uncertainty associated with the estimated ranks. This becomes crucial when latent characteristics are poorly separated and where many rank estimates may be incorrectly ordered. Understanding uncertainty can help quantify and mitigate these issues and provide a fuller picture. However, this task is especially challenging because the rank parameters are discrete and the central limit theorem does not apply to the rank estimates. In this article, we propose a Repro Samples Method to address this nontrivial inference problem by developing a confidence set for the true, unobserved population ranks. This method provides finite-sample coverage guarantees and is broadly applicable to ranking problems. The effectiveness of the method is illustrated and compared with several published large sample ranking approaches using simulation studies and real data examples involving samples both from traditional statistical models and modern data science algorithms.
