Confidence and Assurance of Percentiles
Sanjay M. Joshi
TL;DR
The paper addresses the lack of rigorous confidence qualifiers for percentile and tolerance statements by deriving a distribution-free, binomial-based expression for the confidence in the $p$-th percentile using order statistics, and by introducing tolerance and assurance intervals that pair percentile information with a desired confidence. It provides closed-form-like expressions via the binomial CDF $c = \mathrm{CDF}(j-1;n,p)$ and extends the framework to central tolerance intervals, as well as to assurance intervals computed with Brent's method. An open-source Python package, joshi23, accompanies the work, enabling practical calculation and demonstration through Jupyter notebooks. Overall, the work offers distribution-agnostic, interpretable tools to communicate percentile uncertainty and tolerance in applied settings, and highlights how median and mean confidence intervals can behave differently depending on the underlying distribution.
Abstract
Confidence interval of mean is often used when quoting statistics. The same rigor is often missing when quoting percentiles and tolerance or percentile intervals. This article derives the expression for confidence in percentiles of a sample population. Confidence intervals of median is compared to those of mean for a few sample distributions. The concept of assurance from reliability engineering is then extended to percentiles. The assurance level of sorted samples simply matches the confidence and percentile levels. Numerical method to compute assurance using Brent's optimization method is provided as an open-source python package.
