Distribution-Free Rates in Neyman-Pearson Classification
Mohammadreza M. Kalan, Samory Kpotufe
TL;DR
This paper characterizes distribution-free minimax rates for Neyman-Pearson classification over a fixed VC class $\mathcal{H}$ by introducing a geometric three-points-separation condition that separates easy vs. hard rate regimes. When $\mathcal{H}$ separates three points, the minimax excess risk decays as $\tilde{\Theta}(n^{-1/2})$ (up to logarithmic factors); otherwise, rates improve to $\tilde{\Theta}(d_{\mathcal{H}}/n)$ or even zero, depending on the existence of a maximal element in $\mathcal{H}_{\alpha}(\mu_0)$. The results extend to the unknown $\mu_0$ case with slack $\epsilon_0$ and approximate level $\alpha$, preserving the same dichotomy under additional technical conditions (e.g., finitely supported $\mu_0$). The upper and lower bounds match up to $\log n$ terms, highlighting a fundamental link between hypothesis-class structure and achievable rates in distribution-free Neyman-Pearson problems.
Abstract
We consider the problem of Neyman-Pearson classification which models unbalanced classification settings where error w.r.t. a distribution $μ_1$ is to be minimized subject to low error w.r.t. a different distribution $μ_0$. Given a fixed VC class $\mathcal{H}$ of classifiers to be minimized over, we provide a full characterization of possible distribution-free rates, i.e., minimax rates over the space of all pairs $(μ_0, μ_1)$. The rates involve a dichotomy between hard and easy classes $\mathcal{H}$ as characterized by a simple geometric condition, a three-points-separation condition, loosely related to VC dimension.
