The radius of statistical efficiency
Joshua Cutler, Mateo Díaz, Dmitriy Drusvyatskiy
TL;DR
This work introduces the radius of statistical efficiency (RSE), a robustness measure that quantifies how close a statistical estimation problem is to ill-posedness by the smallest Wasserstein-2 perturbation to the data distribution that makes the Fisher-information-like matrix singular. It establishes a reciprocal relationship between RSE and the intrinsic statistical difficulty captured by the minimal eigenvalue of the Fisher-information-like object, paralleling classical Eckart–Young-type results in numerical linear algebra. The authors develop a general theory based on tilt-stable minimizers and slope-based error bounds, and compute RSE with numerical constants for PCA, generalized linear models, phase retrieval, bilinear sensing, and matrix completion. Across these problems, RSE quantifies well-posed neighborhoods and yields insight into how problem structure and data perturbations govern estimation difficulty, providing a unified, spectral-measure framework for robustness in statistics. The framework also introduces spectral-function techniques for measures, linking infinite-dimensional problems to tractable matrix-function computations via Wasserstein/Bures reductions.
Abstract
Classical results in asymptotic statistics show that the Fisher information matrix controls the difficulty of estimating a statistical model from observed data. In this work, we introduce a companion measure of robustness of an estimation problem: the radius of statistical efficiency (RSE) is the size of the smallest perturbation to the problem data that renders the Fisher information matrix singular. We compute RSE up to numerical constants for a variety of testbed problems, including principal component analysis, generalized linear models, phase retrieval, bilinear sensing, and matrix completion. Interestingly, we observe a precise reciprocal relationship between RSE and the intrinsic complexity/sensitivity of the problem instance, paralleling the classical Eckart-Young theorem in numerical analysis. To establish our results, we develop theory for spectral functions of measures that extends well-known results from matrix analysis and eigenvalue optimization$-$a contribution that may be of interest beyond our immediate findings.
