A Geometric Unification of Distributionally Robust Covariance Estimators: Shrinking the Spectrum by Inflating the Ambiguity Set
Man-Chung Yue, Yves Rychener, Daniel Kuhn, Viet Anh Nguyen
TL;DR
The paper introduces a principled distributionally robust approach to high-dimensional covariance estimation, showing that minimizing worst-case Frobenius prediction loss over a divergence-based ambiguity set around a nominal covariance yields nonlinear spectral shrinkage. By diagonalizing the nominal estimator and solving a univariate equation for a shrinkage intensity, the authors obtain estimators that preserve eigenvectors while shrinking eigenvalues, with explicit formulas for KL, Wasserstein, and Fisher-Rao divergences. The framework provides existence, uniqueness, efficient computation, and strong statistical guarantees, including consistency and finite-sample bounds, without assuming Gaussianity. Numerical experiments demonstrate competitive performance against state-of-the-art shrinkage methods in portfolio optimization and discriminant analysis, highlighting robustness benefits in ill-conditioned, high-dimensional settings.
Abstract
The state-of-the-art methods for estimating high-dimensional covariance matrices all shrink the eigenvalues of the sample covariance matrix towards a data-insensitive shrinkage target. The underlying shrinkage transformation is either chosen heuristically - without compelling theoretical justification - or optimally in view of restrictive distributional assumptions. In this paper, we propose a principled approach to construct covariance estimators without imposing restrictive assumptions. That is, we study distributionally robust covariance estimation problems that minimize the worst-case Frobenius error with respect to all data distributions close to a nominal distribution, where the proximity of distributions is measured via a divergence on the space of covariance matrices. We identify mild conditions on this divergence under which the resulting minimizers represent shrinkage estimators. We show that the corresponding shrinkage transformations are intimately related to the geometrical properties of the underlying divergence. We also prove that our robust estimators are efficiently computable and asymptotically consistent and that they enjoy finite-sample performance guarantees. We exemplify our general methodology by synthesizing explicit estimators induced by the Kullback-Leibler, Fisher-Rao, and Wasserstein divergences. Numerical experiments based on synthetic and real data show that our robust estimators are competitive with state-of-the-art estimators.
