Random matrix theory improved Fréchet mean of symmetric positive definite matrices
Florent Bouchard, Ammar Mian, Malik Tiomoko, Guillaume Ginolhac, Frédéric Pascal
TL;DR
The paper tackles the problem of computing Fréchet means of symmetric positive definite covariance matrices in high-dimensional, low-sample regimes. It introduces an RMT-based correction to the squared Fisher distance and a one-step Fréchet mean algorithm that operates directly on the SPD manifold, leveraging random-matrix theory to reduce dependence on limited data. It also provides an improved covariance estimator and extends the RMT mean to learning tasks like Nearest Centroid and K-Means, demonstrating advantages on synthetic data and real EEG and hyperspectral datasets. The findings indicate improved robustness and accuracy when averaging many covariance matrices under modest sample sizes, offering practical impact for covariance-based learning in challenging data regimes.
Abstract
In this study, we consider the realm of covariance matrices in machine learning, particularly focusing on computing Fréchet means on the manifold of symmetric positive definite matrices, commonly referred to as Karcher or geometric means. Such means are leveraged in numerous machine-learning tasks. Relying on advanced statistical tools, we introduce a random matrix theory-based method that estimates Fréchet means, which is particularly beneficial when dealing with low sample support and a high number of matrices to average. Our experimental evaluation, involving both synthetic and real-world EEG and hyperspectral datasets, shows that we largely outperform state-of-the-art methods.
