Covariance Scattering Transforms

Andrea Cavallo; Ayushman Raghuvanshi; Sundeep Prabhakar Chepuri; Elvin Isufi

TL;DR

The paper tackles robust, unsupervised covariance-based representation learning. It introduces Covariance Scattering Transforms (CSTs), deep untrained architectures built from covariance wavelets that spectrally filter the covariance matrix and produce hierarchical embeddings, with pruning for efficiency. It proves permutation equivariance and stability to finite-sample perturbations, with error bounds that scale as $O(1/\sqrt{T})$ and are less sensitive to close covariance eigenvalues than those of PCA. Empirically, CSTs yield stable, competitive age-prediction performance from cortical thickness across four datasets, outperforming PCA and rivaling VNNs while requiring no training.

Abstract

Machine learning and data processing techniques relying on covariance information are widespread, as they identify meaningful patterns in unsupervised and unlabeled settings. As a prominent example, Principal Component Analysis (PCA) projects data points onto the eigenvectors of their covariance matrix, capturing the directions of maximum variance. This mapping, however, falls short in two respects: it fails to capture information in low-variance directions, which matters when, e.g., the data contains high-variance noise; and it provides unstable results in low-sample regimes, especially when covariance eigenvalues are close. CoVariance Neural Networks (VNNs), i.e., graph neural networks using the covariance matrix as a graph, show improved stability to estimation errors and learn more expressive functions in the covariance spectrum than PCA, but require training and operate in a labeled setup. To get the best of both worlds, we propose Covariance Scattering Transforms (CSTs), deep untrained networks that sequentially apply filters localized in the covariance spectrum to the input data and produce expressive hierarchical representations via nonlinearities. We define the filters as covariance wavelets that capture specific and detailed covariance spectral patterns. We improve CSTs' computational and memory efficiency via a pruning mechanism, and we prove that their error due to finite-sample covariance estimation is less sensitive to close covariance eigenvalues than that of PCA, improving their stability. Our experiments on age prediction from cortical thickness measurements on four datasets of patients with neurodegenerative diseases show that CSTs produce stable representations in low-data settings, like VNNs but without any training, and lead to predictions comparable to or better than those of more complex learning models.
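To make the pipeline described in the abstract concrete, the following is a minimal NumPy sketch of a scattering-style transform over a sample covariance matrix. It is an illustration under stated assumptions, not the authors' implementation: the filter bank (differences of dyadic heat kernels on the normalized covariance spectrum), the absolute-value nonlinearity, the mean aggregation, and all function names (`sample_covariance`, `spectral_filter_bank`, `covariance_scattering`) are hypothetical choices, and the pruning mechanism is omitted.

```python
import numpy as np


def sample_covariance(X):
    """Sample covariance of X with shape (T, N): T samples, N features."""
    Xc = X - X.mean(axis=0, keepdims=True)
    return Xc.T @ Xc / (X.shape[0] - 1)


def spectral_filter_bank(C, num_scales=3):
    """Band-pass filters H_j = V g_j(Lambda) V^T acting in the covariance spectrum.

    Assumption: g_j is a difference of dyadic heat kernels,
    g_j(lam) = exp(-2^(j-1) * lam) - exp(-2^j * lam), evaluated on the
    eigenvalues rescaled to [0, 1]. The paper's covariance wavelets may differ.
    """
    lam, V = np.linalg.eigh(C)
    lam = np.clip(lam, 0.0, None)
    lam = lam / max(lam.max(), 1e-12)  # rescale the spectrum to [0, 1]
    filters = []
    for j in range(1, num_scales + 1):
        g = np.exp(-(2.0 ** (j - 1)) * lam) - np.exp(-(2.0 ** j) * lam)
        filters.append((V * g) @ V.T)  # V diag(g) V^T
    return filters


def covariance_scattering(x, filters, num_layers=2):
    """Untrained scattering tree: filter, take |.|, recurse, average each node."""
    layer = [x]
    coeffs = [x.mean()]  # layer-0 coefficient
    for _ in range(num_layers):
        # Every node signal spawns one child per filter in the bank.
        layer = [np.abs(H @ u) for u in layer for H in filters]
        coeffs.extend(u.mean() for u in layer)
    return np.array(coeffs)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(50, 6))  # synthetic data: 50 samples, 6 features
    filters = spectral_filter_bank(sample_covariance(X), num_scales=3)
    embedding = covariance_scattering(X[0], filters, num_layers=2)
    print(embedding.shape)
```

With 3 filters and 2 layers the tree has 1 + 3 + 9 = 13 nodes, so each input sample maps to 13 coefficients. Without pruning the count grows geometrically with depth, which is the cost the pruning mechanism mentioned in the abstract is meant to reduce. Because each filter is a function of the covariance matrix and the aggregation is a mean over features, reordering the features leaves this sketch's output unchanged.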
