Learning Covariance-Based Multi-Scale Representation of Neuroimaging Measures for Alzheimer Classification
Seunghun Baek, Injun Choi, Mustafa Dere, Minjeong Kim, Guorong Wu, Won Hwa Kim
TL;DR
This work tackles data-scarce medical imaging by replacing deep, heavily parameterized networks with a covariance-based multi-scale transform (COVLET) that produces informative, high-dimensional representations from limited samples. By treating the transform’s scales as trainable parameters and operating in the covariance domain via eigenbasis filtering, the approach yields efficient end-to-end learning with a downstream classifier and interpretable Grad-CAM-like maps. Empirical results on ADNI MRI cortical thickness and tau PET show higher accuracy and faster convergence than baselines, particularly for cortical thickness in 2- and 4-way Alzheimer’s classification, and provide ROI insights consistent with established AD literature. The method offers practical impact by delivering strong predictive performance with smaller models and transparent, region-specific interpretations useful for clinical grounding.
Abstract
Stacking excessive layers in DNN results in highly underdetermined system when training samples are limited, which is very common in medical applications. In this regard, we present a framework capable of deriving an efficient high-dimensional space with reasonable increase in model size. This is done by utilizing a transform (i.e., convolution) that leverages scale-space theory with covariance structure. The overall model trains on this transform together with a downstream classifier (i.e., Fully Connected layer) to capture the optimal multi-scale representation of the original data which corresponds to task-specific components in a dual space. Experiments on neuroimaging measures from Alzheimer's Disease Neuroimaging Initiative (ADNI) study show that our model performs better and converges faster than conventional models even when the model size is significantly reduced. The trained model is made interpretable using gradient information over the multi-scale transform to delineate personalized AD-specific regions in the brain.
