Supervised Quadratic Feature Analysis: Information Geometry Approach for Dimensionality Reduction
Daniel Herrera-Esposito, Johannes Burge
TL;DR
The paper introduces Supervised Quadratic Feature Analysis (SQFA), a linear dimensionality-reduction method that learns filters by maximizing Fisher-Rao distances between class-conditional Gaussians, offering a geometric alternative to traditional dissimilarity measures. By leveraging exact results for zero-mean Gaussians and a Calvo-Oller bound for general Gaussians, SQFA delivers discriminative low-dimensional features that support strong quadratic discriminability with QDA and competitive benchmarks on real datasets. The approach highlights the utility of information geometry in ML and neuroscience, providing a practical, efficient framework with potential extensions to non-Gaussian and nonlinear regimes. The work also demonstrates how comparing Fisher-Rao, Bhattacharyya, and Hellinger distances sheds light on the behavior of multiclass discriminability and model robustness.
Abstract
Supervised dimensionality reduction maps labeled data into a low-dimensional feature space while preserving class discriminability. A common approach is to maximize a statistical measure of dissimilarity between classes in the feature space. Information geometry provides an alternative framework for measuring class dissimilarity, with the potential for improved insights and novel applications. Information geometry, which is grounded in Riemannian geometry, uses the Fisher information metric, a local measure of discriminability that induces the Fisher-Rao distance. Here, we present Supervised Quadratic Feature Analysis (SQFA), a linear dimensionality reduction method that maximizes Fisher-Rao distances between class-conditional distributions, under Gaussian assumptions. We motivate the Fisher-Rao distance as a good proxy for discriminability. We show that SQFA features support good classification performance with Quadratic Discriminant Analysis (QDA) on three real-world datasets. SQFA provides a novel framework for supervised dimensionality reduction, motivating future research in applying information geometry to machine learning and neuroscience.
