GT-PCA: Effective and Interpretable Dimensionality Reduction with General Transform-Invariant Principal Component Analysis
Florian Heinrichs
TL;DR
GT-PCA is an interpretable extension of PCA whose components are invariant to problem-dependent transformations; they are obtained by solving a transform-invariant energy maximization problem. A neural network layer (the GT-PC-layer) approximates these components and allows multiple GT-PCs to be estimated sequentially, for example under time shifts or image rotations. In experiments on synthetic and real data with nontrivial transformations, GT-PCA outperforms PCA, KPCA, autoencoders, and VAEs while retaining interpretability similar to that of functional PCA. The approach offers a principled, interpretable dimensionality reduction tool with potential applications in EEG analysis, computer vision, and functional time series.
Abstract
Data analysis often requires methods that are invariant with respect to specific transformations, such as rotations in the case of images or shifts in the case of images and time series. While principal component analysis (PCA) is a widely used dimension reduction technique, it lacks robustness with respect to these transformations. Modern alternatives, such as autoencoders, can be invariant with respect to specific transformations but are generally not interpretable. We introduce General Transform-Invariant Principal Component Analysis (GT-PCA) as an effective and interpretable alternative to PCA and autoencoders. We propose a neural network that efficiently estimates the components and show that GT-PCA significantly outperforms alternative methods in experiments based on synthetic and real data.
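
To make the lack of shift robustness concrete, the following toy sketch compares ordinary PCA with a shift-invariant energy on synthetic data. It is an illustration under stated assumptions, not the estimator proposed in the paper: the objective used here (the mean over samples of the maximum over circular shifts of the squared projection) is one plausible reading of "transform-invariant energy", the candidate component is the known generating template rather than something estimated by a GT-PC-layer, and all function and variable names are hypothetical.

```python
import numpy as np


def plain_energy(X, w):
    """Mean squared projection of the rows of X onto the unit vector w."""
    return np.mean((X @ w) ** 2)


def shift_invariant_energy(X, w):
    """Mean over samples of max over circular shifts s of <x, roll(w, s)>^2.

    Assumed objective for illustration only; the paper's exact definition
    is not reproduced here.
    """
    n = X.shape[1]
    # Row s holds w circularly shifted by s positions.
    shifted = np.stack([np.roll(w, s) for s in range(n)])
    proj = X @ shifted.T  # shape: (n_samples, n_shifts)
    return np.mean(np.max(proj ** 2, axis=1))


rng = np.random.default_rng(0)
n = 64
t = np.arange(n)

# Unit-norm Gaussian pulse used as the (assumed known) template.
template = np.exp(-0.5 * ((t - n // 2) / 2.0) ** 2)
template /= np.linalg.norm(template)

# Each sample is the template with a random amplitude and a random shift.
amps = rng.normal(1.0, 0.3, size=200)
offsets = rng.integers(0, n, size=200)
X = np.stack([a * np.roll(template, k) for a, k in zip(amps, offsets)])

# First (uncentered) principal direction from the SVD of the data matrix.
_, _, vt = np.linalg.svd(X, full_matrices=False)
pc1 = vt[0]

total = np.mean(np.sum(X ** 2, axis=1))
pca_ratio = plain_energy(X, pc1) / total
inv_ratio = shift_invariant_energy(X, template) / total

print(f"energy captured by first PC:            {pca_ratio:.3f}")
print(f"shift-invariant energy of the template: {inv_ratio:.3f}")
```

In this noise-free setup the shift-invariant ratio equals 1 up to floating-point error by construction, because every sample is an exact shifted and scaled copy of the template. The first ordinary principal component captures well under half of the total energy, since the randomly placed pulses spread their energy over many orthogonal directions.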
