Table of Contents
Fetching ...

Fun with Flags: Robust Principal Directions via Flag Manifolds

Nathan Mankovich, Gustau Camps-Valls, Tolga Birdal

TL;DR

This work introduces a unifying flag-manifold framework for principal directions that subsumes PCA, RPCA, DPCA, PGA, and tangent-PCA, enabling a single Stiefel-optimization solver on $St(n_k,n)$ to compute nested subspaces represented as flags on $ ext{FL}(\

Abstract

Principal component analysis (PCA), along with its extensions to manifolds and outlier contaminated data, have been indispensable in computer vision and machine learning. In this work, we present a unifying formalism for PCA and its variants, and introduce a framework based on the flags of linear subspaces, ie a hierarchy of nested linear subspaces of increasing dimension, which not only allows for a common implementation but also yields novel variants, not explored previously. We begin by generalizing traditional PCA methods that either maximize variance or minimize reconstruction error. We expand these interpretations to develop a wide array of new dimensionality reduction algorithms by accounting for outliers and the data manifold. To devise a common computational approach, we recast robust and dual forms of PCA as optimization problems on flag manifolds. We then integrate tangent space approximations of principal geodesic analysis (tangent-PCA) into this flag-based framework, creating novel robust and dual geodesic PCA variations. The remarkable flexibility offered by the 'flagification' introduced here enables even more algorithmic variants identified by specific flag types. Last but not least, we propose an effective convergent solver for these flag-formulations employing the Stiefel manifold. Our empirical results on both real-world and synthetic scenarios, demonstrate the superiority of our novel algorithms, especially in terms of robustness to outliers on manifolds.

Fun with Flags: Robust Principal Directions via Flag Manifolds

TL;DR

This work introduces a unifying flag-manifold framework for principal directions that subsumes PCA, RPCA, DPCA, PGA, and tangent-PCA, enabling a single Stiefel-optimization solver on to compute nested subspaces represented as flags on $ ext{FL}(\

Abstract

Principal component analysis (PCA), along with its extensions to manifolds and outlier contaminated data, have been indispensable in computer vision and machine learning. In this work, we present a unifying formalism for PCA and its variants, and introduce a framework based on the flags of linear subspaces, ie a hierarchy of nested linear subspaces of increasing dimension, which not only allows for a common implementation but also yields novel variants, not explored previously. We begin by generalizing traditional PCA methods that either maximize variance or minimize reconstruction error. We expand these interpretations to develop a wide array of new dimensionality reduction algorithms by accounting for outliers and the data manifold. To devise a common computational approach, we recast robust and dual forms of PCA as optimization problems on flag manifolds. We then integrate tangent space approximations of principal geodesic analysis (tangent-PCA) into this flag-based framework, creating novel robust and dual geodesic PCA variations. The remarkable flexibility offered by the 'flagification' introduced here enables even more algorithmic variants identified by specific flag types. Last but not least, we propose an effective convergent solver for these flag-formulations employing the Stiefel manifold. Our empirical results on both real-world and synthetic scenarios, demonstrate the superiority of our novel algorithms, especially in terms of robustness to outliers on manifolds.
Paper Structure (39 sections, 8 theorems, 61 equations, 14 figures, 6 tables, 4 algorithms)

This paper contains 39 sections, 8 theorems, 61 equations, 14 figures, 6 tables, 4 algorithms.

Key Result

Proposition 1

Setting $1\leq q<2$, gives us novel, robust formulations of the PGA problem (RPGA and WPGA) defined in dfn:pgagen, which we will solve in the unifying flag framework we provide. While general robust manifold-optimizers such as robust median-of-meanslin2020robust can be used to implement RPGA and WPG

Figures (14)

  • Figure 1: \ref{['alg: flagified pca']} converges faster to more optimal cost values compared to Stiefel CGD or Flag RTR.
  • Figure 2: Average ROC curves over five trials of outlier samples for UCMercedLandUse (top) and YaleFaceDB-B (bottom). All data is reshaped and projected to $\mathbb{R}^{50}$ before outlier detection.
  • Figure 3: $50$ random initializations of f$\mathcal{T}$PCA variations. The blue line is the mean and the shaded region is the standard deviation. The $x$-axis of this plot is the number of iterations of \ref{['alg: flagified pca']} performed in the tangent space.
  • Figure 4: AUC of different algorithms for outlier detection using the first $k=2$ principal directions of outlier-contaminated data on $Gr(2,4)$. All iterative variants are optimized with $100$ max. iters.
  • Figure 5: Mean AUC for outlier predictions using the first $k=4$ principal directions where we gradually add outlier ellipses to the $2$D Hands dataset. The mean is over $20$ trials of adding outliers.
  • ...and 9 more figures

Theorems & Definitions (37)

  • Definition 1: Riemannian manifold lee2006riemannian
  • Definition 2: Geodesics & Exp/Log-maps lee2006riemannian
  • Definition 3: Flag
  • Definition 4: Flag manifold
  • Definition 5: PCA hotelling1933analysis
  • Definition 6: Generalized PCA
  • Definition 7: Dual-PCA vidal2018dpcp
  • Definition 8: Dual Principal Component Pursuit (DPCP)
  • Definition 9: Dual PCA Generalizations
  • Remark 1: New DPCP Variants
  • ...and 27 more