Nested subspace learning with flags
Tom Szwagier, Xavier Pennec
TL;DR
The paper addresses the nestedness gap that arises when selecting subspace dimensions in Grassmannian-based learning. It introduces flag manifolds and the flag trick to lift fixed-dimension subspace problems into multilevel, nested formulations, yielding natural hierarchies of subspaces via the average of projection operators. The authors develop optimization on flag manifolds (steepest descent, with IRLS and Newton variants in the appendices) and demonstrate the approach on robust subspace recovery, trace ratio problems, and spectral clustering, showing improved consistency across levels and potential performance gains through ensemble-like aggregation. This framework offers a unifying, extensible path to multilevel subspace learning with interpretable hierarchies and broad applicability to ML tasks beyond PCA.
Abstract
Many machine learning methods look for low-dimensional representations of the data. The underlying subspace can be estimated by first choosing a dimension $q$ and then optimizing a certain objective function over the space of $q$-dimensional subspaces (the Grassmannian). Trying different $q$ yields in general non-nested subspaces, which raises an important issue of consistency between the data representations. In this paper, we propose a simple and easily implementable principle to enforce nestedness in subspace learning methods. It consists in lifting Grassmannian optimization criteria to flag manifolds (the space of nested subspaces of increasing dimension) via nested projectors. We apply the flag trick to several classical machine learning methods and show that it successfully addresses the nestedness issue.
