Topological Autoencoders++: Fast and Accurate Cycle-Aware Dimensionality Reduction
Mattéo Clémot, Julie Digne, Julien Tierny
TL;DR
TopoAE++ advances topology-aware dimensionality reduction by extending TopoAE to preserve $PH^{1}$ cycles through a cascade distortion loss that enforces isometric filling of 1-cycles. It introduces a fast planar $PH$ computation algorithm tailored to 2D embeddings, enabling practical optimization, and demonstrates a strong balance between topological accuracy and visual fidelity on synthetic and real datasets. Theoretical results clarify when $PH^{0}$ preservation guarantees hold and motivate the new $PH^{1}$-aware formulation, while empirical results show competitive $PD^{1}$ distances and improved cycle visualization compared to baselines. The work provides an open-source C++ implementation and outlines directions toward higher-dimensional latent spaces and homology levels, broadening the applicability of cycle-aware DR in complex data landscapes.
Abstract
This paper presents a novel topology-aware dimensionality reduction approach aiming at accurately visualizing the cyclic patterns present in high dimensional data. To that end, we build on the Topological Autoencoders (TopoAE) formulation. First, we provide a novel theoretical analysis of its associated loss and show that a zero loss indeed induces identical persistence pairs (in high and low dimensions) for the $0$-dimensional persistent homology (PH$^0$) of the Rips filtration. We also provide a counter example showing that this property no longer holds for a naive extension of TopoAE to PH$^d$ for $d\ge 1$. Based on this observation, we introduce a novel generalization of TopoAE to $1$-dimensional persistent homology (PH$^1$), called TopoAE++, for the accurate generation of cycle-aware planar embeddings, addressing the above failure case. This generalization is based on the notion of cascade distortion, a new penalty term favoring an isometric embedding of the $2$-chains filling persistent $1$-cycles, hence resulting in more faithful geometrical reconstructions of the $1$-cycles in the plane. We further introduce a novel, fast algorithm for the exact computation of PH for Rips filtrations in the plane, yielding improved runtimes over previously documented topology-aware methods. Our method also achieves a better balance between the topological accuracy, as measured by the Wasserstein distance, and the visual preservation of the cycles in low dimensions. Our C++ implementation is available at https://github.com/MClemot/TopologicalAutoencodersPlusPlus.
