Point-Level Topological Representation Learning on Point Clouds

Vincent P. Grande; Michael T. Schaub

Point-Level Topological Representation Learning on Point Clouds

Vincent P. Grande, Michael T. Schaub

TL;DR

TOPF introduces Topological Point Features, a framework that converts global topological information from a point cloud into per-point descriptors by projecting persistent-homology generators into the harmonic space of the Hodge Laplacian and aggregating locally. The method couples alpha- and Vietoris–Rips filtrations with harmonic projections, normalizations, and density-aware simplicial weighting to produce a |X|×|F| feature matrix suitable for downstream learning. The authors provide theoretical guarantees (e.g., a TOPF sphere-type theorem) and validate TOPF across synthetic and real data, including a new topological clustering benchmark (TCBS) where TOPF-based spectral clustering often outperforms baselines and matches higher-cost topology-aware methods. They also demonstrate robustness to noise, downsampling, and sampling heterogeneity, and show applicability to high-dimensional and latent spaces (e.g., VAE embeddings). The work advances topology-aware feature extraction for point clouds and enables scalable, interpretable integration with standard ML pipelines.

Abstract

Topological Data Analysis (TDA) allows us to extract powerful topological and higher-order information on the global shape of a data set or point cloud. Tools like Persistent Homology or the Euler Transform give a single complex description of the global structure of the point cloud. However, common machine learning applications like classification require point-level information and features to be available. In this paper, we bridge this gap and propose a novel method to extract node-level topological features from complex point clouds using discrete variants of concepts from algebraic topology and differential geometry. We verify the effectiveness of these topological point features (TOPF) on both synthetic and real-world data and study their robustness under noise and heterogeneous sampling.

Point-Level Topological Representation Learning on Point Clouds

TL;DR

Abstract

Paper Structure (66 sections, 2 theorems, 20 equations, 18 figures, 2 tables, 1 algorithm)

This paper contains 66 sections, 2 theorems, 20 equations, 18 figures, 2 tables, 1 algorithm.

Introduction
Contributions
Related Work
Organisation of the paper
Code
Main Ideas of topf
Simplicial Complexes
Vietoris--Rips and $\alpha$-complexes
Boundary matrices
Betti Numbers and Persistent Homology
The Hodge Laplacian and the Harmonic Space
How to Compute Topological Point Features
Step 1: Computing the persistent homology
Step 2: Selecting the relevant topological features
Step 3: Projecting the features into harmonic space and normalising
...and 51 more sections

Key Result

Theorem 2.5

For an sc$\mathcal{S}$ with boundary matrices $\left(\mathcal{B}_i\right)$ and Hodge Laplacians $\left(L_i\right)$, we have in every dimension $k$

Figures (18)

Figure 1: Topological point features (topf) on real-world and simulated 3d point clouds. For every point, we highlight the largest corresponding topological feature, where colour stands for different features and saturation for the value of the feature. (Cf. \ref{['fig:QualitativeExperiments']})
Figure 1: Computing Topological Point Features (topf). Input. A point cloud $X$ in $n$-dimensional space. Step 1. To extract global topological information, the persistent homology is computed on an $\alpha$/vr-filtration. The most significant topological features $\mathcal{F}$ across all specified dimensions are selected. Step 2.$k$-homology generators associated to all features $f_{i,k}\in\mathcal{F}$ are computed. For every feature, a simplicial complex is built at a step of the filtration where $f_{i,k}$ is alive. Step 3. The homology generators are projected to the harmonic space. Step 4. The vectors are normalised to obtain vectors $\mathbf{e_k^i}$ indexed over the $k$-simplices. For every point $x$ and feature $f\in\mathcal{F}$, we compute the mean of the entries of $\mathbf{e_k^i}$ corresponding to simplices containing $x$. The output is a $|X|\times|\mathcal{F}|$ matrix which can be used for downstream ml tasks. Optional. We weigh the simplicial complexes resulting in a topologically more faithful harmonic representative in Step 3.
Figure 2: ph sketch and topf pipeline applied to nalcn channelosome, a membrane protein Kschonsak:2022.Left: Bars represent life times of features grande2023nonisotropic. Centre left: Steps 1&2a, when computing persistent $1$-homology, three classes are more prominent than the rest. Centre right: Step 2b: The selected homology generators. Right: Step 3: The projections of the generators into harmonic space are now each supported on one of the rings.
Figure 3: topf on 3d real-world and synthetic point clouds. For every point, we highlight the largest corresponding topological feature, where colour stands for the different features and saturation for the value of the feature. (b): Atoms of nalcn channelosome Kschonsak:2022 display three distinct loops. (c): Points sampled in the state space of a Lorentz attractor. The two features correspond to the two lobes of the attractor. (d): Point cloud spaceship of our newly introduced topological clustering benchmark suite (See \ref{['app:BenchmarkSuite']}). (e): Latent space of a vae trained on image patches showing topological structure (See \ref{['fig:figcow']} for details).
Figure 4: topf on a high-dimensional point cloud. We used topf on $6500$ points sampled from the $24$-dimensional conformation space of cyclooctane Martin2011. We show the isomap projection from $24$ dimensions. Top: Three features automatically selected by topf. Bottom:topf-Clustering for 3 and 4 clusters. topf can correctly cluster similar points according to their topological function. Four clusters, topf can even identify the anomalous points (blue) violating the manifold structure.
...and 13 more figures

Theorems & Definitions (14)

Definition 2.1: Simplicial complexes
Definition 2.2: vr complex
Definition 2.3: Boundary matrices
Definition 2.4: Hodge Laplacian
Theorem 2.5: Hodge Decomposition Lim:2020Schaub:2021Roddenberry:2021
Definition B.1: $n$-dimensional Delaunay triangulation
Definition B.2: $\alpha^*$-complex of a point cloud
Definition B.3: Gabriel Simplex
Definition B.4: $\alpha$-complex of a point cloud, kerber20133dedelsbrunner2011alpha
Theorem C.1: Topological Point Features of Spheres
...and 4 more

Point-Level Topological Representation Learning on Point Clouds

TL;DR

Abstract

Point-Level Topological Representation Learning on Point Clouds

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (18)

Theorems & Definitions (14)