Manifold Learning for Hyperspectral Images
Fethi Harkat, Guillaume Gey, Valérie Perrier, Kévin Polisano, Tiphaine Deuberet
TL;DR
The paper tackles the difficulty of representing X-ray transmission hyperspectral images with traditional linear methods by leveraging topology-preserving non-linear embeddings. It introduces a Parametric UMAP pipeline to map data from $[H,W,C]$ to a lower-dimensional $[H,W,D]$ while retaining intrinsic structure, subsequently feeding embeddings into CNNs for segmentation, regression, and classification. Across Cigarettes, Stones, and Indian Pines experiments, UMAP-based representations consistently outperform PCA, NMF, and raw spectra, demonstrating improved feature separability, robustness, and efficiency. The work highlights the potential of topology-aware analysis in hyperspectral XRT data and advocates further exploration of topological data analysis to enhance practical performance in real-world imaging applications.
Abstract
Traditional feature extraction and projection techniques, such as Principal Component Analysis, struggle to adequately represent X-Ray Transmission (XRT) Multi-Energy (ME) images, limiting the performance of neural networks in decision-making processes. To address this issue, we propose a method that approximates the dataset topology by constructing adjacency graphs using the Uniform Manifold Approximation and Projection. This approach captures nonlinear correlations within the data, significantly improving the performance of machine learning algorithms, particularly in processing Hyperspectral Images (HSI) from X-ray transmission spectroscopy. This technique not only preserves the global structure of the data but also enhances feature separability, leading to more accurate and robust classification results.
