An Incremental Non-Linear Manifold Approximation Method
Praveen T. W. Hettige, Benjamin W. Ong
TL;DR
This work introduces Incremental GMRA, a streaming non-linear dimension reduction method based on Geometric Multi-resolution Analysis that incrementally updates a multiscale manifold approximation as new data arrive. By leveraging a Cover Tree for data partitioning and Brand-type SVD/covariance updates, the method maintains the GMRA structure and updates PCA bases and wavelet coefficients in real time, with MSE-driven cluster splitting to control model complexity. Theoretical error analyses bound the impact of incremental updates on singular values and subspace angles, while numerical experiments on Swiss Roll and intersecting-manifold scenarios demonstrate accurate, adaptive manifold representation with scalable computation. The approach enables real-time visualization and interactive graphics where high-dimensional data evolve over time, offering robust incremental learning with multiscale, non-linear structure preservation.
Abstract
Analyzing high-dimensional data presents challenges due to the "curse of dimensionality'', making computations intensive. Dimension reduction techniques, categorized as linear or non-linear, simplify such data. Non-linear methods are particularly essential for efficiently visualizing and processing complex data structures in interactive and graphical applications. This research develops an incremental non-linear dimension reduction method using the Geometric Multi-Resolution Analysis (GMRA) framework for streaming data. The proposed method enables real-time data analysis and visualization by incrementally updating the cluster map, PCA basis vectors, and wavelet coefficients. Numerical experiments show that the incremental GMRA accurately represents non-linear manifolds even with small initial samples and aligns closely with batch GMRA, demonstrating efficient updates and maintaining the multiscale structure. The findings highlight the potential of Incremental GMRA for real-time visualization and interactive graphics applications that require adaptive high-dimensional data representations.
