Incremental Hierarchical Tucker Decomposition
Doruk Aksoy, Alex A. Gorodetsky
TL;DR
The paper tackles online, batch-aware tensor decomposition for streaming data by introducing Batch Hierarchical Tucker (BHT-l2r) and HT-RISE, the first incremental algorithm for the hierarchical Tucker format. BHT-l2r compresses entire batches by absorbing batch mode into the HT root, with provable error bounds; HT-RISE incrementally updates the HT representation as new batches arrive through projected residual expansions, preserving past reconstructions. Across scientific (PDE and gel simulations) and image (Minecraft frames and multispectral imagery) datasets, HT-RISE delivers competitive or superior relative test error and faster updates, while BHT-l2r achieves notable reductions in storage and runtime compared to standard HT. The work demonstrates that a batch-aware HT framework can yield expressive latent representations with practical online applicability and scalability for high-dimensional data.
Abstract
We present two new algorithms for approximating and updating the hierarchical Tucker decomposition of tensor streams. The first algorithm, Batch Hierarchical Tucker - leaf to root (BHT-l2r), proposes an alternative and more efficient way of approximating a batch of similar tensors in hierarchical Tucker format. The second algorithm, Hierarchical Tucker - Rapid Incremental Subspace Expansion (HT-RISE), updates the batch hierarchical Tucker representation of an accumulated tensor as new batches of tensors become available. The HT-RISE algorithm is suitable for the online setting and never requires full storage or reconstruction of all data while providing a solution to the incremental Tucker decomposition problem. We provide theoretical guarantees for both algorithms and demonstrate their effectiveness on physical and cyber-physical data. The proposed BHT-l2r algorithm and the batch hierarchical Tucker format offers up to $6.2\times$ compression and $3.7\times$ reduction in time over the hierarchical Tucker format. The proposed HT-RISE algorithm also offers up to $3.1\times$ compression and $3.2\times$ reduction in time over a state of the art incremental tensor train decomposition algorithm.
