A Randomized Algorithm for Tensor Singular Value Decomposition using an Arbitrary Number of Passes
Salman Ahmadi-Asl, Anh-Huy Phan, Andrzej Cichocki
TL;DR
The paper tackles the cost of computing tensor SVD (t-SVD) for large-scale data by enabling an arbitrary budget of data passes, addressing the limitation that traditional randomized methods require $2q+2$ passes. It extends pass-efficient randomized matrix techniques to the tensor setting via the t-product, introducing a pass-efficient truncated t-SVD that works for any budget $v$, including odd values, and provides average Frobenius error bounds that depend on the spectral gap $\tau_R = \sigma_{R+1}/\sigma_R$. The authors derive theoretical guarantees and demonstrate practical performance through simulations on synthetic and real data, showing notable speedups over baseline methods while maintaining competitive accuracy, and they apply the approach to tensor completion tasks. This work enables scalable, flexible tensor decompositions for large-scale data and repeated analyses in applications such as image/video completion and reconstruction.
Abstract
Efficient and fast computation of a tensor singular value decomposition (t-SVD) with a few passes over the underlying data tensor is crucial because of its many potential applications. The current/existing subspace randomized algorithms need (2q+2) passes over the data tensor to compute a t-SVD, where q is a non-negative integer number (power iteration parameter). In this paper, we propose an efficient and flexible randomized algorithm that can handle any number of passes q, which not necessary need be even. The flexibility of the proposed algorithm in using fewer passes naturally leads to lower computational and communication costs. This advantage makes it particularly appropriate when our task calls for several tensor decompositions or when the data tensors are huge. The proposed algorithm is a generalization of the methods developed for matrices to tensors. The expected/ average error bound of the proposed algorithm is derived. Extensive numerical experiments on random and real-world data sets are conducted, and the proposed algorithm is compared with some baseline algorithms. The extensive computer simulation experiments demonstrate that the proposed algorithm is practical, efficient, and in general outperforms the state of the arts algorithms. We also demonstrate how to use the proposed method to develop a fast algorithm for the tensor completion problem.
