Randomized Large-Scale Quaternion Matrix Approximation: Practical Rangefinders and One-Pass Algorithm
Chao Chang, Yuning Yang
TL;DR
The paper tackles large-scale low-rank approximation of quaternion matrices by introducing two practical rangefinders, pseudo-QR and pseudo-SVD, that operate via complex representations to accelerate quaternion computations. These rangefinders are integrated into a quaternion one-pass randomized algorithm, with deterministic and probabilistic error bounds showing the truncation error scales with the rangefinder condition κ(H) while preserving the data range. The approach delivers substantial speed-ups over existing quaternion orthonormalization-based methods and demonstrates high-accuracy, scalable compression on synthetic data, a CFD simulation, a 4D Lorenz-type chaotic system, and a large color image. The work provides rigorous QB error analyses for Gaussian and sub-Gaussian embeddings and releases MATLAB code, highlighting practical impact for large-scale quaternion data processing.
Abstract
Recently, randomized algorithms for low-rank approximation of quaternion matrices have received increasing attention. However, for large-scale problems, existing quaternion orthonormalizations are inefficient, leading to slow rangefinders. To address this, by appropriately leveraging efficient scientific computing libraries in the complex arithmetic, this work devises two practical quaternion rangefinders, one of which is non-orthonormal yet well-conditioned. They are then integrated into the quaternion version of a one-pass algorithm, which originally takes orthonormal rangefinders only. We establish the error bounds and demonstrate that the error is proportional to the condition number of the rangefinder. The probabilistic bounds are exhibited for both quaternion Gaussian and sub-Gaussian embeddings. Numerical experiments demonstrate that the one-pass algorithm with the proposed rangefinders significantly outperforms previous techniques in efficiency. Additionally, we tested the algorithm in a 3D Navier-Stokes equation ($5.22$GB) and a 4D Lorenz-type chaotic system ($5.74$GB) data compression, as well as a $31365\times 27125$ image compression to demonstrate its capability for handling large-scale applications.
