Gaussian Splatting-based Low-Rank Tensor Representation for Multi-Dimensional Image Recovery
Yiming Zeng, Xi-Le Zhao, Wei-Hao Wu, Teng-Yu Ji, Chao Wang
TL;DR
This work addresses the limitations of tensor-SVD approaches in multi-dimensional image recovery, notably the coarse latent-tensor approximation and fixed, data-agnostic transform bases that fail to capture local high-frequency information. It introduces Gaussian Splatting-based Low-rank Tensor Representation (GSLR), which jointly models a latent tensor $\mathcal{A}$ via 2D Gaussian splatting and a transform matrix $\mathbf{T}$ via 1D Gaussian splatting, yielding a continuous representation $\mathcal{X}=\mathcal{A} \times_3 \mathbf{T}$. The framework includes a principled unsupervised recovery model with a nuclear-norm regularization on frontal slices, and demonstrates, through extensive experiments on color images and multispectral images under random, tube, and slice missing patterns, that GSLR outperforms state-of-the-art t-SVD-based methods, particularly in preserving local high-frequency details. Theoretical connections show that GSLR generalizes classical t-SVD under degenerate settings, highlighting its stronger representational capacity and practical impact for high-fidelity multi-dimensional image recovery and related tasks.
Abstract
Tensor singular value decomposition (t-SVD) is a promising tool for multi-dimensional image representation, which decomposes a multi-dimensional image into a latent tensor and an accompanying transform matrix. However, two critical limitations of t-SVD methods persist: (1) the approximation of the latent tensor (e.g., tensor factorizations) is coarse and fails to accurately capture spatial local high-frequency information; (2) The transform matrix is composed of fixed basis atoms (e.g., complex exponential atoms in DFT and cosine atoms in DCT) and cannot precisely capture local high-frequency information along the mode-3 fibers. To address these two limitations, we propose a Gaussian Splatting-based Low-rank tensor Representation (GSLR) framework, which compactly and continuously represents multi-dimensional images. Specifically, we leverage tailored 2D Gaussian splatting and 1D Gaussian splatting to generate the latent tensor and transform matrix, respectively. The 2D and 1D Gaussian splatting are indispensable and complementary under this representation framework, which enjoys a powerful representation capability, especially for local high-frequency information. To evaluate the representation ability of the proposed GSLR, we develop an unsupervised GSLR-based multi-dimensional image recovery model. Extensive experiments on multi-dimensional image recovery demonstrate that GSLR consistently outperforms state-of-the-art methods, particularly in capturing local high-frequency information.
