MTS-CSNet: Multiscale Tensor Factorization for Deep Compressive Sensing on RGB Images
Mehmet Yamac, Lei Xu, Serkan Kiranyaz, Moncef Gabbouj
TL;DR
MTS-CSNet addresses the inefficiency of block-wise sensing in compressive sensing for high-dimensional RGB images by introducing a learnable Multiscale Tensor Summation (MTS) operator. The framework performs end-to-end, non-iterative reconstruction in tensor space, using an MTS-based sensing layer, a nonlinear adjoint back-projection, and a lightweight MTSNet refinement. Key contributions include the MTS layer formulation with multiscale patch processing, a nonlinear adjoint back-projection via the MHG prior, and a compact refinement network that achieves state-of-the-art PSNR/SSIM with substantially lower computation than diffusion-based CS methods. Experiments on Urban100 and DIV2K demonstrate superior reconstruction quality and faster inference, validating the effectiveness and efficiency of multiscale tensor operators for high-dimensional CS. The work suggests that structured tensor factorizations can jointly model spatial and channel correlations across scales, enabling scalable CS for multidimensional data.
Abstract
Deep learning based compressive sensing (CS) methods typically learn sampling operators using convolutional or block wise fully connected layers, which limit receptive fields and scale poorly for high dimensional data. We propose MTSCSNet, a CS framework based on Multiscale Tensor Summation (MTS) factorization, a structured operator for efficient multidimensional signal processing. MTS performs mode-wise linear transformations with multiscale summation, enabling large receptive fields and effective modeling of cross-dimensional correlations. In MTSCSNet, MTS is first used as a learnable CS operator that performs linear dimensionality reduction in tensor space, with its adjoint defining the initial back-projection, and is then applied in the reconstruction stage to directly refine this estimate. This results in a simple feed-forward architecture without iterative or proximal optimization, while remaining parameter and computation efficient. Experiments on standard CS benchmarks show that MTSCSNet achieves state-of-the-art reconstruction performance on RGB images, with notable PSNR gains and faster inference, even compared to recent diffusion-based CS methods, while using a significantly more compact feed-forward architecture.
