Interleaved Block-based Learned Image Compression with Feature Enhancement and Quantization Error Compensation
Shiqi Jiang, Hui Yuan, Shuai Li, Raouf Hamzaoui, Xu Wang, Junyan Huo
TL;DR
This paper tackles the inefficiencies of learned image compression (LIC) by mitigating quantization-induced information loss and shrinking latent representations. It introduces four plug-and-play modules—Feature Extraction Module (FExM), Feature Refinement Module (FRM), Feature Enhancement Module (FEnM), and Quantization Error Compensation Module (QECM)—into a Tiny-LIC backbone to improve rate-distortion performance, leveraging pixel shuffling, 3D concatenated residuals, Fourier-series quantization compensation, and a FE loss. The approach achieves notable RD gains on Kodak and CLIC datasets, outperforming traditional standards like H.266/VVC and several LIC baselines in PSNR and MS-SSIM metrics, while maintaining reasonable complexity. Overall, the work offers a flexible, modular framework that enhances LIC via targeted feature manipulation and quantization error handling, with potential extensions to lighter architectures and other 3D tasks.
Abstract
In recent years, learned image compression (LIC) methods have achieved significant performance improvements. However, obtaining a more compact latent representation and reducing the impact of quantization errors remain key challenges in the field of LIC. To address these challenges, we propose a feature extraction module, a feature refinement module, and a feature enhancement module. Our feature extraction module shuffles the pixels in the image, splits the resulting image into sub-images, and extracts coarse features from the sub-images. Our feature refinement module stacks the coarse features and uses an attention refinement block composed of concatenated three-dimensional convolution residual blocks to learn more compact latent features by exploiting correlations across channels, within sub-images (intra-sub-image correlations), and across sub-images (inter-sub-image correlations). Our feature enhancement module reduces information loss in the decoded features following quantization. We also propose a quantization error compensation module that mitigates the quantization mismatch between training and testing. Our four modules can be readily integrated into state-of-the-art LIC methods. Experiments show that combining our modules with Tiny-LIC outperforms existing LIC methods and image compression standards in terms of peak signal-to-noise ratio (PSNR) and multi-scale structural similarity (MS-SSIM) on the Kodak dataset and the CLIC dataset.
