Neural Image Compression with Quantization Rectifier
Wei Luo, Bo Chen
TL;DR
This work tackles the quantization-induced loss of feature expressiveness in neural image compression by introducing a Quantization Rectifier (QR) that predicts unquantized features from quantized ones using spatial correlations. A soft-to-predictive (STP) training procedure enables seamless integration of QR with existing neural codecs, requiring no major changes to encoders or decoders. Across four baseline architectures, QR yields consistent gains in PSNR and MS-SSIM on Kodak without increasing bitrate, and it can reduce quantization error significantly (up to ~38% in some settings). The method is lightweight, incurring only modest runtime overhead, and it broadens the practical impact of neural compression by reliably improving image quality with minimal computational cost.
Abstract
Neural image compression has been shown to outperform traditional image codecs in terms of rate-distortion performance. However, quantization introduces errors in the compression process, which can degrade the quality of the compressed image. Existing approaches address the train-test mismatch problem incurred during quantization, the random impact of quantization on the expressiveness of image features is still unsolved. This paper presents a novel quantization rectifier (QR) method for image compression that leverages image feature correlation to mitigate the impact of quantization. Our method designs a neural network architecture that predicts unquantized features from the quantized ones, preserving feature expressiveness for better image reconstruction quality. We develop a soft-to-predictive training technique to integrate QR into existing neural image codecs. In evaluation, we integrate QR into state-of-the-art neural image codecs and compare enhanced models and baselines on the widely-used Kodak benchmark. The results show consistent coding efficiency improvement by QR with a negligible increase in the running time.
