Synonymous Variational Inference for Perceptual Image Compression
Zijian Liang, Kai Niu, Changshuo Wang, Jin Xu, Ping Zhang
TL;DR
Perceptual image compression is analyzed from a semantic-information perspective using Synonymous Variational Inference (SVI). The method introduces Synonymous Image Compression (SIC), a codec that encodes only the synonymous latent representation and samples detailed content to generate multiple perceptually similar images. The authors prove that the optimization direction corresponds to a synonymous rate-distortion-perception tradeoff and implement a progressive SIC codec that achieves competitive RD-P and perceptual performance against established PIC methods. This work provides a unified theoretical framework for semantic information in image coding and points to future enhancements using adversarial losses.
Abstract
Recent contributions of semantic information theory reveal the set-element relationship between semantic and syntactic information, represented as synonymous relationships. In this paper, we propose a synonymous variational inference (SVI) method based on this synonymity viewpoint to re-analyze the perceptual image compression problem. It takes perceptual similarity as a typical synonymous criterion to build an ideal synonymous set (Synset), and approximate the posterior of its latent synonymous representation with a parametric density by minimizing a partial semantic KL divergence. This analysis theoretically proves that the optimization direction of perception image compression follows a triple tradeoff that can cover the existing rate-distortion-perception schemes. Additionally, we introduce synonymous image compression (SIC), a new image compression scheme that corresponds to the analytical process of SVI, and implement a progressive SIC codec to fully leverage the model's capabilities. Experimental results demonstrate comparable rate-distortion-perception performance using a single progressive SIC codec, thus verifying the effectiveness of our proposed analysis method.
