Deeply-Conditioned Image Compression via Self-Generated Priors

Zhineng Zhao; Zhihai He; Zikun Zhou; Siwei Ma; Yaowei Wang

Deeply-Conditioned Image Compression via Self-Generated Priors

Zhineng Zhao, Zhihai He, Zikun Zhou, Siwei Ma, Yaowei Wang

TL;DR

The paper tackles geometric deformation and entanglement in learned image compression by introducing DCIC-sgp, which first learns a potent self-generated structure prior and then deeply conditions the entire compression pipeline. This explicit functional decomposition decouples the stable structural backbone from transient textures, enabling the analysis transform to focus on residual details and providing global context to the entropy model. Empirical results show substantial BD-rate savings against VTM-12.1 (up to around 15%), improved structure preservation at low bitrates, and strong generalization to medical image domains, demonstrating practical impact for high-fidelity, low-rate compression. Overall, the work presents a principled, deeply conditioned, internally guided compression framework that achieves state-of-the-art efficiency without prohibitive computation, paving the way for extensions to video and 3D data.

Abstract

Learned image compression (LIC) has shown great promise for achieving high rate-distortion performance. However, current LIC methods are often limited in their capability to model the complex correlation structures inherent in natural images, particularly the entanglement of invariant global structures with transient local textures within a single monolithic representation. This limitation precipitates severe geometric deformation at low bitrates. To address this, we introduce a framework predicated on functional decomposition, which we term Deeply-Conditioned Image Compression via self-generated priors (DCIC-sgp). Our central idea is to first encode a potent, self-generated prior to encapsulate the image's structural backbone. This prior is subsequently utilized not as mere side-information, but to holistically modulate the entire compression pipeline. This deep conditioning, most critically of the analysis transform, liberates it to dedicate its representational capacity to the residual, high-entropy details. This hierarchical, dependency-driven approach achieves an effective disentanglement of information streams. Our extensive experiments validate this assertion; visual analysis demonstrates that our method substantially mitigates the geometric deformation artifacts that plague conventional codecs at low bitrates. Quantitatively, our framework establishes highly competitive performance, achieving significant BD-rate reductions of 14.4%, 15.7%, and 15.1% against the VVC test model VTM-12.1 on the Kodak, CLIC, and Tecnick datasets.

Deeply-Conditioned Image Compression via Self-Generated Priors

TL;DR

Abstract

Deeply-Conditioned Image Compression via Self-Generated Priors

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)