Causal Context Adjustment Loss for Learned Image Compression

Minghao Han; Shiyin Jiang; Shengxi Li; Xin Deng; Mai Xu; Ce Zhu; Shuhang Gu

Causal Context Adjustment Loss for Learned Image Compression

Minghao Han, Shiyin Jiang, Shengxi Li, Xin Deng, Mai Xu, Ce Zhu, Shuhang Gu

TL;DR

This paper makes the first attempt in investigating the way to explicitly adjust the causal context with the proposed Causal Context Adjustment loss (CCA-loss), and enables the neural network to spontaneously adjust important information into the early stage of the autoregressive entropy model.

Abstract

In recent years, learned image compression (LIC) technologies have surpassed conventional methods notably in terms of rate-distortion (RD) performance. Most present learned techniques are VAE-based with an autoregressive entropy model, which obviously promotes the RD performance by utilizing the decoded causal context. However, extant methods are highly dependent on the fixed hand-crafted causal context. The question of how to guide the auto-encoder to generate a more effective causal context benefit for the autoregressive entropy models is worth exploring. In this paper, we make the first attempt in investigating the way to explicitly adjust the causal context with our proposed Causal Context Adjustment loss (CCA-loss). By imposing the CCA-loss, we enable the neural network to spontaneously adjust important information into the early stage of the autoregressive entropy model. Furthermore, as transformer technology develops remarkably, variants of which have been adopted by many state-of-the-art (SOTA) LIC techniques. The existing computing devices have not adapted the calculation of the attention mechanism well, which leads to a burden on computation quantity and inference latency. To overcome it, we establish a convolutional neural network (CNN) image compression model and adopt the unevenly channel-wise grouped strategy for high efficiency. Ultimately, the proposed CNN-based LIC network trained with our Causal Context Adjustment loss attains a great trade-off between inference latency and rate-distortion performance.

Causal Context Adjustment Loss for Learned Image Compression

TL;DR

Abstract

Paper Structure (33 sections, 8 equations, 9 figures, 3 tables)

This paper contains 33 sections, 8 equations, 9 figures, 3 tables.

Introduction
Related Works
Auto-Encoder Architectures for Learned Image Compression
Entropy Models for Learned Image Compression
Preliminary: Learned Image Compression with Variational Auto-Encoder
Variational Auto-Encoder based Image Compression Framework.
Entropy Model with Hyperprior.
Autoregressive Entropy Model.
Causal Context Adjustment for Efficient Learned Image Compression
Causal Context Adjustment
Training Compression Network with CCA Loss
Auto-Encoder Architecture
Channel-wise Unevenly Grouped Entropy Model
Overall Loss Function
Experiments
...and 18 more sections

Figures (9)

Figure 1: Left: A systematic overview of our method. We adopt the VAE-based framework balle2017endtoend with hyperprior balle2018variational and channel-wise autoregressive entropy model minnen2018joint; besides the original Rate-Distortion loss ($\mathcal{L}_R$, $\mathcal{L}_D$), we introduce an auxiliary entropy model and propose the causal context adjustment loss ($\mathcal{L}_{CCA}$) for better training the entropy model. Right: An illustration of the entropy model and the auxiliary entropy model. The auxiliary entropy model does not use the information to be encoded to predict the following representations, our $\mathcal{L}_{CCA}$ encourage the predicting gap between the two models, so as to enhance the importance of causal context in early stages.
Figure 2: The comparison of averaged information distributed ratios of various models in Table \ref{['ablation1']}.
Figure 3: Rate-Distortion performance evaluation of PSNR on Kodak dataset (left), CLIC Professional Validation dataset (middle), Tecnick dataset (right), respectively.
Figure 4: Visualization of the reconstructed images (top: kodim19, middle: kodim10, bottom: kodim4) from Kodak dataset. The titles under the sub-figures are represented as [bpp | PSNR(dB)].
Figure 5: Architecture of NAF-block, Entropy Model and Latent Residual Prediction (LRP).
...and 4 more figures

Causal Context Adjustment Loss for Learned Image Compression

TL;DR

Abstract

Causal Context Adjustment Loss for Learned Image Compression

Authors

TL;DR

Abstract

Table of Contents

Figures (9)