Perceptual-oriented Learned Image Compression with Dynamic Kernel
Nianxiang Fu, Junxi Zhang, Huairui Wang, Zhenzhong Chen
TL;DR
PO-DKIC addresses the need for perceptually faithful image compression under bitrate constraints. It builds on DKIC by introducing a dynamic residual block group with content-adaptive kernels and an asymmetric space-channel entropy model, plus a GAN-based perceptual loss via PatchGAN and LPIPS. It formulates a constrained optimization using multiple lambda-trained models and linear integer programming to allocate bitrate for maximum perceptual quality. Experiments on standard benchmarks show superior perceptual quality and texture fidelity compared with state-of-the-art methods.
Abstract
In this paper, we extend our prior research named DKIC and propose the perceptual-oriented learned image compression method, PO-DKIC. Specifically, DKIC adopts a dynamic kernel-based dynamic residual block group to enhance the transform coding and an asymmetric space-channel context entropy model to facilitate the estimation of gaussian parameters. Based on DKIC, PO-DKIC introduces PatchGAN and LPIPS loss to enhance visual quality. Furthermore, to maximize the overall perceptual quality under a rate constraint, we formulate this challenge into a constrained programming problem and use the Linear Integer Programming method for resolution. The experiments demonstrate that our proposed method can generate realistic images with richer textures and finer details when compared to state-of-the-art image compression techniques.
