Continuous Patch Stitching for Block-wise Image Compression
Zifu Zhang, Shengxi Li, Henan Liu, Mai Xu, Ce Zhu
TL;DR
The paper tackles block artefacts and high memory costs in learned block-wise image compression by introducing Continuous Patch Stitching (CPS), a padding-free framework that processes image patches in parallel while guaranteeing full-image equivalence. CPS relies on the Parallel Overlapping Patch Stitching (POPS) strategy to mathematically bound overlaps and ensure continuity, aided by padding-free residual blocks with even-sized kernels (EDRB/EURB) and a bottleneck residual block (BRB) to deepen the network without size mismatches. An entropy-model-based compression framework with joint autoregressive and hierarchical priors underpins the rate-distortion optimization, achieving state-of-the-art performance with significantly reduced memory, including successful ultra-high-resolution compression under 4GB. The approach yields complete elimination of block artefacts without post-processing and offers practical benefits for scalable distributed vision tasks and high-resolution media compression.
Abstract
Most recently, learned image compression methods have outpaced traditional hand-crafted standard codecs. However, their inference typically requires to input the whole image at the cost of heavy computing resources, especially for high-resolution image compression; otherwise, the block artefact can exist when compressed by blocks within existing learned image compression methods. To address this issue, we propose a novel continuous patch stitching (CPS) framework for block-wise image compression that is able to achieve seamlessly patch stitching and mathematically eliminate block artefact, thus capable of significantly reducing the required computing resources when compressing images. More specifically, the proposed CPS framework is achieved by padding-free operations throughout, with a newly established parallel overlapping stitching strategy to provide a general upper bound for ensuring the continuity. Upon this, we further propose functional residual blocks with even-sized kernels to achieve down-sampling and up-sampling, together with bottleneck residual blocks retaining feature size to increase network depth. Experimental results demonstrate that our CPS framework achieves the state-of-the-art performance against existing baselines, whilst requiring less than half of computing resources of existing models. Our code shall be released upon acceptance.
