Table of Contents
Fetching ...

Lossless Image Compression Using Multi-level Dictionaries: Binary Images

Samar Agnihotri, Renu Rameshan, Ritwik Ghosal

TL;DR

It is argued that compressibility of a color image is essentially derived from the patterns in its spatial structure, intensity variations, and color variations, and it is further argued that the binarized version of an image captures its fundamental spatial structure.

Abstract

Lossless image compression is required in various applications to reduce storage or transmission costs of images, while requiring the reconstructed images to have zero information loss compared to the original. Existing lossless image compression methods either have simple design but poor compression performance, or complex design, better performance, but with no performance guarantees. In our endeavor to develop a lossless image compression method with low complexity and guaranteed performance, we argue that compressibility of a color image is essentially derived from the patterns in its spatial structure, intensity variations, and color variations. Thus, we divide the overall design of a lossless image compression scheme into three parts that exploit corresponding redundancies. We further argue that the binarized version of an image captures its fundamental spatial structure. In this first part of our work, we propose a scheme for lossless compression of binary images. The proposed scheme first learns dictionaries of $16\times16$, $8\times8$, $4\times4$, and $2\times 2$ square pixel patterns from various datasets of binary images. It then uses these dictionaries to encode binary images. These dictionaries have various interesting properties that are further exploited to construct an efficient and scalable scheme. Our preliminary results show that the proposed scheme consistently outperforms existing conventional and learning based lossless compression approaches, and provides, on average, as much as $1.5\times$ better performance than a common general purpose lossless compression scheme (WebP), more than $3\times$ better performance than a state of the art learning based scheme, and better performance than a specialized scheme for binary image compression (JBIG2).

Lossless Image Compression Using Multi-level Dictionaries: Binary Images

TL;DR

It is argued that compressibility of a color image is essentially derived from the patterns in its spatial structure, intensity variations, and color variations, and it is further argued that the binarized version of an image captures its fundamental spatial structure.

Abstract

Lossless image compression is required in various applications to reduce storage or transmission costs of images, while requiring the reconstructed images to have zero information loss compared to the original. Existing lossless image compression methods either have simple design but poor compression performance, or complex design, better performance, but with no performance guarantees. In our endeavor to develop a lossless image compression method with low complexity and guaranteed performance, we argue that compressibility of a color image is essentially derived from the patterns in its spatial structure, intensity variations, and color variations. Thus, we divide the overall design of a lossless image compression scheme into three parts that exploit corresponding redundancies. We further argue that the binarized version of an image captures its fundamental spatial structure. In this first part of our work, we propose a scheme for lossless compression of binary images. The proposed scheme first learns dictionaries of , , , and square pixel patterns from various datasets of binary images. It then uses these dictionaries to encode binary images. These dictionaries have various interesting properties that are further exploited to construct an efficient and scalable scheme. Our preliminary results show that the proposed scheme consistently outperforms existing conventional and learning based lossless compression approaches, and provides, on average, as much as better performance than a common general purpose lossless compression scheme (WebP), more than better performance than a state of the art learning based scheme, and better performance than a specialized scheme for binary image compression (JBIG2).
Paper Structure (23 sections, 1 equation, 11 figures, 5 tables)

This paper contains 23 sections, 1 equation, 11 figures, 5 tables.

Figures (11)

  • Figure 1: The bits $b_i$ or blocks $B_n^i$ are read in a raster scan order; here $n$ indicates the patch size and $i$, the bit or the block.
  • Figure 2: Illustration of hybrid encoding for a $32 \times 32$ patch.
  • Figure 3: Sample images from each dataset. Left to right, ImageNet deng2009imagenet, SAM sam, iNaturalist iNat, and Kodak kodak.
  • Figure 4: Pattern frequencies for $2 \times 2$ in a log-linear scale. Vertical bars represent the count of symbols lying in the particular frequency range. The frequency corresponding to the center of the interval is indicated along the X-axis.
  • Figure 5: Statistics for $4 \times 4$ patterns.
  • ...and 6 more figures