TreeNet: A Light Weight Model for Low Bitrate Image Compression
Mahadev Prasad Panda, Purnachandra Rao Makkena, Srivatsa Prativadibhayankaram, Siegfried Fößel, André Kaup
TL;DR
TreeNet tackles high computational cost in learned image compression by introducing a binary-tree encoder–decoder with attentional fusion, achieving strong rate–distortion performance at low bitrates with substantially reduced complexity. The architecture yields four latent streams processed by separate entropy bottlenecks and a constrained, low-channel-count decoder, enabling efficient parallelizable computation. Comprehensive experiments on Kodak, CLIC, and Tecnick show competitive RD performance and a notable 87.8% reduction in complexity versus JPEG AI, supported by rigorous ablations that reveal latent-block contributions and spatial rate distribution. Overall, TreeNet offers a practical, interpretable, low-cost path toward scalable learning-based image compression.
Abstract
Reducing computational complexity remains a critical challenge for the widespread adoption of learning-based image compression techniques. In this work, we propose TreeNet, a novel low-complexity image compression model that leverages a binary tree-structured encoder-decoder architecture to achieve efficient representation and reconstruction. We employ attentional feature fusion mechanism to effectively integrate features from multiple branches. We evaluate TreeNet on three widely used benchmark datasets and compare its performance against competing methods including JPEG AI, a recent standard in learning-based image compression. At low bitrates, TreeNet achieves an average improvement of 4.83% in BD-rate over JPEG AI, while reducing model complexity by 87.82%. Furthermore, we conduct extensive ablation studies to investigate the influence of various latent representations within TreeNet, offering deeper insights into the factors contributing to reconstruction.
