CLIC: Contrastive Learning Framework for Unsupervised Image Complexity Representation
Shipeng Liu, Liang Zhao, Dengfeng Chen
TL;DR
CLIC addresses the challenge of quantifying image complexity without manual labels by learning IC representations through a dual-encoder contrastive framework. It introduces a complexity-aware loss guided by a global entropy prior and a novel positive/negative sampling strategy tailored to IC, enabling unsupervised learning from large unlabeled corpora and effective fine-tuning on IC9600 with only a small labeled subset. Ablation and downstream-task results show that CLIC yields strong IC representations, improves performance in detection and segmentation tasks, and reduces reliance on subjective human annotations. The work offers a scalable, bias-free approach to perceptual image complexity that can enhance a range of CV pipelines.
Abstract
As a fundamental visual attribute, image complexity significantly influences both human perception and the performance of computer vision models. However, accurately assessing and quantifying image complexity remains a challenging task. (1) Traditional metrics such as information entropy and compression ratio often yield coarse and unreliable estimates. (2) Data-driven methods require expensive manual annotations and are inevitably affected by human subjective biases. To address these issues, we propose CLIC, an unsupervised framework based on Contrastive Learning for learning Image Complexity representations. CLIC learns complexity-aware features from unlabeled data, thereby eliminating the need for costly labeling. Specifically, we design a novel positive and negative sample selection strategy to enhance the discrimination of complexity features. Additionally, we introduce a complexity-aware loss function guided by image priors to further constrain the learning process. Extensive experiments validate the effectiveness of CLIC in capturing image complexity. When fine-tuned with a small number of labeled samples from IC9600, CLIC achieves performance competitive with supervised methods. Moreover, applying CLIC to downstream tasks consistently improves performance. Notably, both the pretraining and application processes of CLIC are free from subjective bias.
