Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval
Leah Bar, Boaz Lerner, Nir Darshan, Rami Ben-Ari
TL;DR
The paper tackles interactive image retrieval under challenging open-set and imbalanced conditions with very few labeled examples. It introduces GAL, a Greedy Active Learning framework that combines a sample-wise impact value for linear and non-linear classifiers with a global GP-based impact measure, then builds batches greedily to balance uncertainty and diversity. The authors prove a $(1- frac{1}{e})$-approximation guarantee for the GP-based greedy strategy and demonstrate strong empirical gains across SVM, MLP, and GP on Paris-6K, Places, FSOD-IR, and MIRFLICKR-25K, with practical runtimes and a public code release. The approach advances interactive CBIR by enabling efficient cold-start learning and robust performance under open-set and imbalanced conditions, potentially benefiting other open-set AL applications. The combination of theoretical guarantees and broad empirical validation highlights GAL as a versatile toolkit for BMAL in challenging retrieval tasks.
Abstract
Active Learning (AL) is a user-interactive approach aimed at reducing annotation costs by selecting the most crucial examples to label. Although AL has been extensively studied for image classification tasks, the specific scenario of interactive image retrieval has received relatively little attention. This scenario presents unique characteristics, including an open-set and class-imbalanced binary classification, starting with very few labeled samples. We introduce a novel batch-mode Active Learning framework named GAL (Greedy Active Learning) that better copes with this application. It incorporates a new acquisition function for sample selection that measures the impact of each unlabeled sample on the classifier. We further embed this strategy in a greedy selection approach, better exploiting the samples within each batch. We evaluate our framework with both linear (SVM) and non-linear MLP/Gaussian Process classifiers. For the Gaussian Process case, we show a theoretical guarantee on the greedy approximation. Finally, we assess our performance for the interactive content-based image retrieval task on several benchmarks and demonstrate its superiority over existing approaches and common baselines. Code is available at https://github.com/barleah/GreedyAL.
