Adaptive Gradient Calibration for Single-Positive Multi-Label Learning in Remote Sensing Image Scene Classification
Chenying Liu, Gianmarco Perantoni, Lorenzo Bruzzone, Xiao Xiang Zhu
TL;DR
This work tackles scalable RS image scene classification under single-positive multi-label learning (SPML) by introducing AdaGC, a framework that calibrates gradients to mitigate false negatives, generates robust pseudo-labels with a dual-EMA scheme, and adaptively activates gradient calibration via learning-dynamics analysis. The approach integrates Mixup and provides a theoretical basis for early-learning detection, ensuring GC is engaged at an optimal training stage. Extensive experiments on reBEN and AID-multilabel under Random and Dominant SPML noise demonstrate state-of-the-art performance and robust generalization, with ablations confirming the contributions of the adaptive trigger, dual-EMA, and Mixup. AdaGC offers a practical, noise-robust solution for RS SPML, highlighting its potential for scalable, high-quality multi-label RS classification.
Abstract
Multi-label classification (MLC) offers a more comprehensive semantic understanding of Remote Sensing (RS) imagery compared to traditional single-label classification (SLC). However, obtaining complete annotations for MLC is particularly challenging due to the complexity and high cost of the labeling process. As a practical alternative, single-positive multi-label learning (SPML) has emerged, where each image is annotated with only one relevant label, and the model is expected to recover the full set of labels. While scalable, SPML introduces significant supervision ambiguity, demanding specialized solutions for model training. Although various SPML methods have been proposed in the computer vision domain, research in the RS context remains limited. To bridge this gap, we propose Adaptive Gradient Calibration (AdaGC), a novel and generalizable SPML framework tailored to RS imagery. AdaGC adopts a gradient calibration (GC) mechanism with a dual exponential moving average (EMA) module for robust pseudo-label generation. We introduce a theoretically grounded, training-dynamics-based indicator to adaptively trigger GC, which ensures GC's effectiveness by preventing it from being affected by model underfitting or overfitting to label noise. Extensive experiments on two benchmark RS datasets under two distinct label noise types demonstrate that AdaGC achieves state-of-the-art (SOTA) performance while maintaining strong robustness across diverse settings. The codes and data will be released at https://github.com/rslab-unitrento/AdaGC.
