Twofold Debiasing Enhances Fine-Grained Learning with Coarse Labels
Xin-yang Zhao, Jian Jin, Yang-yang Li, Yazhou Yao
TL;DR
This work tackles Coarse-to-Fine Few-Shot (C2FS) learning, where coarse supervision can suppress fine-grained cues and scarce fine-grained samples induce biased classifier distributions. It introduces Twofold Debiasing (TFB), integrating embedding learning debias (via multi-layer feature fusion for reconstruction and intermediate-layer feature alignment) with fine-grained classifier debias (prototype calibration using base-class relationships) to coherently improve both representation and decision boundaries. Across BREEDS and CIFAR-100, TFB achieves state-of-the-art results, notably a substantial gain on CIFAR-100, demonstrating the value of optimizing both the feature extractor and the classifier for C2FS. These results suggest that leveraging coarse-label information for distribution calibration, alongside richer multi-layer representations, yields robust fine-grained recognition under limited supervision and has practical impact for scalable, real-world deployment.
Abstract
The Coarse-to-Fine Few-Shot (C2FS) task is designed to train models using only coarse labels, then leverages a limited number of subclass samples to achieve fine-grained recognition capabilities. This task presents two main challenges: coarse-grained supervised pre-training suppresses the extraction of critical fine-grained features for subcategory discrimination, and models suffer from overfitting due to biased distributions caused by limited fine-grained samples. In this paper, we propose the Twofold Debiasing (TFB) method, which addresses these challenges through detailed feature enhancement and distribution calibration. Specifically, we introduce a multi-layer feature fusion reconstruction module and an intermediate layer feature alignment module to combat the model's tendency to focus on simple predictive features directly related to coarse-grained supervision, while neglecting complex fine-grained level details. Furthermore, we mitigate the biased distributions learned by the fine-grained classifier using readily available coarse-grained sample embeddings enriched with fine-grained information. Extensive experiments conducted on five benchmark datasets demonstrate the efficacy of our approach, achieving state-of-the-art results that surpass competitive methods.
