Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation
Bin-Bin Gao, Xiaochen Chen, Zhongyi Huang, Congchong Nie, Jun Liu, Jinxiang Lai, Guannan Jiang, Xi Wang, Chengjie Wang
TL;DR
This work addresses the missing-label issue in instance-level few-shot detection and segmentation, which induces biased foreground–background classification. It introduces a simple decoupled classifier with two parallel heads, forming $L_{CLS} = L_{CLS}^{fg} + L_{CLS}^{bg}$, where the negative head uses a constrained logits construction $\bar{x}_i = m_i x_i$ and probabilities $\bar{p}_i$ to mitigate mislabel-driven bias. By applying this approach to FSOD/FSIS and generalized FSOD/FSIS on PASCAL VOC and MS-COCO, the method yields significant performance gains without adding parameters or computation, outperforming strong baselines such as DeFRCN. The results demonstrate the method’s robustness across low-shot regimes and its applicability to both detection and instance segmentation, with code released for reproducibility. Overall, the paper advances a practical, theory-grounded strategy for coping with incomplete labels in few-shot vision tasks.
Abstract
This paper focus on few-shot object detection~(FSOD) and instance segmentation~(FSIS), which requires a model to quickly adapt to novel classes with a few labeled instances. The existing methods severely suffer from bias classification because of the missing label issue which naturally exists in an instance-level few-shot scenario and is first formally proposed by us. Our analysis suggests that the standard classification head of most FSOD or FSIS models needs to be decoupled to mitigate the bias classification. Therefore, we propose an embarrassingly simple but effective method that decouples the standard classifier into two heads. Then, these two individual heads are capable of independently addressing clear positive samples and noisy negative samples which are caused by the missing label. In this way, the model can effectively learn novel classes while mitigating the effects of noisy negative samples. Without bells and whistles, our model without any additional computation cost and parameters consistently outperforms its baseline and state-of-the-art by a large margin on PASCAL VOC and MS-COCO benchmarks for FSOD and FSIS tasks. The Code is available at https://csgaobb.github.io/Projects/DCFS.
