Enhancing cross-domain detection: adaptive class-aware contrastive transformer
Ziru Zeng, Yue Ding, Hongtao Lu
TL;DR
This work tackles cross-domain object detection with transformer-based detectors under target-domain label scarcity. It introduces an adaptive class-aware contrastive transformer (ACCT) framework that integrates IoU-guided pseudo-label refinement, per-class adaptive thresholds via Gaussian Mixture Models, and an instance-level class-aware contrastive loss within a mean-teacher and adversarial learning setup. The approach yields improved detection performance across weather, synthetic-to-real, and scene adaptation benchmarks, especially for minority classes, by stabilizing pseudo-label quality and promoting discriminative features. The work offers a practical pathway to robust, post-processing-free cross-domain detection with transformer architectures.
Abstract
Recently,the detection transformer has gained substantial attention for its inherent minimal post-processing requirement.However,this paradigm relies on abundant training data,yet in the context of the cross-domain adaptation,insufficient labels in the target domain exacerbate issues of class imbalance and model performance degradation.To address these challenges, we propose a novel class-aware cross domain detection transformer based on the adversarial learning and mean-teacher framework.First,considering the inconsistencies between the classification and regression tasks,we introduce an IoU-aware prediction branch and exploit the consistency of classification and location scores to filter and reweight pseudo labels.Second, we devise a dynamic category threshold refinement to adaptively manage model confidence.Third,to alleviate the class imbalance,an instance-level class-aware contrastive learning module is presented to encourage the generation of discriminative features for each class,particularly benefiting minority classes.Experimental results across diverse domain-adaptive scenarios validate our method's effectiveness in improving performance and alleviating class imbalance issues,which outperforms the state-of-the-art transformer based methods.
