PACF: Prototype Augmented Compact Features for Improving Domain Adaptive Object Detection
Chenguang Liu, Yongchao Feng, Yanan Zhang, Qingjie Liu, Yunhong Wang
TL;DR
Domain Adaptive Object Detection suffers from a domain gap that broadens intra-class variance and shifts class means in the target domain. PACF tackles this by combining a prototype cross entropy loss, which aligns target RoI features to both source and target class prototypes, with a mutual regularization strategy that balances feature compactness and discriminability between linear and prototype-based classifiers; a theoretical lower bound on the target feature likelihood underpins the approach. The method uses a teacher–student framework with prototype-based classifiers and dynamically updated class prototypes, leading to a more compact and discriminative cross-domain feature space. Empirical results across Cityscapes↔Foggy Cityscapes, Sim10k↔Cityscapes, and KITTI↔Cityscapes show consistent gains over strong baselines, validating the effectiveness and generality of PACF for robust cross-domain object detection.
Abstract
In recent years, there has been significant advancement in object detection. However, applying off-the-shelf detectors to a new domain leads to significant performance drop, caused by the domain gap. These detectors exhibit higher-variance class-conditional distributions in the target domain than that in the source domain, along with mean shift. To address this problem, we propose the Prototype Augmented Compact Features (PACF) framework to regularize the distribution of intra-class features. Specifically, we provide an in-depth theoretical analysis on the lower bound of the target features-related likelihood and derive the prototype cross entropy loss to further calibrate the distribution of target RoI features. Furthermore, a mutual regularization strategy is designed to enable the linear and prototype-based classifiers to learn from each other, promoting feature compactness while enhancing discriminability. Thanks to this PACF framework, we have obtained a more compact cross-domain feature space, within which the variance of the target features' class-conditional distributions has significantly decreased, and the class-mean shift between the two domains has also been further reduced. The results on different adaptation settings are state-of-the-art, which demonstrate the board applicability and effectiveness of the proposed approach.
