Hard-aware Instance Adaptive Self-training for Unsupervised Cross-domain Semantic Segmentation
Chuang Zhu, Kebin Liu, Wenqi Tang, Ke Mei, Jiaqi Zou, Tiejun Huang
TL;DR
This work tackles unsupervised domain adaptation for semantic segmentation under domain shift by focusing on high-quality, diverse pseudo-labels for hard classes. It introduces HIAST, a framework consisting of an Instance Adaptive Selector (IAS) for adaptive per-class thresholds, Hard-aware Pseudo-label Augmentation (HPLA) to enrich hard-class pseudo-labels via inter-image copying, and region-adaptive regularization with a Mean-Teacher-style consistency constraint to stabilize training. Ablations and experiments on GTA5→Cityscapes, SYNTHIA→Cityscapes, and Cityscapes→Oxford RobotCar show that HIAST delivers state-of-the-art performance, particularly for hard classes and small objects, and can be plugged into other UDA methods. The approach also extends to semi-supervised segmentation and emphasizes parameter selection without ground-truth labels, making it practical and broadly applicable.
Abstract
The divergence between labeled training data and unlabeled testing data is a significant challenge for recent deep learning models. Unsupervised domain adaptation (UDA) attempts to solve such problem. Recent works show that self-training is a powerful approach to UDA. However, existing methods have difficulty in balancing the scalability and performance. In this paper, we propose a hard-aware instance adaptive self-training framework for UDA on the task of semantic segmentation. To effectively improve the quality and diversity of pseudo-labels, we develop a novel pseudo-label generation strategy with an instance adaptive selector. We further enrich the hard class pseudo-labels with inter-image information through a skillfully designed hard-aware pseudo-label augmentation. Besides, we propose the region-adaptive regularization to smooth the pseudo-label region and sharpen the non-pseudo-label region. For the non-pseudo-label region, consistency constraint is also constructed to introduce stronger supervision signals during model optimization. Our method is so concise and efficient that it is easy to be generalized to other UDA methods. Experiments on GTA5 to Cityscapes, SYNTHIA to Cityscapes, and Cityscapes to Oxford RobotCar demonstrate the superior performance of our approach compared with the state-of-the-art methods. Our codes are available at https://github.com/bupt-ai-cz/HIAST.
