Towards Dynamic and Small Objects Refinement for Unsupervised Domain Adaptative Nighttime Semantic Segmentation
Jingyi Pan, Sihang Li, Yucheng Chen, Jinjing Zhu, Lin Wang
TL;DR
This paper tackles nighttime semantic segmentation via unsupervised domain adaptation, addressing illumination-induced domain gaps and the poor transferability of dynamic and small objects. It introduces Dynamic and Small Object Refinement (DSR), which creates a mixed domain by image- and label-level mixup that emphasizes dynamic/small classes and leverages a long-tailed memory bank, and Feature Prototype Alignment (FPA), which uses cross-domain prototypes and contrastive losses with adaptive re-weighting to align source, mixed, and nighttime features. The approach achieves state-of-the-art results on Dark Zurich, Nighttime Driving, and ACDC-night, with clear gains for dynamic and small object categories such as poles, cars, and buses, while preserving all-day performance. Overall, the method offers a practical improvement for robust nighttime perception by focusing supervision on hard-to-transfer objects and reducing domain shifts through prototype-guided mixing and alignment.
Abstract
Nighttime semantic segmentation plays a crucial role in practical applications, such as autonomous driving, where it frequently encounters difficulties caused by inadequate illumination conditions and the absence of well-annotated datasets. Moreover, semantic segmentation models trained on daytime datasets often face difficulties in generalizing effectively to nighttime conditions. Unsupervised domain adaptation (UDA) has shown the potential to address the challenges and achieved remarkable results for nighttime semantic segmentation. However, existing methods still face limitations in 1) their reliance on style transfer or relighting models, which struggle to generalize to complex nighttime environments, and 2) their ignorance of dynamic and small objects like vehicles and poles, which are difficult to be directly learned from other domains. This paper proposes a novel UDA method that refines both label and feature levels for dynamic and small objects for nighttime semantic segmentation. First, we propose a dynamic and small object refinement module to complement the knowledge of dynamic and small objects from the source domain to target the nighttime domain. These dynamic and small objects are normally context-inconsistent in under-exposed conditions. Then, we design a feature prototype alignment module to reduce the domain gap by deploying contrastive learning between features and prototypes of the same class from different domains, while re-weighting the categories of dynamic and small objects. Extensive experiments on three benchmark datasets demonstrate that our method outperforms prior arts by a large margin for nighttime segmentation. Project page: https://rorisis.github.io/DSRNSS/.
