FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions
Sohyun Lee, Namyup Kim, Sungyeon Kim, Suha Kwak
TL;DR
FREST tackles semantic segmentation under multiple adverse conditions in a source-free domain adaptation setting by alternating between learning a condition embedding space and restoring adverse-condition features to resemble normal-condition representations in that space. It introduces a light-weight condition strainer attached to a frozen encoder and a projection head to capture condition-specific information, and a feature-restoration objective that aligns adverse features with normal-condition embeddings while discriminating between encoder and condition-infused features. This approach reduces condition-induced feature gaps, avoids catastrophic forgetting of source-domain knowledge, and achieves state-of-the-art results on Cityscapes→ACDC and Cityscapes→RobotCar, with strong generalization to unseen domains. The method is parameter-efficient, requires no source data during adaptation, and relies on publicly available GNSS-based pairs and warping for alignment, offering practical utility for real-world robust semantic segmentation.
Abstract
Robust semantic segmentation under adverse conditions is crucial in real-world applications. To address this challenging task in practical scenarios where labeled normal condition images are not accessible in training, we propose FREST, a novel feature restoration framework for source-free domain adaptation (SFDA) of semantic segmentation to adverse conditions. FREST alternates two steps: (1) learning the condition embedding space that only separates the condition information from the features and (2) restoring features of adverse condition images on the learned condition embedding space. By alternating these two steps, FREST gradually restores features where the effect of adverse conditions is reduced. FREST achieved a state of the art on two public benchmarks (i.e., ACDC and RobotCar) for SFDA to adverse conditions. Moreover, it shows superior generalization ability on unseen datasets.
