Do We Need Perfect Data? Leveraging Noise for Domain Generalized Segmentation
Taeyeong Kim, SeungJoon Lee, Jung Uk Kim, MyeongAh Cho
TL;DR
This work tackles domain generalization in semantic segmentation by addressing the boundary-level misalignment that arises when using diffusion-based synthetic data. The proposed FLEX-Seg framework combines Granular Adaptive Prototypes for multi-scale boundary representation, Uncertainty Boundary Emphasis to dynamicall yweight uncertain regions, and Hardness-Aware Sampling to focus training on challenging examples. Through extensive experiments on five real-world datasets, FLEX-Seg consistently improves over state-of-the-art methods, including notable gains on adverse domains like ACDC and Dark Zurich, validating that leveraging imperfect synthetic data can yield robust domain generalization. The approach offers a practical path to better generalization in dense prediction tasks and suggests broader applicability to other boundary-sensitive problems.
Abstract
Domain generalization in semantic segmentation faces challenges from domain shifts, particularly under adverse conditions. While diffusion-based data generation methods show promise, they introduce inherent misalignment between generated images and semantic masks. This paper presents FLEX-Seg (FLexible Edge eXploitation for Segmentation), a framework that transforms this limitation into an opportunity for robust learning. FLEX-Seg comprises three key components: (1) Granular Adaptive Prototypes that captures boundary characteristics across multiple scales, (2) Uncertainty Boundary Emphasis that dynamically adjusts learning emphasis based on prediction entropy, and (3) Hardness-Aware Sampling that progressively focuses on challenging examples. By leveraging inherent misalignment rather than enforcing strict alignment, FLEX-Seg learns robust representations while capturing rich stylistic variations. Experiments across five real-world datasets demonstrate consistent improvements over state-of-the-art methods, achieving 2.44% and 2.63% mIoU gains on ACDC and Dark Zurich. Our findings validate that adaptive strategies for handling imperfect synthetic data lead to superior domain generalization. Code is available at https://github.com/VisualScienceLab-KHU/FLEX-Seg.
