ElimPCL: Eliminating Noise Accumulation with Progressive Curriculum Labeling for Source-Free Domain Adaptation
Jie Cheng, Hao Zheng, Meiguang Zheng, Lei Wang, Hao Wu, Jian Zhang
TL;DR
Source-free domain adaptation often struggles with noise accumulation from uncertain pseudo-labels on hard target samples, causing propagation of errors during learning. ElimPCL tackles this by a prototype-consistency curriculum that filters trustworthy pseudo-labels before adaptation and a Dual MixUP strategy in feature space to mitigate noise spread, followed by adaptive co-training with an ImageNet-pretrained backbone. Across multiple benchmarks, ElimPCL yields consistent improvements, especially under severe domain shifts, and ablation confirms the crucial roles of prototype filtering and Dual MixUP in improving hard-sample separability. The approach offers a practical, privacy-preserving pathway to robust SFDA with potential wider applicability to real-world domain adaptation scenarios.
Abstract
Source-Free Domain Adaptation (SFDA) aims to train a target model without source data, and the key is to generate pseudo-labels using a pre-trained source model. However, we observe that the source model often produces highly uncertain pseudo-labels for hard samples, particularly those heavily affected by domain shifts, leading to these noisy pseudo-labels being introduced even before adaptation and further reinforced through parameter updates. Additionally, they continuously influence neighbor samples through propagation in the feature space.To eliminate the issue of noise accumulation, we propose a novel Progressive Curriculum Labeling (ElimPCL) method, which iteratively filters trustworthy pseudo-labeled samples based on prototype consistency to exclude high-noise samples from training. Furthermore, a Dual MixUP technique is designed in the feature space to enhance the separability of hard samples, thereby mitigating the interference of noisy samples on their neighbors.Extensive experiments validate the effectiveness of ElimPCL, achieving up to a 3.4% improvement on challenging tasks compared to state-of-the-art methods.
