Label-Noise Robust Diffusion Models
Byeonghu Na, Yeongmin Kim, HeeSun Bae, Jung Hyun Lee, Se Jung Kwon, Wanmo Kang, Il-Chul Moon
TL;DR
This work addresses conditional diffusion models trained on noisy labels by deriving a linear relationship between clean-label and noisy-label conditional scores and introducing Transition-aware weighted Denoising Score Matching (TDSM). The core idea is to represent the noisy-score as a time- and instance-dependent convex combination of clean-score networks using weights $w(\boldsymbol{x}_t, \tilde{y}, y, t)$ derived from the transition matrix $S$ and a time-dependent noisy-label classifier. The authors prove theoretical guarantees that, under invertible $S$, TDSM recovers the true clean-label conditional scores, and they provide practical estimation methods for the weights, including VolMinNet when $S$ is unknown. Empirical results on MNIST, CIFAR-10/100, and Clothing-1M show that TDSM improves conditional and unconditional generation metrics under various noise regimes and remains beneficial when combined with existing noisy-label correctors. The approach offers a scalable, diffusion-specific remedy to label noise with strong empirical robustness and practical training considerations.
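The linear relationship can be checked numerically on a toy problem. The sketch below is illustrative only and is not taken from the authors' code. It assumes class-conditional label noise (the noisy label depends on the data only through the clean label), two classes whose perturbed densities at a fixed time are 1-D Gaussians, and a hypothetical transition matrix `T` indexed as `T[y, y_noisy] = p(y_noisy | y)`, a convention that may differ from the paper's $S$. Under these assumptions the noisy-label density is $p_t(x_t \mid \tilde{y}) = \sum_y p(y \mid \tilde{y})\, p_t(x_t \mid y)$, so its score is a weighted sum of clean-label scores with weights $p(y \mid \tilde{y})\, p_t(x_t \mid y) / p_t(x_t \mid \tilde{y})$, which are non-negative and sum to one.

```python
import numpy as np

# Toy setup (all values are assumptions chosen for illustration).
means = np.array([-2.0, 1.5])   # means of the clean class-conditional densities
std = 1.0                       # shared standard deviation
prior = np.array([0.5, 0.5])    # clean-label prior p(y)
T = np.array([[0.8, 0.2],       # T[y, y_noisy] = p(y_noisy | y)
              [0.3, 0.7]])

# Bayes' rule: posterior[y, y_noisy] = p(y | y_noisy); each column sums to 1.
joint = prior[:, None] * T
posterior = joint / joint.sum(axis=0, keepdims=True)


def gaussian_pdf(x, mean, std):
    """Density of N(mean, std^2) evaluated at x."""
    return np.exp(-0.5 * ((x - mean) / std) ** 2) / (std * np.sqrt(2.0 * np.pi))


def clean_scores(x):
    """Score d/dx log p(x | y) for each clean class y (shape: [num_classes])."""
    return -(x - means) / std**2


def weights(x, y_noisy):
    """Instance-dependent weights p(y | y_noisy) p(x | y) / p(x | y_noisy)."""
    unnormalized = posterior[:, y_noisy] * gaussian_pdf(x, means, std)
    return unnormalized / unnormalized.sum()


def noisy_score_from_weights(x, y_noisy):
    """Noisy-label score written as a convex combination of clean scores."""
    return float(np.dot(weights(x, y_noisy), clean_scores(x)))


def noisy_log_density(x, y_noisy):
    """log p(x | y_noisy) for the mixture induced by the label noise."""
    return np.log(np.sum(posterior[:, y_noisy] * gaussian_pdf(x, means, std)))


def noisy_score_finite_diff(x, y_noisy, eps=1e-5):
    """Reference score from a central finite difference of the log density."""
    upper = noisy_log_density(x + eps, y_noisy)
    lower = noisy_log_density(x - eps, y_noisy)
    return (upper - lower) / (2.0 * eps)
```

Comparing `noisy_score_from_weights(x, y_noisy)` with `noisy_score_finite_diff(x, y_noisy)` at a few points shows the two agree up to finite-difference error. In this toy case the weights are computed from known densities; in the method summarized above they are instead estimated from the transition matrix and a time-dependent noisy-label classifier.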
Abstract
Conditional diffusion models have shown remarkable performance in various generative tasks, but training them requires large-scale datasets that often contain noise in conditional inputs, a.k.a. noisy labels. This noise leads to condition mismatch and quality degradation of generated data. This paper proposes Transition-aware weighted Denoising Score Matching (TDSM) for training conditional diffusion models with noisy labels, which is the first study of this problem for diffusion models. The TDSM objective contains a weighted sum of score networks, incorporating instance-wise and time-dependent label transition probabilities. We introduce a transition-aware weight estimator, which leverages a time-dependent noisy-label classifier specifically tailored to the diffusion process. Through experiments across various datasets and noisy label settings, we show that TDSM improves the quality of generated samples aligned with the given conditions. Furthermore, our method improves generation performance even on prevalent benchmark datasets, which suggests that these datasets may contain noisy labels and that such labels pose a risk to generative model learning. Finally, we show the improved performance of TDSM on top of conventional noisy label corrections, which empirically demonstrates its contribution as a part of label-noise robust generative models. Our code is available at: https://github.com/byeonghu-na/tdsm.
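As a rough illustration of what "a weighted sum of score networks" means inside a denoising score matching loss, the sketch below computes the loss for a single sample. It is a minimal sketch under stated assumptions, not the authors' implementation: `score_fn` and `weight_fn` are hypothetical callables standing in for the score network and the transition-aware weight estimator, the perturbation kernel is assumed to be Gaussian, $\boldsymbol{x}_t = \alpha_t \boldsymbol{x}_0 + \sigma_t \boldsymbol{\epsilon}$, and any time-dependent loss weighting is omitted.

```python
import numpy as np


def tdsm_loss_single(score_fn, weight_fn, x0, y_noisy, eps, alpha_t, sigma_t, t):
    """Weighted denoising score matching loss for one (x0, y_noisy) pair.

    score_fn(x_t, y, t)        -> array of shape [D]  (hypothetical score network)
    weight_fn(x_t, y_noisy, t) -> array of shape [K]  (hypothetical weight estimator)
    """
    # Perturb the clean sample with the assumed Gaussian kernel.
    x_t = alpha_t * x0 + sigma_t * eps

    # Score of the perturbation kernel: grad_{x_t} log p(x_t | x0) = -eps / sigma_t.
    target = -eps / sigma_t

    # Evaluate the score function once per candidate clean label.
    w = weight_fn(x_t, y_noisy, t)                                         # [K]
    class_scores = np.stack([score_fn(x_t, y, t) for y in range(len(w))])  # [K, D]

    # Weighted sum over clean labels, then squared error against the target.
    mixed_score = w @ class_scores                                         # [D]
    return float(np.sum((mixed_score - target) ** 2))
```

With one-hot weights that always select the given noisy label, this reduces to the usual conditional denoising score matching loss, which is one way to see how the weights are what account for the label noise.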
