Robust Learning of Diffusion Models with Extremely Noisy Conditions
Xin Chen, Gillian Dobbie, Xinyu Wang, Feng Liu, Di Wang, Jingfeng Zhang
TL;DR
The paper tackles the challenge of extremely noisy conditioning in conditional diffusion models. It introduces pseudo conditions that are progressively refined via temporal ensembling and a Reverse-time Diffusion Condition (RDC) to strengthen memorization and conditioning under noise. Empirical results demonstrate state-of-the-art robustness on class-conditioned image generation and visuomotor policy tasks across diverse noise regimes. The approach offers practical robustness for real-world applications with unreliable observations and noisy labels.
Abstract
Conditional diffusion models have the generative controllability by incorporating external conditions. However, their performance significantly degrades with noisy conditions, such as corrupted labels in the image generation or unreliable observations or states in the control policy generation. This paper introduces a robust learning framework to address extremely noisy conditions in conditional diffusion models. We empirically demonstrate that existing noise-robust methods fail when the noise level is high. To overcome this, we propose learning pseudo conditions as surrogates for clean conditions and refining pseudo ones progressively via the technique of temporal ensembling. Additionally, we develop a Reverse-time Diffusion Condition (RDC) technique, which diffuses pseudo conditions to reinforce the memorization effect and further facilitate the refinement of the pseudo conditions. Experimentally, our approach achieves state-of-the-art performance across a range of noise levels on both class-conditional image generation and visuomotor policy generation tasks.The code can be accessible via the project page https://robustdiffusionpolicy.github.io
