Improving Diffusion Models's Data-Corruption Resistance using Scheduled Pseudo-Huber Loss
Artem Khrapov, Vadim Popov, Tasnima Sadekova, Assel Yermekova, Mikhail Kudinov
TL;DR
The paper tackles diffusion models' sensitivity to training data outliers by introducing a time-dependent pseudo-Huber loss, enabling robust learning without data filtering. By scheduling the delta parameter, the method balances robustness during early reverse-diffusion steps with fine-detail reconstruction in later steps. The authors provide theoretical motivation and demonstrate improvements on image and audio tasks, showing enhanced resilience to corrupted data and reduced reliance on dataset purification. This approach offers a practical, low-cost path to robust diffusion training with potential wide-ranging impact in real-world data-laden settings.
Abstract
Diffusion models are known to be vulnerable to outliers in training data. In this paper we study an alternative diffusion loss function, which can preserve the high quality of generated data like the original squared $L_{2}$ loss while at the same time being robust to outliers. We propose to use pseudo-Huber loss function with a time-dependent parameter to allow for the trade-off between robustness on the most vulnerable early reverse-diffusion steps and fine details restoration on the final steps. We show that pseudo-Huber loss with the time-dependent parameter exhibits better performance on corrupted datasets in both image and audio domains. In addition, the loss function we propose can potentially help diffusion models to resist dataset corruption while not requiring data filtering or purification compared to conventional training algorithms.
