Score-based Conditional Generation with Fewer Labeled Data by Self-calibrating Classifier Guidance
Paul Kuo-Ming Huang, Si-An Chen, Hsuan-Tien Lin
TL;DR
This paper tackles the challenge of conditional generation with score-based diffusion models when labeled data are scarce. It introduces a self-calibration (SC) technique that reinterprets a time-dependent classifier as an internal energy-based model and regularizes its conditional guidance through a DSM-based loss that aligns the classifier's internal unconditional score with the diffusion process. The approach enables improved class-conditioned generation by leveraging unlabeled data and maintaining stable training, outperforming vanilla CGSGMs and existing regularizers on CIFAR-10/100, especially in semi-supervised settings. The work also discusses practical considerations, including scalability versus high-resolution datasets, and highlights the potential for applying self-calibration to other conditional generative frameworks. Overall, SC provides a principled, efficient mechanism to calibrate classifier-guided SGMs under limited labeled data, with strong empirical gains in both fidelity and diversity of generated samples.
Abstract
Score-based generative models (SGMs) are a popular family of deep generative models that achieve leading image generation quality. Early studies extend SGMs to tackle class-conditional generation by coupling an unconditional SGM with the guidance of a trained classifier. Nevertheless, such classifier-guided SGMs do not always achieve accurate conditional generation, especially when trained with fewer labeled data. We argue that the problem is rooted in the classifier's tendency to overfit without coordinating with the underlying unconditional distribution. To make the classifier respect the unconditional distribution, we propose improving classifier-guided SGMs by letting the classifier regularize itself. The key idea of our proposed method is to use principles from energy-based models to convert the classifier into another view of the unconditional SGM. Existing losses for unconditional SGMs can then be leveraged to achieve regularization by calibrating the classifier's internal unconditional scores. The regularization scheme can be applied to not only the labeled data but also unlabeled ones to further improve the classifier. Across various percentages of fewer labeled data, empirical results show that the proposed approach significantly enhances conditional generation quality. The enhancements confirm the potential of the proposed self-calibration technique for generative modeling with limited labeled data.
