Enhancing Diffusion Model Guidance through Calibration and Regularization

Seyed Alireza Javid; Amirhossein Bagheri; Nuria González-Prelcic

TL;DR

Classifier-guided diffusion commonly suffers from overconfident classifier predictions during early denoising steps, which cause the guidance gradient to vanish. The paper couples a differentiable Smooth ECE calibration loss with divergence-aware sampling strategies that operate on off-the-shelf classifiers, avoiding diffusion-model retraining. It analyzes reverse KL, forward KL, and Jensen-Shannon divergences theoretically and evaluates them on ImageNet 128x128, where Jensen-Shannon divergence with a ResNet-101 classifier achieves an FID of 2.13, improving on existing classifier-guided diffusion methods while maintaining a strong precision-recall balance. The results indicate that principled calibration and divergence-aware sampling provide practical, plug-and-play improvements for conditional image generation.

Abstract

Classifier-guided diffusion models have emerged as a powerful approach for conditional image generation, but they suffer from overconfident predictions during early denoising steps, causing the guidance gradient to vanish. This paper introduces two complementary contributions to address this issue. First, we propose a differentiable calibration objective based on the Smooth Expected Calibration Error (Smooth ECE), which improves classifier calibration with minimal fine-tuning and yields measurable improvements in Fréchet Inception Distance (FID). Second, we develop enhanced sampling guidance methods that operate on off-the-shelf classifiers without requiring retraining. These include tilted sampling with batch-level reweighting, adaptive entropy-regularized sampling to preserve diversity, and a novel f-divergence-based sampling strategy that strengthens class-consistent guidance while maintaining mode coverage. Experiments on ImageNet 128x128 demonstrate that our divergence-regularized guidance achieves an FID of 2.13 using a ResNet-101 classifier, improving upon existing classifier-guided diffusion methods while requiring no diffusion model retraining. The results show that principled calibration and divergence-aware sampling provide practical and effective improvements for classifier-guided diffusion.
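
The vanishing-gradient effect mentioned in the abstract can be illustrated with a small sketch. It is not the paper's code: it assumes a plain softmax classifier and looks only at the gradient of log p(y | x) with respect to the logits, which equals the one-hot target minus the softmax probabilities. The full guidance gradient with respect to the image also passes through the classifier's Jacobian, which is omitted here. The function names and the example logits are hypothetical.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of logits.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def log_prob_grad_wrt_logits(logits, target):
    # d/dz_k log softmax(z)[target] = 1[k == target] - softmax(z)[k]
    probs = softmax(logits)
    return [(1.0 if k == target else 0.0) - p for k, p in enumerate(probs)]

def grad_norm(logits, target):
    # Euclidean norm of the logit-space gradient.
    g = log_prob_grad_wrt_logits(logits, target)
    return math.sqrt(sum(v * v for v in g))

# Hypothetical logits: a moderately confident and an overconfident prediction
# for the same target class (index 0).
moderate = [2.0, 1.0, 0.5]
overconfident = [20.0, 1.0, 0.5]

print("moderate:     ", grad_norm(moderate, 0))
print("overconfident:", grad_norm(overconfident, 0))
```

As the predicted probability of the target class approaches 1, every component of this gradient approaches 0, so the guidance signal carried back to the sample shrinks accordingly. This is the behavior that calibration is meant to counteract.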
