Diffusion Sampling Path Tells More: An Efficient Plug-and-Play Strategy for Sample Filtering

Sixian Wang; Zhiwei Tang; Tsung-Hui Chang

TL;DR

This work tackles the problem of inconsistent image quality in diffusion models caused by stochastic sampling. It uncovers a strong link between sample quality and Accumulated Score Differences (ASD) during classifier-free guidance and proposes CFG-Rejection, a plug-and-play, reward-free method that prunes low-potential denoising trajectories early using a partial ASD measure $\mathcal{E}_{\tau:T}(c)$ with a threshold $\gamma$. The approach requires no architectural changes or retraining and integrates with existing diffusion pipelines, yielding consistent improvements in human and automated quality metrics across ImageNet, GenEval, DPG-Bench, and visual-text tasks. Through extensive experiments, the paper demonstrates substantial compute savings and quality gains, suggesting broad applicability of ASD-based latent-space filtering beyond images and highlighting a practical, zero-cost enhancement for diffusion-based generation.
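
For concreteness, one plausible way to write the partial measure is given below. This summary does not state the exact definition, so the choice of norm, the use of noise predictions $\epsilon_\theta$ as a stand-in for scores, and the direction of the acceptance test are all assumptions made for illustration.

$$
\mathcal{E}_{\tau:T}(c) = \sum_{t=\tau}^{T} \left\| \epsilon_\theta(x_t, c, t) - \epsilon_\theta(x_t, \varnothing, t) \right\|_2,
\qquad \text{keep the trajectory if } \mathcal{E}_{\tau:T}(c) \ge \gamma .
$$

Under this reading, $T$ is the initial (noisiest) timestep and $\tau < T$ is the step at which the filter is applied, so a rejected sample costs only the first $T - \tau + 1$ denoising steps rather than the full trajectory.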

Abstract

Diffusion models often exhibit inconsistent sample quality due to stochastic variations inherent in their sampling trajectories. Although training-based fine-tuning (e.g., DDPO [1]) and inference-time alignment techniques [2] aim to improve sample fidelity, they typically require full denoising processes and external reward signals. This incurs substantial computational costs, hindering their broader applicability. In this work, we unveil an intriguing phenomenon: a previously unobserved yet exploitable link between sample quality and characteristics of the denoising trajectory during classifier-free guidance (CFG). Specifically, we identify a strong correlation between high-density regions of the sample distribution and the Accumulated Score Differences (ASD), the cumulative divergence between conditional and unconditional scores. Leveraging this insight, we introduce CFG-Rejection, an efficient, plug-and-play strategy that filters low-quality samples at an early stage of the denoising process without requiring external reward signals or model retraining. Our approach requires no modifications to model architectures or sampling schedules and remains fully compatible with existing diffusion frameworks. We validate the effectiveness of CFG-Rejection in image generation through extensive experiments, demonstrating marked improvements on human preference scores (HPSv2, PickScore) and challenging benchmarks (GenEval, DPG-Bench). We anticipate that CFG-Rejection will offer significant advantages for diverse generative modalities beyond images, paving the way for more efficient and reliable high-quality sample generation.
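
The filtering idea described above can be sketched in a few lines of NumPy. This is a minimal illustration and not the authors' implementation: the function names `partial_asd` and `keep_mask` are hypothetical, the per-step divergence is assumed to be an L2 norm, and the rule "keep samples whose ASD is at least the threshold" is an assumed direction that may differ from the paper. The inputs stand for the conditional and unconditional model outputs that CFG already computes at each early denoising step.

```python
import numpy as np


def partial_asd(cond_scores: np.ndarray, uncond_scores: np.ndarray) -> np.ndarray:
    """Accumulate per-sample score differences over the early steps.

    Both inputs have shape (steps, batch, ...). The result has shape (batch,).
    The L2 norm per step is an assumption made for this sketch.
    """
    diff = cond_scores - uncond_scores
    steps, batch = diff.shape[:2]
    # Flatten everything after (steps, batch) and take one norm per step and sample.
    per_step = np.linalg.norm(diff.reshape(steps, batch, -1), axis=-1)
    return per_step.sum(axis=0)


def keep_mask(asd: np.ndarray, gamma: float, keep_high: bool = True) -> np.ndarray:
    """Boolean mask of trajectories that continue denoising.

    keep_high=True keeps samples with ASD >= gamma (assumed direction).
    """
    return asd >= gamma if keep_high else asd <= gamma


# Toy example: 2 early steps, 3 candidate samples, 4-dimensional latents.
uncond = np.zeros((2, 3, 4))
cond = np.zeros((2, 3, 4))
cond[:, 0] = 1.0   # large conditional/unconditional gap
cond[:, 2] = 0.5   # moderate gap; sample 1 has no gap at all

asd = partial_asd(cond, uncond)
mask = keep_mask(asd, gamma=1.5)
print(asd, mask)
```

In a real pipeline, only the samples where `mask` is true would be denoised to completion; the rest would be discarded after the early steps, which is where the compute savings would come from.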
