Test-time Alignment of Diffusion Models without Reward Over-optimization
Sunwoo Kim, Minkyu Kim, Dongmin Park
TL;DR
This paper tackles the problem of aligning diffusion models to downstream rewards without incurring reward over-optimization or losing the model’s diversity. It introduces a training-free, test-time approach called SMCAlign, which samples from a tempered, reward-aware target distribution $p_{tar}(x) \propto p_{data}(x) \exp(r(x)/\alpha)$ using Sequential Monte Carlo tailored for diffusion processes. By incorporating tempered intermediate targets and a locally optimal proposal, the method achieves effective reward optimization while preserving cross-reward generalization and diversity, and it extends to single and multi-objective settings as well as online black-box optimization. The approach offers a robust, scalable alternative to fine-tuning for aligning diffusion models with diverse downstream objectives, with public code available for reproducibility.
Abstract
Diffusion models excel in generative tasks, but aligning them with specific objectives while maintaining their versatility remains challenging. Existing fine-tuning methods often suffer from reward over-optimization, while approximate guidance approaches fail to optimize target rewards effectively. Addressing these limitations, we propose a training-free, test-time method based on Sequential Monte Carlo (SMC) to sample from the reward-aligned target distribution. Our approach, tailored for diffusion sampling and incorporating tempering techniques, achieves comparable or superior target rewards to fine-tuning methods while preserving diversity and cross-reward generalization. We demonstrate its effectiveness in single-reward optimization, multi-objective scenarios, and online black-box optimization. This work offers a robust solution for aligning diffusion models with diverse downstream objectives without compromising their general capabilities. Code is available at https://github.com/krafton-ai/DAS.
