Refining Alignment Framework for Diffusion Models with Intermediate-Step Preference Ranking
Jie Ren, Yuhang Zhang, Dongrui Liu, Xiaopeng Zhang, Qi Tian
TL;DR
This work argues that trajectory-level final-image rankings used in prior diffusion-model DPO methods can misalign with intermediate-step rewards. It proposes TailorPO, which ranks step-wise noisy samples generated from the same denoising input and uses a DPO-style loss to steer optimization, ensuring gradient directions align with human preferences. The addition of TailorPO-G integrates gradient guidance to broaden reward gaps and further boost performance. Empirical results on Stable Diffusion show improved human-aligned aesthetics and generalization across prompts and reward models, highlighting practical impact for more faithful image generation guided by nuanced human preferences.
Abstract
Direct preference optimization (DPO) has shown success in aligning diffusion models with human preference. Previous approaches typically assume a consistent preference label between final generations and noisy samples at intermediate steps, and directly apply DPO to these noisy samples for fine-tuning. However, we theoretically identify inherent issues in this assumption and its impacts on the effectiveness of preference alignment. We first demonstrate the inherent issues from two perspectives: gradient direction and preference order, and then propose a Tailored Preference Optimization (TailorPO) framework for aligning diffusion models with human preference, underpinned by some theoretical insights. Our approach directly ranks intermediate noisy samples based on their step-wise reward, and effectively resolves the gradient direction issues through a simple yet efficient design. Additionally, we incorporate the gradient guidance of diffusion models into preference alignment to further enhance the optimization effectiveness. Experimental results demonstrate that our method significantly improves the model's ability to generate aesthetically pleasing and human-preferred images.
