Flow Density Control: Generative Optimization Beyond Entropy-Regularized Fine-Tuning
Riccardo De Santi, Marin Vlastelica, Ya-Ping Hsieh, Zebang Shen, Niao He, Andreas Krause
TL;DR
This work generalizes fine-tuning of flow and diffusion models from KL-regularized reward optimization to GO, where arbitrary distributional utilities and divergences are optimized under flow dynamics. It introduces Flow Density Control (FDC), a mirror-descent-based method that converts non-linear GO into a sequence of linear GO problems using the first variation of the objective. The authors provide convergence guarantees under both idealized concave and general noisy settings via mirror-flow theory and demonstrate effectiveness across risk-averse, novelty-seeking, and OT-regularized tasks, including molecular design and text-to-image generation. Together, these advances extend the practical reach of pretrained generative models for complex scientific and creative objectives beyond current fine-tuning schemes.
Abstract
Adapting large-scale foundation flow and diffusion generative models to optimize task-specific objectives while preserving prior information is crucial for real-world applications such as molecular design, protein docking, and creative image generation. Existing principled fine-tuning methods aim to maximize the expected reward of generated samples, while retaining knowledge from the pre-trained model via KL-divergence regularization. In this work, we tackle the significantly more general problem of optimizing general utilities beyond average rewards, including risk-averse and novelty-seeking reward maximization, diversity measures for exploration, and experiment design objectives among others. Likewise, we consider more general ways to preserve prior information beyond KL-divergence, such as optimal transport distances and Renyi divergences. To this end, we introduce Flow Density Control (FDC), a simple algorithm that reduces this complex problem to a specific sequence of simpler fine-tuning tasks, each solvable via scalable established methods. We derive convergence guarantees for the proposed scheme under realistic assumptions by leveraging recent understanding of mirror flows. Finally, we validate our method on illustrative settings, text-to-image, and molecular design tasks, showing that it can steer pre-trained generative models to optimize objectives and solve practically relevant tasks beyond the reach of current fine-tuning schemes.
