GLASS Flows: Transition Sampling for Alignment of Flow and Diffusion Models
Peter Holderrieth, Uriel Singer, Tommi Jaakkola, Ricky T. Q. Chen, Yaron Lipman, Brian Karrer
TL;DR
GLASS Flows address the inefficiency of reward-alignment methods that rely on stochastic sampling by enabling efficient ODE-based sampling of Markov transitions $p_{t'|t}$ via an inner, retrievable flow-matching model derived through sufficient statistics. This unifies the efficiency of ODE sampling with the stochasticity of SDEs, improving text-to-image generation performance when combined with reward-alignment techniques like Feynman-Kac Steering and reward guidance. The approach yields a theoretical framework for constructing transitions, practical algorithms for posterior sampling and SMC, and strong empirical gains on large-scale models, effectively removing the efficiency-stochasticity tradeoff in inference-time scaling. Overall, GLASS Flows serve as a plug-in, training-free enhancement for reward-aligned sampling across flow and diffusion models with clear practical impact for scalable, high-quality generation.
Abstract
The performance of flow matching and diffusion models can be greatly improved at inference time using reward alignment algorithms, yet efficiency remains a major limitation. While several algorithms were proposed, we demonstrate that a common bottleneck is the sampling method these algorithms rely on: many algorithms require to sample Markov transitions via SDE sampling, which is significantly less efficient and often less performant than ODE sampling. To remove this bottleneck, we introduce GLASS Flows, a new sampling paradigm that simulates a "flow matching model within a flow matching model" to sample Markov transitions. As we show in this work, this "inner" flow matching model can be retrieved from a pre-trained model without any re-training, combining the efficiency of ODEs with the stochastic evolution of SDEs. On large-scale text-to-image models, we show that GLASS Flows eliminate the trade-off between stochastic evolution and efficiency. Combined with Feynman-Kac Steering, GLASS Flows improve state-of-the-art performance in text-to-image generation, making it a simple, drop-in solution for inference-time scaling of flow and diffusion models.
