FALCON: Few-step Accurate Likelihoods for Continuous Flows
Danyal Rehman, Tara Akhound-Sadegh, Artem Gazizov, Yoshua Bengio, Alexander Tong
TL;DR
FALCON tackles the long-standing challenge of Boltzmann sampling by fusing few-step invertible flows with accurate, tractable likelihoods. By introducing an invertibility-based training objective and a discrete-time flow map, it enables fast sampling while preserving likelihood estimates necessary for SNIS corrections. Empirical results show FALCON outperforms both state-of-the-art continuous flows in scalability and discrete normalizing flows in sample quality, achieving two orders of magnitude speedups over equivalent CNFs. The approach leverages a diffusion-transformer backbone with efficient Jacobian operations and careful inference scheduling to balance accuracy and efficiency. This work expands the practicality of Boltzmann Generators for MD-scale problems and sets a foundation for broader applications in scientific computing that require fast, reliable likelihoods.
Abstract
Scalable sampling of molecular states in thermodynamic equilibrium is a long-standing challenge in statistical physics. Boltzmann Generators tackle this problem by pairing a generative model, capable of exact likelihood computation, with importance sampling to obtain consistent samples under the target distribution. Current Boltzmann Generators primarily use continuous normalizing flows (CNFs) trained with flow matching for efficient training of powerful models. However, likelihood calculation for these models is extremely costly, requiring thousands of function evaluations per sample, severely limiting their adoption. In this work, we propose Few-step Accurate Likelihoods for Continuous Flows (FALCON), a method which allows for few-step sampling with a likelihood accurate enough for importance sampling applications by introducing a hybrid training objective that encourages invertibility. We show FALCON outperforms state-of-the-art normalizing flow models for molecular Boltzmann sampling and is two orders of magnitude faster than the equivalently performing CNF model.
