A Theory of Non-Acyclic Generative Flow Networks
Leo Maxime Brunswic, Yinchuan Li, Yushun Xu, Shangling Jui, Lizhuang Ma
TL;DR
This work generalizes Generative Flows (GFlowNets) from DAGs to measurable spaces, addressing cycles and continuous-state challenges by introducing a measure-theoretic framework and a notion of 0-flows. It shows that conventional training losses can become unstable in non-acyclic settings and develops a family of stable losses with stabilizing regularization, framed as generalized divergences over state/edge/path measures. Theoretical results characterize stability conditions and sampling behavior, while experiments on hypergrids, Cayley graphs, and continuous control tasks validate that stable losses prevent runaway flows and improve convergence to reward-proportional distributions. The approach broadens the applicability of GFlowNets, offering practical guidance for training in spaces with cycles and continuous components, with implications for MCMC-like sampling and policy-based generation.
Abstract
GFlowNets is a novel flow-based method for learning a stochastic policy to generate objects via a sequence of actions and with probability proportional to a given positive reward. We contribute to relaxing hypotheses limiting the application range of GFlowNets, in particular: acyclicity (or lack thereof). To this end, we extend the theory of GFlowNets on measurable spaces which includes continuous state spaces without cycle restrictions, and provide a generalization of cycles in this generalized context. We show that losses used so far push flows to get stuck into cycles and we define a family of losses solving this issue. Experiments on graphs and continuous tasks validate those principles.
