Distributional GFlowNets with Quantile Flows
Dinghuai Zhang, Ling Pan, Ricky T. Q. Chen, Aaron Courville, Yoshua Bengio
TL;DR
This work identifies a limitation of standard GFlowNets in handling stochastic rewards and uncertainty. It introduces Distributional GFlowNets by modeling edge/state flows as distributions and parameterizing their quantile functions with Quantile Matching (QM), enabling risk-sensitive policies via distortion risk measures. QM provides a principled training objective that improves signal quality and generalization, yielding superior performance on deterministic benchmarks and robust behavior in stochastic settings such as risky hypergrid tasks, sequence generation, and molecule optimization. The approach broadens the applicability of GFlowNets to real-world, uncertainty-prone domains, with potential impact on areas like drug discovery and complex combinatorial generation.
Abstract
Generative Flow Networks (GFlowNets) are a new family of probabilistic samplers where an agent learns a stochastic policy for generating complex combinatorial structure through a series of decision-making steps. Despite being inspired from reinforcement learning, the current GFlowNet framework is relatively limited in its applicability and cannot handle stochasticity in the reward function. In this work, we adopt a distributional paradigm for GFlowNets, turning each flow function into a distribution, thus providing more informative learning signals during training. By parameterizing each edge flow through their quantile functions, our proposed \textit{quantile matching} GFlowNet learning algorithm is able to learn a risk-sensitive policy, an essential component for handling scenarios with risk uncertainty. Moreover, we find that the distributional approach can achieve substantial improvement on existing benchmarks compared to prior methods due to our enhanced training algorithm, even in settings with deterministic rewards.
