On Generalization for Generative Flow Networks

Anas Krichel; Nikolay Malkin; Salem Lahlou; Yoshua Bengio

On Generalization for Generative Flow Networks

Anas Krichel, Nikolay Malkin, Salem Lahlou, Yoshua Bengio

TL;DR

This work formalizes generalization in Generative Flow Networks (GFlowNets) trained via the Trajectory Balance loss, linking trajectory probabilities to an unnormalized reward $R$ through $P_F$, $P_B$, and $Z$. It introduces a stability-focused perspective and establishes a bound showing small reward perturbations yield controlled changes in trajectory distributions under TB, with a concrete result for uniform $P_B$. The authors provide empirical evidence by hiding parts of the reward and comparing TB, DB, and FL-DB losses, finding that DB often generalizes better and that access to intermediate rewards (FL-DB) boosts generalization when available. Overall, the paper contributes theoretical generalization and stability frameworks for GFlowNets and demonstrates practical implications for designing robust training losses and policies across unseen regions of the reward landscape.

Abstract

Generative Flow Networks (GFlowNets) have emerged as an innovative learning paradigm designed to address the challenge of sampling from an unnormalized probability distribution, called the reward function. This framework learns a policy on a constructed graph, which enables sampling from an approximation of the target probability distribution through successive steps of sampling from the learned policy. To achieve this, GFlowNets can be trained with various objectives, each of which can lead to the model s ultimate goal. The aspirational strength of GFlowNets lies in their potential to discern intricate patterns within the reward function and their capacity to generalize effectively to novel, unseen parts of the reward function. This paper attempts to formalize generalization in the context of GFlowNets, to link generalization with stability, and also to design experiments that assess the capacity of these models to uncover unseen parts of the reward function. The experiments will focus on length generalization meaning generalization to states that can be constructed only by longer trajectories than those seen in training.

On Generalization for Generative Flow Networks

TL;DR

This work formalizes generalization in Generative Flow Networks (GFlowNets) trained via the Trajectory Balance loss, linking trajectory probabilities to an unnormalized reward

through

, and

. It introduces a stability-focused perspective and establishes a bound showing small reward perturbations yield controlled changes in trajectory distributions under TB, with a concrete result for uniform

. The authors provide empirical evidence by hiding parts of the reward and comparing TB, DB, and FL-DB losses, finding that DB often generalizes better and that access to intermediate rewards (FL-DB) boosts generalization when available. Overall, the paper contributes theoretical generalization and stability frameworks for GFlowNets and demonstrates practical implications for designing robust training losses and policies across unseen regions of the reward landscape.

Abstract

Paper Structure (16 sections, 4 theorems, 27 equations, 3 figures)

This paper contains 16 sections, 4 theorems, 27 equations, 3 figures.

Introduction
Our contributions.
Preliminaries and definitions
Definition
Trajectory Balance loss
Statistical definition of generalization for GFlowNet
Stability
Result concerning GFlowNets
Experiments
How we measure generalization
Observation
Examples of the learned distribution
Limits and future work
Appendix
Stability in supervised learning
...and 1 more sections

Key Result

proposition thmcounterproposition

We note for all $\tau$ in $\mathcal{T}$, $\mathcal{L}_{\rm TB}(R, f_{\theta}(\tau)) := \mathcal{L}_{\rm TB}(\tau,P_{F}^{\theta}, P_{B}^{\theta}, Z_{\theta}, R)$ when $P_F$ and $P_B$ vary in a fixed hypothesis set (typically neural networks): $f_{\theta}(\tau) = \frac{Z_{\theta}\prod_{i=1}^{n}P_F(s_{ where $\mathcal{R(G)}$ is the Rademacher complexity of $\mathcal{G}$ and $d$ is the pseudodimension

Figures (3)

Figure 1: Illustration of the process of hiding states in order to measure generalization
Figure 2: Tracking generalization of different losses and environments sizes.
Figure 3: Showing the challenge of out-of-distribution generalization

Theorems & Definitions (12)

definition thmcounterdefinition: A definition of generalization
proposition thmcounterproposition: Bound with i.i.d. assumption
proof : It is a direct application of the Rademacher properties for bounded losses see mohri2012foundations or bach
remark thmcounterremark
proposition thmcounterproposition: Beyond i.i.d. assumption
proof
definition thmcounterdefinition: Stability for GFlowNets
proposition thmcounterproposition
proof
lemma thmcounterlemma
...and 2 more

On Generalization for Generative Flow Networks

TL;DR

Abstract

On Generalization for Generative Flow Networks

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (12)