Enhancing Solution Efficiency in Reinforcement Learning: Leveraging Sub-GFlowNet and Entropy Integration
Siyi He
TL;DR
This work enhances GFlowNet-based reinforcement learning by integrating network-structure awareness through sub-GFlowNet and an entropy-weighted loss. By decomposing the loss across subgraphs and weighting by subflow entropy, the approach aims to improve candidate diversity and training efficiency. Empirical results in hypergrid environments show accelerated convergence and better alignment with the target distribution, while molecule synthesis tasks reveal a favorable trade-off between rapid reward attainment and diversity, depending on the objective. The study advances flow-based generative modeling by linking DAG structure to training dynamics and suggesting future directions for scalable, conditional sub-GFlowNets.
Abstract
Traditional reinforcement learning often struggles to generate diverse, high-reward solutions, especially in domains like drug design and black-box function optimization. Markov Chain Monte Carlo (MCMC) methods provide an alternative method of RL in candidate selection but suffer from high computational costs and limited candidate diversity exploration capabilities. In response, GFlowNet, a novel neural network architecture, was introduced to model complex system dynamics and generate diverse high-reward trajectories. To further enhance this approach, this paper proposes improvements to GFlowNet by introducing a new loss function and refining the training objective associated with sub-GFlowNet. These enhancements aim to integrate entropy and leverage network structure characteristics, improving both candidate diversity and computational efficiency. We demonstrated the superiority of the refined GFlowNet over traditional methods by empirical results from hypergrid experiments and molecule synthesis tasks. The findings underscore the effectiveness of incorporating entropy and exploiting network structure properties in solution generation in molecule synthesis as well as diverse experimental designs.
