Ant Colony Sampling with GFlowNets for Combinatorial Optimization
Minsu Kim, Sanghyeok Choi, Hyeonah Kim, Jiwoo Son, Jinkyoo Park, Yoshua Bengio
TL;DR
The paper tackles combinatorial optimization by addressing limitations of reward-focused RL pretraining. It proposes GFACS, a two-layer approach that first learns a multi-modal prior via GFlowNets and then refines it with posterior search using Ant Colony Optimization, enhanced by energy reshaping and off-policy TB training. The method achieves strong performance across seven CO benchmarks, outperforming vanilla ACO, several RL baselines, and other GFlowNet training approaches, and it improves when paired with active search. This approach offers a modular, scalable framework that balances diversity and optimality, with practical implications for large-scale, constraint-aware CO problems, while leaving theoretical guarantees for future work.
Abstract
We present the Generative Flow Ant Colony Sampler (GFACS), a novel meta-heuristic method that hierarchically combines amortized inference and parallel stochastic search. Our method first leverages Generative Flow Networks (GFlowNets) to amortize a \emph{multi-modal} prior distribution over combinatorial solution space that encompasses both high-reward and diversified solutions. This prior is iteratively updated via parallel stochastic search in the spirit of Ant Colony Optimization (ACO), leading to the posterior distribution that generates near-optimal solutions. Extensive experiments across seven combinatorial optimization problems demonstrate GFACS's promising performances.
