Scaling Up Bayesian DAG Sampling
Daniele Nikzad, Alexander Zhilkin, Juha Harviainen, Jack Kuipers, Giusi Moffa, Mikko Koivisto
TL;DR
This work addresses scalable Bayesian learning of DAGs by accelerating Markov chain sampling in the space of graphs. It introduces Gibby, a fast basic-moves scheme that upper-bounds acceptance via an extended score and leverages geometric waiting times, along with efficient proposal generation and acyclicity checking, achieving substantial speedups over prior methods. It also develops epsilon-pruning to safely discard unlikely parent sets, drastically reducing the cost of moves that resample neighborhoods while preserving posterior accuracy within quantified bounds. Empirical evaluations show Gibby outperforming competing samplers on large networks and, when combined with pruning, enabling reliable posterior estimation where exact methods are infeasible. These techniques extend the practical reach of structure MCMC for Bayesian DAGs and have potential applications beyond DAG sampling in other combinatorial inference tasks.
Abstract
Bayesian inference of Bayesian network structures is often performed by sampling directed acyclic graphs along an appropriately constructed Markov chain. We present two techniques to improve sampling. First, we give an efficient implementation of basic moves, which add, delete, or reverse a single arc. Second, we expedite summing over parent sets, an expensive task required for more sophisticated moves: we devise a preprocessing method to prune possible parent sets so as to approximately preserve the sums. Our empirical study shows that our techniques can yield substantial efficiency gains compared to previous methods.
