Randomized sequential importance sampling for estimating the number of perfect matchings in bipartite graphs
Persi Diaconis, Brett Kolesnik
TL;DR
This work introduces randomized sequential importance sampling methods to estimate the number of perfect matchings in bipartite graphs, and develops central limit theorems for the log-likelihood ratio under several sampling orders. By analyzing Fibonacci, 2-Fibonacci, and distance-2 matchings, it derives explicit mean and variance growth rates and demonstrates how KL-based sample-size criteria $N^*$ can drastically reduce the number of samples needed compared to classical variance-based criteria. The results include CLTs for random, fixed, and greedy order algorithms, the construction of almost-perfect variants with constant variance, and quantitative comparisons with Markov chain Monte Carlo approaches in practical regimes. Collectively, the paper provides a framework for efficient, theory-backed approximation of #P-complete counting problems in structured bipartite graphs. The demonstrated concentration phenomena and exact asymptotics offer guidance for applying importance sampling to related counting and sampling tasks, such as contingency tables and degree-sequence graphs.
Abstract
We introduce and study randomized sequential importance sampling algorithms for estimating the number of perfect matchings in bipartite graphs. In analyzing their performance, we establish various non-standard central limit theorems. We expect our methods to be useful for other applied problems.
