Spanning Trees and Redistricting: New Methods for Sampling and Validation
Sarah Cannon, Moon Duchin, Dana Randall, Parker Rule
TL;DR
This work advances neutral baseline sampling for redistricting by formalizing a spanning-tree distribution $\pi(P) \propto \prod_i N_{\sf ST}(P_i)$ over graph partitions and introducing RevReCom, a reversible recombination Markov chain that targets $\pi$ under exact and approximate population balance. It provides rigorous detailed-balance proofs for exact balance and practical extensions to $\epsilon$-balanced partitions, along with a high-performance Rust implementation and parallelization strategy. Through extensive benchmarking on a $7\times7$ grid and on real-world-like state graphs (PA, VA), the authors demonstrate that RevReCom, Forest ReCom, and SMC yield convergent, comparable null models, with RevReCom scaling effectively to larger district counts. The results establish spanning-tree-based ensembles as a robust, policy-relevant tool for evaluating gerrymandering, while offering open-source software and diagnostics for convergence and cross-validation in legal and legislative contexts.
Abstract
Deciding whether a political districting plan was distorted by a hidden agenda, or whether it dilutes the voting power of some group, requires a neutral baseline for comparison. Remarkably, all nine U.S. Supreme Court justices have now signed on to decisions that find that computational methods can provide key evidence. Today, the leading approaches for benchmarking districting plans are based on the use of spanning trees for sampling graph partitions. We present a new *reversible recombination* algorithm and rigorously prove its fundamental properties. Furthermore, we argue for a canonical sampling distribution called the *spanning tree distribution* that is well adapted to redistricting and provides a principled foundation for comparing and validating methods. Together with a highly efficient (and open-source) implementation that can generate and handle large datasets, this work provides the most powerful null model to date for the gerrymandering problem, meeting an urgent democratic challenge with sound scientific methodology.
