Sublinear-Time Sampling of Spanning Trees in the Congested Clique
Sriram V. Pemmaraju, Sourya Roy, Joshua Z. Sobel
TL;DR
The paper presents the first sublinear-round algorithm for approximately sampling a uniform spanning tree in the Congested Clique, achieving $ ilde{O}(n^{1/2+\alpha})$ rounds to within total variation $\epsilon=\Omega(1/n^{c})$ (for any fixed $c>0$) by combining a top-down random-walk filling strategy with Schur-complement shortcut graphs and a compression-based reconstruction via weighted perfect matchings. It leverages distributed matrix multiplication (with exponent $\alpha$ currently $0.157$) to build and manipulate the derivative graphs, enabling $O(\sqrt{n})$ phases each costing $\tilde{O}(n^{\alpha})$ rounds, and thus yields a sublinear-time sampler. The authors also show how to achieve shorter walks efficiently, obtaining $O(\log^3 n)$-round sampling for graphs with cover times $O(n \log n)$, and provide a pathway to exact sampling at a higher but still sublinear runtime $\tilde{O}(n^{2/3+\alpha})$. The work connects classic Aldous–Broder random-walk ideas with modern distributed-graph techniques (Schur complements, shortcut graphs, and permanent-based sampling), offering new avenues for distributed sampling of combinatorial objects and potentially impacting related problems in MPC and other distributed models.
Abstract
We present the first sublinear-in-$n$ round algorithm for sampling an approximately uniform spanning tree of an $n$-vertex graph in the CongestedClique model of distributed computing. In particular, our algorithm requires $\Tilde{O}(n^{0.657})$ rounds for sampling a spanning tree within total variation distance $1/n^c$, for arbitrary constant $c > 0$, from the uniform distribution. More precisely, our algorithm requires $\Tilde{O}(n^{1/2 + α})$ rounds, where $O(n^α)$ is the running time of matrix multiplication in the CongestedClique model (currently $α= 1 - 2/ω= 0.157$, where $ω$ is the sequential matrix multiplication time exponent). We can adapt our algorithm to give exact rather than approximate samples, but with a larger, though still $o(n)$, runtime of $\Tilde{O}(n^{2/3+α}) = O(n^{.824})$. In a remarkable result, Aldous (SIDM 1990) and Broder (FOCS 1989) showed that the first visit edge to each vertex, excluding the start vertex, during a random walk forms a uniformly chosen spanning tree of the underlying graph. Our algorithm is a significant departure from known techniques, featuring a top-down walk filling approach paired with Schur complement graphs for walk shortcutting. To make this idea work in the CongestedClique model, we present a novel compressed random walk reconstruction algorithm, based on randomly sampling a weighted perfect matching. In addition, we show how to take somewhat shorter random walks even more efficiently in the CongestedClique model, obtaining an $O(\log^3 n)$-round algorithm for uniformly sampling spanning trees from graphs with $O(n\log n)$ cover times. These results are obtained by adding a load balancing component to the random walk algorithm of Bahmani, Chakrabarti and Xin (SIGMOD 2011) that uses the bottom-up ``doubling'' technique.
