Optimal Bounds for Private Minimum Spanning Trees via Input Perturbation
Rasmus Pagh, Lukas Retschmeier, Hao Wu, Hanwen Zhang
TL;DR
The paper addresses privately releasing a minimum spanning tree under edge-weight differential privacy with an $\ell_\infty$-neighboring relation. It introduces an input-perturbation reduction: perturb all edge weights with exponential noise and run a non-private MST, achieving an $(\varepsilon,\delta)$-DP mechanism whose utility matches the best known non-private MST bounds up to logarithmic factors. The approach yields an $\tilde{O}(n^{3/2})$ excess weight bound with near-linear time, and a matching $\tilde{\Omega}(n^{3/2})$ lower bound is established via a reduction to private top-$k$ selection, demonstrating near-optimality. Empirical results corroborate practicality and show favorable performance relative to input-privatization methods, particularly on denser graphs. The work thus provides the first framework achieving the best of both worlds: efficient private MST computation with optimal or near-optimal utility guarantees, and a concrete lower bound under approximate DP.
Abstract
We study the problem of privately releasing an approximate minimum spanning tree (MST). Given a graph $G = (V, E, \vec{W})$ where $V$ is a set of $n$ vertices, $E$ is a set of $m$ undirected edges, and $ \vec{W} \in \mathbb{R}^{|E|} $ is an edge-weight vector, our goal is to publish an approximate MST under edge-weight differential privacy, as introduced by Sealfon in PODS 2016, where $V$ and $E$ are considered public and the weight vector is private. Our neighboring relation is $\ell_\infty$-distance on weights: for a sensitivity parameter $Δ_\infty$, graphs $ G = (V, E, \vec{W}) $ and $ G' = (V, E, \vec{W}') $ are neighboring if $\|\vec{W}-\vec{W}'\|_\infty \leq Δ_\infty$. Existing private MST algorithms face a trade-off, sacrificing either computational efficiency or accuracy. We show that it is possible to get the best of both worlds: With a suitable random perturbation of the input that does not suffice to make the weight vector private, the result of any non-private MST algorithm will be private and achieves a state-of-the-art error guarantee. Furthermore, by establishing a connection to Private Top-k Selection [Steinke and Ullman, FOCS '17], we give the first privacy-utility trade-off lower bound for MST under approximate differential privacy, demonstrating that the error magnitude, $\tilde{O}(n^{3/2})$, is optimal up to logarithmic factors. That is, our approach matches the time complexity of any non-private MST algorithm and at the same time achieves optimal error. We complement our theoretical treatment with experiments that confirm the practicality of our approach.
