Table of Contents
Fetching ...

Massively Parallel Minimum Spanning Tree in General Metric Spaces

Amir Azarmehr, Soheil Behnezhad, Rajesh Jayaram, Jakub Łącki, Vahab Mirrokni, Peilin Zhong

TL;DR

This work proves that in the strictly sublinear MPC regime, MST in general metric spaces can be computed to a (1+ε) approximation in O(log(1/ε) + log log n) rounds, using per-machine space n^δ and near-linear total space. The algorithm combines an MPX-based low-diameter partition hierarchy with a Borůvka-style merge that avoids constructing all intermediate partitions, achieving subloground complexity previously known only for specialized metric settings. A matching conditional lower bound under the 1vs2-Cycle conjecture shows that the ε-dependence is optimal and that no o(log(1/ε))-round sublogarithmic MPC algorithm exists for general metrics, even for (1,2)-metrics, unless the conjecture is false. The results extend to TSP via known reductions, and they advance understanding of MPC lower bounds beyond component-stable models by introducing a random relabeling technique. Overall, the paper delivers a near-optimal sublogarithmic MPC MST algorithm in general metric spaces and a robust conditional lower bound, with broad implications for distributed graph optimization in MPC.

Abstract

We study the minimum spanning tree (MST) problem in the massively parallel computation (MPC) model. Our focus is particularly on the *strictly sublinear* regime of MPC where the space per machine is $O(n^δ)$. Here $n$ is the number of vertices and constant $δ\in (0, 1)$ can be made arbitrarily small. The MST problem admits a simple and folklore $O(\log n)$-round algorithm in the MPC model. When the weights can be arbitrary, this matches a conditional lower bound of $Ω(\log n)$ which follows from a well-known 1vs2-Cycle conjecture. As such, much of the literature focuses on breaking the logarithmic barrier in more structured variants of the problem, such as when the vertices correspond to points in low- [ANOY14, STOC'14] or high-dimensional Euclidean spaces [JMNZ, SODA'24]. In this work, we focus more generally on metric spaces. Namely, all pairwise weights are provided and guaranteed to satisfy the triangle inequality, but are otherwise unconstrained. We show that for any $\varepsilon > 0$, a $(1+\varepsilon)$-approximate MST can be found in $O(\log \frac{1}{\varepsilon} + \log \log n)$ rounds, which is the first $o(\log n)$-round algorithm for finding any constant approximation in this setting. Other than being applicable to more general weight functions, our algorithm also slightly improves the $O(\log \log n \cdot \log \log \log n)$ round-complexity of [JMNZ24, SODA'24] and significantly improves its approximation from a large constant to $1+\varepsilon$. On the lower bound side, we prove that under the 1vs2-Cycle conjecture, $Ω(\log \frac{1}{\varepsilon})$ rounds are needed for finding a $(1+\varepsilon)$-approximate MST in general metrics. It is worth noting that while many existing lower bounds in the MPC model under the 1vs2-Cycle conjecture only hold against "component stable" algorithms, our lower bound applies to *all* algorithms.

Massively Parallel Minimum Spanning Tree in General Metric Spaces

TL;DR

This work proves that in the strictly sublinear MPC regime, MST in general metric spaces can be computed to a (1+ε) approximation in O(log(1/ε) + log log n) rounds, using per-machine space n^δ and near-linear total space. The algorithm combines an MPX-based low-diameter partition hierarchy with a Borůvka-style merge that avoids constructing all intermediate partitions, achieving subloground complexity previously known only for specialized metric settings. A matching conditional lower bound under the 1vs2-Cycle conjecture shows that the ε-dependence is optimal and that no o(log(1/ε))-round sublogarithmic MPC algorithm exists for general metrics, even for (1,2)-metrics, unless the conjecture is false. The results extend to TSP via known reductions, and they advance understanding of MPC lower bounds beyond component-stable models by introducing a random relabeling technique. Overall, the paper delivers a near-optimal sublogarithmic MPC MST algorithm in general metric spaces and a robust conditional lower bound, with broad implications for distributed graph optimization in MPC.

Abstract

We study the minimum spanning tree (MST) problem in the massively parallel computation (MPC) model. Our focus is particularly on the *strictly sublinear* regime of MPC where the space per machine is . Here is the number of vertices and constant can be made arbitrarily small. The MST problem admits a simple and folklore -round algorithm in the MPC model. When the weights can be arbitrary, this matches a conditional lower bound of which follows from a well-known 1vs2-Cycle conjecture. As such, much of the literature focuses on breaking the logarithmic barrier in more structured variants of the problem, such as when the vertices correspond to points in low- [ANOY14, STOC'14] or high-dimensional Euclidean spaces [JMNZ, SODA'24]. In this work, we focus more generally on metric spaces. Namely, all pairwise weights are provided and guaranteed to satisfy the triangle inequality, but are otherwise unconstrained. We show that for any , a -approximate MST can be found in rounds, which is the first -round algorithm for finding any constant approximation in this setting. Other than being applicable to more general weight functions, our algorithm also slightly improves the round-complexity of [JMNZ24, SODA'24] and significantly improves its approximation from a large constant to . On the lower bound side, we prove that under the 1vs2-Cycle conjecture, rounds are needed for finding a -approximate MST in general metrics. It is worth noting that while many existing lower bounds in the MPC model under the 1vs2-Cycle conjecture only hold against "component stable" algorithms, our lower bound applies to *all* algorithms.
Paper Structure (20 sections, 14 theorems, 33 equations, 5 figures, 1 table, 2 algorithms)

This paper contains 20 sections, 14 theorems, 33 equations, 5 figures, 1 table, 2 algorithms.

Key Result

Theorem 1

Given a metric, for any fixed $\delta \in (0, 1)$ and any $\varepsilon > 0$, a $(1 + \varepsilon)$-approximate MST can be computed in $O\left(\log \frac{1}{\varepsilon} + \log \log n\right)$ rounds of the MPC model, with $O(n^\delta)$ space per machine and $\widetilde{\Theta}(n^2)$ total space which

Figures (5)

  • Figure 1: A two-switch on edges $(u, v)$ and $(u', v')$, replaces them with $(u, u')$ and $(v, v')$. Here, the two-switch breaks the cycle into two cycles. See also \ref{['def:two-switch']} and \ref{['clm:two-switch']}.
  • Figure 2: An example of breaking down a cycle of length $L = 15$ into $\frac{1}{\varepsilon} = 3$ cycles of length $\varepsilon L = 5$. To break down the cycle (a), a new graph (b) is constructed on the same vertex set, where each vertex is connected to the two vertices at distance $\frac{1}{\varepsilon}$, e.g. $v_2$ is connected to $v_{14}$ and $v_{5}$. Given the subpath $v_1$-$v_2$-$v_3$, and the order of the vertices in each cycle of (b), the order of the vertices in (a) can be constructed as follows. Start the order with the first vertex of each cycle order, i.e. $v_1, v_2, v_3$. Then, add the second vertex of each cycle to the order, i.e. $v_4, v_5, v_6$. Then, the third vertex of each cycle, and so on.
  • Figure 3: A possible outcome of a two-switch. (b) is obtained from (a) by doing a two-switch on $(v_1, v_2)$ and $(v_{16}, v_{15})$. Note that $v_1$ and $v_{16}$ are on different cycles here (case \ref{['item:two-switch-two-cycles']} in \ref{['clm:cycle-breakdown']}).
  • Figure 4: Two possible outcomes of a two-switch. (b) is obtained from (a) by doing a two-switch on $(v_1, v_2)$ and $(v_5, v_6)$. Note that $v_6$ and $v_2$ are on different paths from $v_1$ to $v_5$ (case \ref{['item:two-switch-one-cycle-different-paths']} in \ref{['clm:cycle-breakdown']}). (c) is obtained from (a) by doing a two-switch on $(v_1, v_2)$ and $(v_6, v_5)$. Observe that $v_5$ and $v_2$, are on the same path from $v_1$ to $v_6$ (case \ref{['item:two-switch-one-cycle-same-paths']} in \ref{['clm:cycle-breakdown']}).
  • Figure :

Theorems & Definitions (66)

  • Conjecture 1: The 1vs2-Cycle Conjecture
  • Theorem 1
  • Remark 1
  • Theorem 2
  • Definition 3.1
  • Definition 3.2
  • Proposition 3.3: MillerPX13
  • Lemma 4.1
  • Claim 4.2
  • proof
  • ...and 56 more