PolarStar: Expanding the Scalability Horizon of Diameter-3 Networks
Kartik Lakhotia, Laura Monroe, Kelly Isham, Maciej Besta, Nils Blach, Torsten Hoefler, Fabrizio Petrini
TL;DR
PolarStar targets scalable, low-diameter interconnects by using a star-product framework that combines a large diameter-2 structure graph with a diameter-3 supernode to create diameter-3 networks with unprecedented scale. By selecting ER_q as the structure graph (property $R$) and Inductive-Quad (IQ) as the supernode (property $R^*$), PolarStar achieves near-Moore-bound efficiency and outperforms existing diameter-3 topologies across radixes in the range $[8,128]$, including 1.3x over Bundlefly, 1.9x over Dragonfly, and 6.7x over 3-D HyperX on geometric means. The design supports a modular, bundlable layout and a rich design space with multiple configurations per radix, along with a routing approach that exploits the underlying graph properties to minimize state and latency. Through synthetic and real-world motif simulations, PolarStar demonstrates competitive or superior performance while delivering higher scalability and resilience to link failures, making it a practical candidate for future exascale and co-packaged HPC interconnects.
Abstract
We present PolarStar, a novel family of diameter-3 network topologies derived from the star product of low-diameter factor graphs. PolarStar gives the largest known diameter-3 network topologies for almost all radixes, thus providing the best known scalable diameter-$3$ network. Compared to current state-of-the-art diameter-$3$ networks, PolarStar achieves $1.3\times$ geometric mean increase in scale over Bundlefly, $1.9\times$ over Dragonfly, and $6.7\times$ over {3-D} HyperX. PolarStar has many other desirable properties, including a modular layout, large bisection, high resilience to link failures and a large number of feasible configurations for every radix. We give a detailed evaluation with simulations of synthetic and real-world traffic patterns and show that PolarStar exhibits comparable or better performance than current diameter-3 networks.
