To Reconfigure or Not to Reconfigure: Optimizing All-to-All Collectives in Circuit-Switched Photonic Interconnects
Anchengcheng Zhou, Vamsi Addanki, Maria Apostolaki
TL;DR
This work tackles the challenge of optimizing all-to-all communication on circuit-switched photonic interconnects by jointly selecting topology sequences and flow schedules while accounting for realistic reconfiguration costs. It introduces a matrix-based abstraction that expresses strategies as decompositions of the traffic matrix into sums of adjacency matrices and their powers, enabling a closed-form TotalCost and a lower bound on the optimum. By identifying a region of highly symmetric, high-expansion topologies and proposing a simple, contention-free scheduling approach, the method constructs near-optimal strategies without exhaustive search and demonstrates substantial gains. Empirical evaluation shows up to 44% reduction in completion time across diverse network sizes, topologies, and workloads, with a clear characterization of when reconfiguration is most beneficial and a quantified optimality gap.
Abstract
All-to-all collective communication is a core primitive in distributed machine learning and high-performance computing. At the server scale, the communication demands of these workloads are increasingly outstripping the bandwidth and energy limits of electrical interconnects, driving a growing interest in photonic interconnects. However, leveraging these interconnects for all-to-all communication is nontrivial. The core challenge lies in jointly optimizing a sequence of topologies and flow schedules, reconfiguring only when the transmission savings from traversing shorter paths outweigh the reconfiguration cost. Yet the search space of this joint optimization is enormous. Existing work sidesteps this challenge by making unrealistic assumptions on reconfiguration costs so that it is never or always worthwhile to reconfigure. In this paper, we show that any candidate sequence of topologies and flow schedules can be expressed as a sum of adjacency matrices and their powers. This abstraction captures the entire solution space and yields a lower bound on all-to-all completion time. Building on this formulation, we identify a family of topology sequences with strong symmetry and high expansion that admits bandwidth-efficient schedules, which our algorithm constructs with low computational overhead. Together, these insights allow us to efficiently construct near-optimal solutions, effectively avoiding enumeration of the combinatorial design space. Evaluation shows that our approach reduces all-to-all completion time by up to 44% on average across a wide range of network parameters, message sizes and workload types.
