Table of Contents
Fetching ...

A Highly Scalable LLM Clusters with Optical Interconnect

Xinchi Han, Yongxi Lv, Weihao Jiang, Shuyuan Zhang, Yingming Mao, Shizhen Zhao, ZhuoRan Liu, Zhuotao Liu, Peirui Cao, Ximeng Liu, Xinbing Wang

TL;DR

The paper tackles the challenge of designing an optimal physical topology for optical circuit switch (OCS)-based GPU clusters, aiming to maximize logical topology compatibility, cluster scalability, and topology engineering polynomial solvability. It introduces the Symmetric Integer Matrix Decomposition Theorem to reduce the ToE problem to a polynomial-time minimum-cost flow formulation and proposes Cross Wiring as a concrete topology that achieves full compatibility, full scalability, and polynomial solvability. The authors prove the optimality of Cross Wiring, discuss online ToE feasibility, and validate the approach through a 128-NPU testbed and large-scale trace-based simulations, showing up to 28.3% throughput gains and substantial reductions in maximum link utilization. The work provides a universal, hardware-agnostic pattern for future OCS architectures and demonstrates practical performance benefits for high-scale AI training in OC-based data centers.

Abstract

Recent years have witnessed the adoption of optical circuit switch (OCS) technology. How to design the physical topology, defined by the physical wiring between electrical switching equipments and the OCS, is fundamental to designing efficient OCS-based clusters. We identify three features to evaluate the quality of a physical topology design: logical topology compatibility, cluster scalability, and topology engineering polynomial-solvability. However, none of existing physical topologies has achieved these three features simultaneously. This paper explores designing an optimal physical topology that simultaneously maximizes all. We begin by analyzing the importance of these features in OCS-based cluster and examine the limitations of current designs. Leveraging a proposed \emph{Symmetric Integer Matrix Decomposition Theorem}, we outline a general approach for designing optimal physical topologies and introduce \textbf{Cross Wiring} as a concrete instantiation. The feasibility and advantages of Cross Wiring are verified through a 128-NPU testbed and large-scale real-trace-based simulations.

A Highly Scalable LLM Clusters with Optical Interconnect

TL;DR

The paper tackles the challenge of designing an optimal physical topology for optical circuit switch (OCS)-based GPU clusters, aiming to maximize logical topology compatibility, cluster scalability, and topology engineering polynomial solvability. It introduces the Symmetric Integer Matrix Decomposition Theorem to reduce the ToE problem to a polynomial-time minimum-cost flow formulation and proposes Cross Wiring as a concrete topology that achieves full compatibility, full scalability, and polynomial solvability. The authors prove the optimality of Cross Wiring, discuss online ToE feasibility, and validate the approach through a 128-NPU testbed and large-scale trace-based simulations, showing up to 28.3% throughput gains and substantial reductions in maximum link utilization. The work provides a universal, hardware-agnostic pattern for future OCS architectures and demonstrates practical performance benefits for high-scale AI training in OC-based data centers.

Abstract

Recent years have witnessed the adoption of optical circuit switch (OCS) technology. How to design the physical topology, defined by the physical wiring between electrical switching equipments and the OCS, is fundamental to designing efficient OCS-based clusters. We identify three features to evaluate the quality of a physical topology design: logical topology compatibility, cluster scalability, and topology engineering polynomial-solvability. However, none of existing physical topologies has achieved these three features simultaneously. This paper explores designing an optimal physical topology that simultaneously maximizes all. We begin by analyzing the importance of these features in OCS-based cluster and examine the limitations of current designs. Leveraging a proposed \emph{Symmetric Integer Matrix Decomposition Theorem}, we outline a general approach for designing optimal physical topologies and introduce \textbf{Cross Wiring} as a concrete instantiation. The feasibility and advantages of Cross Wiring are verified through a 128-NPU testbed and large-scale real-trace-based simulations.

Paper Structure

This paper contains 39 sections, 6 theorems, 25 equations, 18 figures, 5 tables, 1 algorithm.

Key Result

Theorem 2.1

For Uniform Wiring physical topology, the ToE problem is NP-Complete.

Figures (18)

  • Figure 1: With Cross Wiring (\ref{['fig:illu_l2_protocal_incompatibility']}), any logical topology can be realized with symmetry constraint. Under symmetry constraint, if the Tx of port A is connected to the Rx of port B, then the Tx of port B must be connected to the Rx of port A.
  • Figure 2: Different components are paired to create mirror-symmetric sub-topologies
  • Figure 3: Widely adopted Uniform Wiring is suboptimal
  • Figure 4: Cross Wiring without Circulators
  • Figure 5: Cross Wiring can ensure an optimal Logical Topology Realization Rate.
  • ...and 13 more figures

Theorems & Definitions (9)

  • Definition 1: Optimal Physical Topology
  • Theorem 2.1: Uniform Wiring ToE NP-Complete
  • Theorem 3.1: Integer Matrix Decomposition
  • Theorem 3.2: Symmetric Integer Matrix Decomposition
  • Theorem 3.3: Polynomial Solvable Physical Topologies
  • Definition 2: Logical Topology
  • Theorem 4.1: Compatibility and Scalability of Cross Wiring
  • Theorem 5.1: MDMCF is Optimal when $N_{\text{ocs}} =2$
  • Definition 3: MinRewiring Achievement Rate (MRAR)