The BrainScaleS-2 multi-chip system: Interconnecting continuous-time neuromorphic compute substrates
Joscha Ilmberger, Johannes Schemmel
TL;DR
The paper tackles scaling continuous-time neuromorphic substrates beyond single chips by introducing an FPGA-based interconnect architecture for the BrainScaleS-2 system. It presents a routing-enabled multi-chip design with an Aggregator unit and per-chip Node-FPGAs that forms a star topology within a standard rack, achieving deterministic spike latencies and scalable connectivity. Key results include sub-1.3 µs backplane latencies, BER around 1e-15 at 5 Gbps links, and the ability to interconnect up to roughly 120 chips (≈61k neurons, ≈15M synapses) with modest additional latency. The work lays a scalable path toward large-scale analog SNNs and informs future systems with direct ASIC interconnects for higher density and efficiency, enabling large-scale training and experimentation in neuromorphic computing.
Abstract
The BrainScaleS-2 SoC integrates analog neuron and synapse circuits with digital periphery, including two CPUs with SIMD extensions. Each ASIC is connected to a Node-FPGA, providing experiment control and Ethernet connectivity. This work details the scaling of the compute substrate through FPGA-based interconnection via an additional Aggregator unit. The Aggregator provides up to 12 transceiver links to a backplane of Node-FPGAs, as well as 4 transceiver lanes for further extension. Two such interconnected backplanes are integrated into a standard 19in rack case with 4U height together with an Ethernet switch, system controller and power supplies. For all spike rates, chip-to-chip latencies -- consisting of four hops across three FPGAs -- below 1.3$μ$s are achieved within each backplane.
