relOBI: A Reliable Low-latency Interconnect for Tightly-Coupled On-chip Communication
Michael Rogenmoser, Angelo Garofalo, Luca Benini
TL;DR
This work targets soft errors in tightly coupled on-chip interconnects by introducing relOBI, an extension of the Open Bus Interface (OBI) that provides complete soft-error tolerance at low latency. By triplicating handshake signals (with voting) and ECC-protecting data and control signals, relOBI enables continuous operation and in-flight error correction within a crossbar interconnect. Fault-injection and synthesis show relOBI eliminates single-fault vulnerability (from 34.85% to zero) with a 2.6× area increase—significantly lower than fine-grained triplication—while incurring manageable timing overhead. The approach, demonstrated on a 6×8 crossbar and made open-source, offers a practical path to higher system reliability in radiation-prone environments, such as space-based SoCs, without prohibitive area or timing penalties.
Abstract
On-chip communication is a critical element of modern systems-on-chip (SoCs), allowing processor cores to interact with memory and peripherals. Interconnects require special care in radiation-heavy environments, as any soft error within the SoC interconnect is likely to cause a functional failure of the whole SoC. This work proposes relOBI, an extension to the Open Bus Interface (OBI) combining triple modular redundancy (TMR) for critical handshake signals with error correction codes (ECC) protection on other signals. Implementing and testing the reliable crossbar shows improved reliability to injected single faults from a vulnerability of 34.85 % to zero compared to the irredundant baseline, with an area increase of 2.6 $\times$. The area overhead is 1.8 $\times$ lower than that reported in the literature for fine-grained triplication and voting.
