Towards Generalized On-Chip Communication for Programmable Accelerators in Heterogeneous Architectures
Joseph Zuckerman, John-David Wellman, Ajay Vanamali, Manish Shankar, Gabriele Tombesi, Karthik Swaminathan, Kevin Lee, Mohit Kapur, Robert Philhower, Pradip Bose, Luca P. Carloni
TL;DR
The paper tackles the challenge of flexible, efficient on-chip communication for programmable accelerators in heterogeneous SoCs. It introduces hardware-based ESP enhancements including flexible per-burst P2P, a multicast NoC, coherence-based synchronization, an expanded accelerator interface, and an ISA extension for DMA control. The contributions are demonstrated through FPGA-based validation, showing modest area overheads (especially for multicast) and substantial speedups (up to 203% with 16 destinations on large data workloads). The work enables tighter data forwarding and synchronization across accelerators with minimal changes to accelerators and plans for integration into the ESP mainline release, benefiting future heterogeneous architectures with programmable accelerators.
Abstract
We present several enhancements to the open-source ESP platform to support flexible and efficient on-chip communication for programmable accelerators in heterogeneous SoCs. These enhancements include 1) a flexible point-to-point communication mechanism between accelerators, 2) a multicast NoC that supports data forwarding to multiple accelerators simultaneously, 3) accelerator synchronization leveraging the SoC's coherence protocol, 4) an accelerator interface that offers fine-grained control over the communication mode used, and 5) an example ISA extension to support our enhancements. Our solution adds negligible area to the SoC architecture and requires minimal changes to the accelerators themselves. We have validated most of these features in complex FPGA prototypes and plan to include them in the open-source release of ESP in the coming months.
