Online Training and Inference System on Edge FPGA Using Delayed Feedback Reservoir
Sosei Ikeda, Hiromitsu Awano, Takashi Sato
TL;DR
The paper tackles the challenge of online training for delayed feedback reservoirs (DFRs) on edge hardware by introducing a modular DFR that supports backpropagation-based reservoir optimization, a truncated backpropagation scheme to curb memory use, and an in-place Ridge regression method via 1-D Cholesky decomposition. These innovations collectively enable real-time online training and inference on FPGA, achieving substantial gains in speed (up to ~1/13 of software time) and energy (up to ~1/27) while maintaining accuracy comparable to grid-search baselines. A fast parameter optimization method reduces the previously prohibitive grid-search cost by about 700×, and the 1-D Cholesky-based Ridge regression reduces memory by roughly 75% without sacrificing performance. The work demonstrates the viability of end-to-end online edge processing for time-series tasks using DFRs, with clear implications for predictive maintenance and other real-time edge applications.
Abstract
A delayed feedback reservoir (DFR) is a hardwarefriendly reservoir computing system. Implementing DFRs in embedded hardware requires efficient online training. However, two main challenges prevent this: hyperparameter selection, which is typically done by offline grid search, and training of the output linear layer, which is memory-intensive. This paper introduces a fast and accurate parameter optimization method for the reservoir layer utilizing backpropagation and gradient descent by adopting a modular DFR model. A truncated backpropagation strategy is proposed to reduce memory consumption associated with the expansion of the recursive structure while maintaining accuracy. The computation time is significantly reduced compared to grid search. Additionally, an in-place Ridge regression for the output layer via 1-D Cholesky decomposition is presented, reducing memory usage to be 1/4. These methods enable the realization of an online edge training and inference system of DFR on an FPGA, reducing computation time by about 1/13 and power consumption by about 1/27 compared to software implementation on the same board.
