Benchmarking Neural Decoding Backbones towards Enhanced On-edge iBCI Applications
Zhou Zhou, Guohang He, Zheng Zhang, Luziwei Leng, Qinghai Guo, Jianxing Liao, Xuan Song, Ran Cheng
TL;DR
This work addresses the challenge of running neural decoding on edge devices for everyday iBCI usage by systematically benchmarking four backbones—GRU, Transformer, RWKV, and Mamba—on a nonhuman primate random-reach task. The study demonstrates that RWKV and Mamba provide faster inference and calibration with favorable scaling, while GRU remains competitive in some scenarios but suffers from slower processing and limited scalability, and Transformer tends to be less tractable on edge hardware. Scaling analyses show RWKV and Mamba improve with increased model size up to several million parameters, suggesting strong potential for future edge deployments as data availability grows. The findings offer practical guidance for selecting edge-friendly neural decoding backbones and highlight avenues for further research in online learning and cross-task/adaptive decoding.
Abstract
Traditional invasive Brain-Computer Interfaces (iBCIs) typically depend on neural decoding processes conducted on workstations within laboratory settings, which prevents their everyday usage. Implementing these decoding processes on edge devices, such as the wearables, introduces considerable challenges related to computational demands, processing speed, and maintaining accuracy. This study seeks to identify an optimal neural decoding backbone that boasts robust performance and swift inference capabilities suitable for edge deployment. We executed a series of neural decoding experiments involving nonhuman primates engaged in random reaching tasks, evaluating four prospective models, Gated Recurrent Unit (GRU), Transformer, Receptance Weighted Key Value (RWKV), and Selective State Space model (Mamba), across several metrics: single-session decoding, multi-session decoding, new session fine-tuning, inference speed, calibration speed, and scalability. The findings indicate that although the GRU model delivers sufficient accuracy, the RWKV and Mamba models are preferable due to their superior inference and calibration speeds. Additionally, RWKV and Mamba comply with the scaling law, demonstrating improved performance with larger data sets and increased model sizes, whereas GRU shows less pronounced scalability, and the Transformer model requires computational resources that scale prohibitively. This paper presents a thorough comparative analysis of the four models in various scenarios. The results are pivotal in pinpointing an optimal backbone that can handle increasing data volumes and is viable for edge implementation. This analysis provides essential insights for ongoing research and practical applications in the field.
