Spintronics for image recognition: performance benchmarking via ultrafast data-driven simulations
Anatole Moureaux, Chloé Chopin, Simon de Wergifosse, Laurent Jacques, Flavio Abreu Araujo
TL;DR
The paper tackles energy and scalability concerns in AI by presenting a time-multiplexed echo-state network (ESN) that uses a single vortex-based spin-torque oscillator (STVO) as the nonlinear reservoir, with STVO dynamics simulated via the data-driven Thiele equation approach (DD-TEA) to achieve ultrafast, hardware-friendly modeling. The method processes images by PCA-based dimensionality reduction, random feature mapping, and sequential STVO-driven nonlinear transformation, followed by linear readout trained with the Moore-Penrose pseudoinverse. It reports state-of-the-art reservoir-computing performance on MNIST (≈98.1% accuracy) and reasonable results on EMNIST-letters and Fashion-MNIST, with STVO nonlinearity effectively matching conventional nonlinearities like ReLU and Sigmoid when the reservoir has enough learnable parameters. The DD-TEA framework enables extensive hyperparameter sweeps and supports the design of deeper architectures for improved accuracy, potentially enabling energy-efficient neuromorphic image recognition and time-series tasks on spintronic hardware.
Abstract
We present a demonstration of image classification using an echo-state network (ESN) relying on a single simulated spintronic nanostructure known as the vortex-based spin-torque oscillator (STVO) delayed in time. We employ an ultrafast data-driven simulation framework called the data-driven Thiele equation approach (DD-TEA) to simulate the STVO dynamics. This allows us to avoid the challenges associated with repeated experimental manipulation of such a nanostructured system. We showcase the versatility of our solution by successfully applying it to solve classification challenges with the MNIST, EMNIST-letters and Fashion MNIST datasets. Through our simulations, we determine that within an ESN with numerous learnable parameters the results obtained using the STVO dynamics as an activation function are comparable to the ones obtained with other conventional nonlinear activation functions like the reLU and the sigmoid. While achieving state-of-the-art accuracy levels on the MNIST dataset, our model's performance on EMNIST-letters and Fashion MNIST is lower due to the relative simplicity of the system architecture and the increased complexity of the tasks. We expect that the DD-TEA framework will enable the exploration of deeper architectures, ultimately leading to improved classification accuracy.
