Mass-Spring Models for Passive Keyword Spotting: A Springtronics Approach
Finn Bohte, Theophile Louvet, Vincent Maillou, Marc Serra Garcia
TL;DR
This work demonstrates a passive mass-spring computational framework, springtronics, capable of performing keyword spotting with competitive accuracy on a real 12-class speech benchmark using hundreds of degrees of freedom. It combines analogue feature extraction (mechanical Mel filters and cubic-root compression) with a continuous-time convolution realized via delay lines and a matrix-vector multiplication implemented by zero-modes, followed by a quadratic activation and leaky-integrator readout. Training leverages the reformulation of the convolution as a linear SVM, enabling efficient weight determination before mapping to the mass-spring hardware. The results show competitive accuracy relative to sub-milliwatt electronics and highlight energy-accuracy trade-offs, suggesting a viable path toward low-power mechanical computing on MEMS platforms and broader applications of the springtronics framework.
Abstract
Mechanical systems played a foundational role in computing history, and have regained interest due to their unique properties, such as low damping and the ability to process mechanical signals without transduction. However, recent efforts have primarily focused on elementary computations, implemented in systems based on pre-defined reservoirs, or in periodic systems such as arrays of buckling beams. Here, we numerically demonstrate a passive mechanical system -- in the form of a nonlinear mass-spring model -- that tackles a real-world benchmark for keyword spotting in speech signals. The model is organized in a hierarchical architecture combining feature extraction and continuous-time convolution, with each individual stage tailored to the physics of the considered mass-spring systems. For each step in the computation, a subsystem is designed by combining a small set of low-order polynomial potentials. These potentials act as fundamental components that interconnect a network of masses. In analogy to electronic circuit design, where complex functional circuits are constructed by combining basic components into hierarchical designs, we refer to this framework as springtronics. We introduce springtronic systems with hundreds of degrees of freedom, achieving speech classification accuracy comparable to existing sub-mW electronic systems.
