Sequential Printed MLP Circuits for Super TinyML Multi-Sensory Applications
Gurol Saglam, Florentia Afentaki, Georgios Zervakis, Mehdi B. Tahoori
TL;DR
This work tackles deploying large, multi-parameter neural networks on Printed Electronics by introducing sequential Super-TinyML circuits that combine single-cycle and multi-cycle neurons. It achieves substantial hardware reductions through pow2 quantization, multiplexed weight hardwiring, and architectural choices that replace shifting registers with multiplexers, enabling up to $753$ inputs and $8505$ coefficients (i.e., $35.9\times$ more features and $65.4\times$ more coefficients than prior PE solutions). The framework integrates redundant feature pruning and neuron approximation (via leading-$1$ bits and NSGA-II optimization) to further shrink area and power while maintaining accuracy, with thorough empirical validation across multiple UCI datasets. Overall, the results demonstrate that bespoke sequential designs can significantly push the feasibility of large-scale MLPs in PE, offering practical benefits for ultra-low-power, multi-sensory wearables and implants.
Abstract
Super-TinyML aims to optimize machine learning models for deployment on ultra-low-power application domains such as wearable technologies and implants. Such domains also require conformality, flexibility, and non-toxicity which traditional silicon-based systems cannot fulfill. Printed Electronics (PE) offers not only these characteristics, but also cost-effective and on-demand fabrication. However, Neural Networks (NN) with hundreds of features -- often necessary for target applications -- have not been feasible in PE because of its restrictions such as limited device count due to its large feature sizes. In contrast to the state of the art using fully parallel architectures and limited to smaller classifiers, in this work we implement a super-TinyML architecture for bespoke (application-specific) NNs that surpasses the previous limits of state of the art and enables NNs with large number of parameters. With the introduction of super-TinyML into PE technology, we address the area and power limitations through resource sharing with multi-cycle operation and neuron approximation. This enables, for the first time, the implementation of NNs with up to $35.9\times$ more features and $65.4\times$ more coefficients than the state of the art solutions.
