Energy-Efficient Sampling Using Stochastic Magnetic Tunnel Junctions
Nicolas Alder, Shivam Nitin Kajale, Milin Tunsiricharoengul, Deblina Sarkar, Ralf Herbrich
TL;DR
This work tackles the energy burden of probabilistic ML by introducing a hardware-software framework that uses room-temperature stochastic MTJs (s-MTJs) to generate true randomness and map outputs directly to Float16 bit positions for uniform sampling. It provides a closed-form, bit-level parameterization to realize Uniform(Float16) sampling without symbolic computation, and extends this with a mixture-of-uniforms representation to sample from arbitrary 1D distributions via convolution and prior-likelihood transforms. Empirical results report substantial energy savings—exceeding $9721\times$ vs Mersenne-Twister and $5649\times$ vs PCG—alongside quantifiable approximation errors (KL divergences) that remain small for convolution and prior-likelihood operations. The approach enables scalable uncertainty quantification in probabilistic ML and offers a path toward hardware-accelerated sampling, with future work focusing on prototype development and robustness across hardware variations.
Abstract
(Pseudo)random sampling, a costly yet widely used method in (probabilistic) machine learning and Markov Chain Monte Carlo algorithms, remains unfeasible on a truly large scale due to unmet computational requirements. We introduce an energy-efficient algorithm for uniform Float16 sampling, utilizing a room-temperature stochastic magnetic tunnel junction device to generate truly random floating-point numbers. By avoiding expensive symbolic computation and mapping physical phenomena directly to the statistical properties of the floating-point format and uniform distribution, our approach achieves a higher level of energy efficiency than the state-of-the-art Mersenne-Twister algorithm by a minimum factor of 9721 and an improvement factor of 5649 compared to the more energy-efficient PCG algorithm. Building on this sampling technique and hardware framework, we decompose arbitrary distributions into many non-overlapping approximative uniform distributions along with convolution and prior-likelihood operations, which allows us to sample from any 1D distribution without closed-form solutions. We provide measurements of the potential accumulated approximation errors, demonstrating the effectiveness of our method.
