Probabilistic error analysis of limited-precision stochastic rounding
El-Mehdi El Arar, Massimiliano Fasi, Silviu-Ioan Filip, Mantas Mikaitis
TL;DR
The paper develops a hardware-faithful model of limited-precision stochastic rounding, $\text{SR}_{p,r}$, showing that its bias depends on the random-bit budget $r$ and vanishes as $r\to\infty$, recovering the classical $\text{SR}_p$ behavior. It then constructs a probabilistic error-analysis framework that yields bounds of the form $\mathcal{O}(\sqrt{n}u_p) + \mathcal{O}(n u_{p+r})$ for fundamental linear-algebraic routines, such as recursive summation and inner products, with both mean-bias and probabilistic high-probability guarantees derived via martingales and Bienaymé–Chebyshev methods. The results are validated by numerical experiments on summation, the Rosenbrock function, and neural-network parameter updates, demonstrating that an appropriately chosen $r$ (roughly $\lceil(\log_2 n)/2\rceil$) yields substantial accuracy improvements with manageable hardware cost. Overall, the work provides a principled bridge between ideal SR theory and practical, bit-budget-constrained implementations, guiding practitioners in selecting $r$ for reliable stochastic rounding in large-scale computations.
Abstract
Classical probabilistic rounding error analysis is particularly well suited to stochastic rounding (SR), and it yields strong results when dealing with floating-point algorithms that rely heavily on summation. For many numerical linear algebra algorithms, one can prove probabilistic error bounds that grow as O(nu), where n is the problem size and u is the unit roundoff. These probabilistic bounds are asymptotically tighter than the worst-case ones, which grow as O(nu). For certain classes of algorithms, SR has been shown to be unbiased. However, all these results were derived under the assumption that SR is implemented exactly, which typically requires a number of random bits that is too large to be suitable for practical implementations. We investigate the effect of the number of random bits on the probabilistic rounding error analysis of SR. To this end, we introduce a new rounding mode, limited-precision SR. By taking into account the number r of random bits used, this new rounding mode matches hardware implementations accurately, unlike the ideal SR operator generally used in the literature. We show that this new rounding mode is biased and that the bias is a function of r. As r approaches infinity, however, the bias disappears, and limited-precision SR converges to the ideal, unbiased SR operator. We develop a novel model for probabilistic error analysis of algorithms employing SR. Several numerical examples corroborate our theoretical findings.
