Digit-Recurrence Posit Division
Raul Murillo, Julio Villalba-Moreno, Alberto A. Del Barrio, Guillermo Botella
TL;DR
This work adapts digit-recurrence division to Posit arithmetic, introducing the first radix-4 implementation and a set of hardware optimizations to reduce latency and energy. It combines redundant residual representations, on-the-fly quotient conversion, and operand scaling to streamline the division process across Posit widths, validated by synthesis in a 28 nm flow. Across 16-, 32-, and 64-bit Posits, radix-4 designs consistently reduce iteration counts and latency, achieving up to ~80% energy savings with modest area overhead relative to prior methods. The results demonstrate that radix-4 digit-recurrence division is a viable, high-performance approach for Posit units, offering substantial improvements for future PAU designs in RISCV/RISC architectures, with quantitative gains over existing digit-recurrence and Newton-Raphson-based dividers. The work further provides a detailed account of initialization, selection functions, and termination, enabling reproducible hardware realizations and guiding design choices for high-speed, low-power posit arithmetic cores.
Abstract
Posit arithmetic has emerged as a promising alternative to IEEE 754 floating-point representation, offering enhanced accuracy and dynamic range. However, division operations in posit systems remain challenging due to their inherent hardware complexity. In this work, we present posit division units based on the digit-recurrence algorithm, marking the first implementation of radix-4 digit-recurrence techniques within this context. Our approach incorporates hardware-centric optimizations including redundant arithmetic, on-the-fly quotient conversion, and operand scaling to streamline the division process while mitigating latency, area, and power overheads. Comprehensive synthesis evaluations across multiple posit configurations demonstrate significant performance improvements, including more than 80% energy reduction with small area overhead compared to existing methods, and a substantial decrease in the number of iterations. These results underscore the potential of our adapted algorithm to enhance the efficiency of posit-based arithmetic units.
