A Reconfigurable Time-Domain In-Memory Computing Macro using FeFET-Based CAM with Multilevel Delay Calibration in 28 nm CMOS
Jeries Mattar, Mor M. Dahan, Stefan Dunkel, Halid Mulaosmanovic, Gunda Beernink, Sven Beyer, Eilam Yalon, Nicolás Wainstein
TL;DR
This work tackles data movement and energy efficiency bottlenecks in neural-network accelerators by introducing a reconfigurable time-domain nonvolatile IMC macro based on FeFETs. The architecture integrates a CAM array, a cascaded delay-element chain, and a time-to-digital converter in 28 nm CMOS, enabling XOR- and AND-based MAC as well as in-memory Boolean logic with sub-nanosecond delay steps. A key contribution is a bulk-assisted multilevel-state calibration that achieves fine delay tuning (around 100 ps resolution) and resilience to device variations, along with write-disturb prevention via isolated triple-well bulks. Experimental results demonstrate 222.2 MOPS per cell and 1887 TOPS/W at 0.85 V, using a 3×3 FeFET CAM and a 3-stage DE, indicating a practical path toward scalable, energy-efficient TD-nvIMC accelerators for edge AI.
Abstract
Time-domain nonvolatile in-memory computing (TD-nvIMC) offers a promising pathway to reduce data movement and improve energy efficiency by encoding computation in delay rather than voltage or current. This work presents a fully integrated and reconfigurable TD-nvIMC macro, fabricated in 28 nm CMOS, that combines a ferroelectric FET (FeFET)-based content-addressable memory array, a cascaded delay element chain, and a time-to-digital converter. The architecture supports binary multiply-and-accumulate (MAC) operations using XOR- and AND-based matching, as well as in-memory Boolean logic and arithmetic functions. Sub-nanosecond MAC resolution is achieved through experimentally demonstrated 550 ps delay steps, representing a 2000$\times$ improvement over prior FeFET TD-nvIMC work, enabled by multilevel-state calibration with $\leq$ 100 ps resolution. Write-disturb resilience is ensured via isolated triple-well bulks. The proposed macro achieves a measured throughput of 222.2 MOPS/cell and energy efficiency of 1887 TOPS/W at 0.85 V, establishing a viable path toward scalable, energy-efficient TD-nvIMC accelerators.
