The Landscape of Compute-near-memory and Compute-in-memory: A Research and Commercial Overview
Asif Ali Khan, João Paulo C. De Lima, Hamid Farzaneh, Jeronimo Castrillon
TL;DR
This survey addresses the data movement bottleneck by examining compute-near-memory (CNM) and compute-in-memory (CIM) paradigms, their underlying memory technologies, and a spectrum of architectures from academic prototypes to commercial solutions. It systematically categorizes CIM/CNM designs, explains device primitives (crossbars, CAMs, logic units), and reviews memory technologies (DRAM, SRAM, PCM, ReRAM, MRAM, FeFET) and their suitability for in-memory or near-memory computation. The paper also maps the evolving commercial landscape, listing startups and industrial efforts (UPMEM, Samsung, IBM, AxDIMM, PIM-HBM, etc.), and highlights open challenges such as programmability, software ecosystems, and validation tools. Collectively, it underscores the potential for dramatic gains in energy and performance for data-intensive tasks, particularly AI workloads, while acknowledging substantial hurdles before widespread adoption. The conclusions point to continued diversification of memory technologies, cross-layer toolchains, and hybrid architectures as paths to realizing practical, scalable CIM/CNM systems.
Abstract
In today's data-centric world, where data fuels numerous application domains, with machine learning at the forefront, handling the enormous volume of data efficiently in terms of time and energy presents a formidable challenge. Conventional computing systems and accelerators are continually being pushed to their limits to stay competitive. In this context, computing near-memory (CNM) and computing-in-memory (CIM) have emerged as potentially game-changing paradigms. This survey introduces the basics of CNM and CIM architectures, including their underlying technologies and working principles. We focus particularly on CIM and CNM architectures that have either been prototyped or commercialized. While surveying the evolving CIM and CNM landscape in academia and industry, we discuss the potential benefits in terms of performance, energy, and cost, along with the challenges associated with these cutting-edge computing paradigms.
