MDM: Manhattan Distance Mapping of DNN Weights for Parasitic-Resistance-Resilient Memristive Crossbars
Matheus Farias, Wanghley Martins, H. T. Kung
TL;DR
The paper tackles parasitic-resistance nonidealities in memristive CIM DNN accelerators by introducing Manhattan Distance Mapping (MDM), a post-training weight remapping that reverses dataflow and reorders rows to place dense, low-order-bit activity near the I/O rails, reducing the nonideality factor $NF$ without retraining. It grounds the approach in the Manhattan Hypothesis and bit-level structured sparsity, and models PR as spatially dependent noise to assess distortion in PyTorch. The authors validate their method with circuit-level SPICE simulations and ImageNet-1K benchmarks, reporting NF reductions up to $46\%$ and average ResNet accuracy gains of $3.6\%$, demonstrating how MDM enables larger crossbars with manageable distortion. By bridging device-level nonidealities and algorithmic remapping, MDM offers a lightweight, scalable path to more efficient CIM-based DNN accelerators.
Abstract
Manhattan Distance Mapping (MDM) is a post-training deep neural network (DNN) weight mapping technique for memristive bit-sliced compute-in-memory (CIM) crossbars that reduces parasitic resistance (PR) nonidealities. PR limits crossbar efficiency by mapping DNN matrices into small crossbar tiles, reducing CIM-based speedup. Each crossbar executes one tile, requiring digital synchronization before the next layer. At this granularity, designers either deploy many small crossbars in parallel or reuse a few sequentially-both increasing analog-to-digital conversions, latency, I/O pressure, and chip area. MDM alleviates PR effects by optimizing active-memristor placement. Exploiting bit-level structured sparsity, it feeds activations from the denser low-order side and reorders rows according to the Manhattan distance, relocating active cells toward regions less affected by PR and thus lowering the nonideality factor (NF). Applied to DNN models on ImageNet-1k, MDM reduces NF by up to 46% and improves accuracy under analog distortion by an average of 3.6% in ResNets. Overall, it provides a lightweight, spatially informed method for scaling CIM DNN accelerators.
