MonoSparse-CAM: Efficient Tree Model Processing via Monotonicity and Sparsity in CAMs
Tergel Molom-Ochir, Brady Taylor, Hai Li, Yiran Chen
TL;DR
The paper tackles the challenge of energy-efficient hardware acceleration for tree-based models on analog CAM arrays. It introduces MonoSparse-CAM, a software-hardware co-design that exploits both sparsity in TBML and the monotonicity of CAM matchlines to skip nonessential computations, achieving major energy and throughput gains. Across simulations and real datasets, MonoSparse-CAM demonstrates energy reductions up to $28.56\times$ over raw processing and $18.51\times$ over prior techniques, with peak efficiency reaching $418$ GOPS/W at high sparsity. The work substantiates the practicality of TBML hardware acceleration on CAM platforms and paves the way for scalable, energy-efficient AI on tabular-data tasks.
Abstract
While the tree-based machine learning (TBML) models exhibit superior performance compared to neural networks on tabular data and hold promise for energy-efficient acceleration using aCAM arrays, their ideal deployment on hardware with explicit exploitation of TBML structure and aCAM circuitry remains a challenging task. In this work, we present MonoSparse-CAM, a new CAM-based optimization technique that exploits TBML sparsity and monotonicity in CAM circuitry to further advance processing performance. Our results indicate that MonoSparse-CAM reduces energy consumption by upto to 28.56x compared to raw processing and by 18.51x compared to state-of-the-art techniques, while improving the efficiency of computation by at least 1.68x.
