Memory Bounded Deep Convolutional Networks
Maxwell D. Collins, Pushmeet Kohli
TL;DR
High-level problem: deep CNNs have prohibitive memory costs for deployment on constrained devices. The paper proposes sparsity-inducing regularizers that enforce sparse connectivity during training, including ℓ1 shrinkage and a direct ℓ0 projection, implemented within SGD and extended in Caffe with a layer-wise sparsity strategy. Key findings show substantial memory reductions (e.g., near 4x for AlexNet) with minimal accuracy loss on MNIST, CIFAR-10, and ImageNet, plus benefits from ensembles and robustness under reduced training data. The work provides a practical pathway to memory-efficient, high-performing CNNs suitable for resource-limited environments and scalable to large datasets.
Abstract
In this work, we investigate the use of sparsity-inducing regularizers during training of Convolution Neural Networks (CNNs). These regularizers encourage that fewer connections in the convolution and fully connected layers take non-zero values and in effect result in sparse connectivity between hidden units in the deep network. This in turn reduces the memory and runtime cost involved in deploying the learned CNNs. We show that training with such regularization can still be performed using stochastic gradient descent implying that it can be used easily in existing codebases. Experimental evaluation of our approach on MNIST, CIFAR, and ImageNet datasets shows that our regularizers can result in dramatic reductions in memory requirements. For instance, when applied on AlexNet, our method can reduce the memory consumption by a factor of four with minimal loss in accuracy.
