Table of Contents
Fetching ...

k-Sparse Autoencoders

Alireza Makhzani, Brendan Frey

TL;DR

The paper tackles the challenge of obtaining sparse representations without complex penalties or nonlinearities. It introduces the k-sparse autoencoder, a linear autoencoder that preserves only the top-k hidden activations, enabling fast training and effective sparse coding. Through analysis linking it to sparse coding with incoherent dictionaries and ITI-like recovery, the work provides theoretical and empirical justification for the approach. Empirically, k-sparse autoencoders yield strong unsupervised and supervised performance on MNIST and NORB, and serve as an effective pretraining mechanism for both shallow and deep networks.

Abstract

Recently, it has been observed that when representations are learnt in a way that encourages sparsity, improved performance is obtained on classification tasks. These methods involve combinations of activation functions, sampling steps and different kinds of penalties. To investigate the effectiveness of sparsity by itself, we propose the k-sparse autoencoder, which is an autoencoder with linear activation function, where in hidden layers only the k highest activities are kept. When applied to the MNIST and NORB datasets, we find that this method achieves better classification results than denoising autoencoders, networks trained with dropout, and RBMs. k-sparse autoencoders are simple to train and the encoding stage is very fast, making them well-suited to large problem sizes, where conventional sparse coding algorithms cannot be applied.

k-Sparse Autoencoders

TL;DR

The paper tackles the challenge of obtaining sparse representations without complex penalties or nonlinearities. It introduces the k-sparse autoencoder, a linear autoencoder that preserves only the top-k hidden activations, enabling fast training and effective sparse coding. Through analysis linking it to sparse coding with incoherent dictionaries and ITI-like recovery, the work provides theoretical and empirical justification for the approach. Empirically, k-sparse autoencoders yield strong unsupervised and supervised performance on MNIST and NORB, and serve as an effective pretraining mechanism for both shallow and deep networks.

Abstract

Recently, it has been observed that when representations are learnt in a way that encourages sparsity, improved performance is obtained on classification tasks. These methods involve combinations of activation functions, sampling steps and different kinds of penalties. To investigate the effectiveness of sparsity by itself, we propose the k-sparse autoencoder, which is an autoencoder with linear activation function, where in hidden layers only the k highest activities are kept. When applied to the MNIST and NORB datasets, we find that this method achieves better classification results than denoising autoencoders, networks trained with dropout, and RBMs. k-sparse autoencoders are simple to train and the encoding stage is very fast, making them well-suited to large problem sizes, where conventional sparse coding algorithms cannot be applied.

Paper Structure

This paper contains 20 sections, 9 equations, 4 figures, 2 tables.

Figures (4)

  • Figure 1: Filters of the k-sparse autoencoder for different sparsity levels k, learnt from MNIST with 1000 hidden units.
  • Figure 2: Filters of the k-sparse autoencoder for different sparsity levels k, learnt from NORB with 4000 hidden units.
  • Figure 3: Filters of k-sparse autoencoder with 1000 hidden units and $\text{k}=50$, learnt from CIFAR-10 random patches.
  • Figure 4: Histogram of hidden unit activities for various unsupervised learning methods.