Semi-Supervised Learning with Ladder Networks
Antti Rasmus, Harri Valpola, Mikko Honkala, Mathias Berglund, Tapani Raiko
TL;DR
The paper tackles label scarcity by unifying supervised learning with layer-wise unsupervised denoising in Ladder networks. It extends the Ladder architecture to include supervision, enabling end-to-end training with a sum of supervised and denoising costs and leveraging skip connections between encoder and decoder. Empirically, the approach delivers state-of-the-art semi-supervised performance on MNIST and CIFAR-10, and shows strong results even with very few labeled examples, while remaining compatible with both MLPs and CNNs. The work offers a simple, scalable framework for semi-supervised learning that can be integrated into existing feedforward architectures and extended to larger temporal problems.
Abstract
We combine supervised learning with unsupervised learning in deep neural networks. The proposed model is trained to simultaneously minimize the sum of supervised and unsupervised cost functions by backpropagation, avoiding the need for layer-wise pre-training. Our work builds on the Ladder network proposed by Valpola (2015), which we extend by combining the model with supervision. We show that the resulting model reaches state-of-the-art performance in semi-supervised MNIST and CIFAR-10 classification, in addition to permutation-invariant MNIST classification with all labels.
