Semi-Supervised Learning with Ladder Networks

Antti Rasmus; Harri Valpola; Mikko Honkala; Mathias Berglund; Tapani Raiko

Semi-Supervised Learning with Ladder Networks

Antti Rasmus, Harri Valpola, Mikko Honkala, Mathias Berglund, Tapani Raiko

TL;DR

The paper tackles label scarcity by unifying supervised learning with layer-wise unsupervised denoising in Ladder networks. It extends the Ladder architecture to include supervision, enabling end-to-end training with a sum of supervised and denoising costs and leveraging skip connections between encoder and decoder. Empirically, the approach delivers state-of-the-art semi-supervised performance on MNIST and CIFAR-10, and shows strong results even with very few labeled examples, while remaining compatible with both MLPs and CNNs. The work offers a simple, scalable framework for semi-supervised learning that can be integrated into existing feedforward architectures and extended to larger temporal problems.

Abstract

We combine supervised learning with unsupervised learning in deep neural networks. The proposed model is trained to simultaneously minimize the sum of supervised and unsupervised cost functions by backpropagation, avoiding the need for layer-wise pre-training. Our work builds on the Ladder network proposed by Valpola (2015), which we extend by combining the model with supervision. We show that the resulting model reaches state-of-the-art performance in semi-supervised MNIST and CIFAR-10 classification, in addition to permutation-invariant MNIST classification with all labels.

Semi-Supervised Learning with Ladder Networks

TL;DR

Abstract

Semi-Supervised Learning with Ladder Networks

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (2)