Diffusion-Based Representation Learning

Sarthak Mittal; Korbinian Abstreiter; Stefan Bauer; Bernhard Schölkopf; Arash Mehrjou

Diffusion-Based Representation Learning

Sarthak Mittal, Korbinian Abstreiter, Stefan Bauer, Bernhard Schölkopf, Arash Mehrjou

TL;DR

This work augments the denoising score matching framework to enable representation learning without any supervised signal and proposes to learn an infinite-dimensional latent code that achieves improvements of state-of-the-art models on semi-supervised image classification.

Abstract

Diffusion-based methods represented as stochastic differential equations on a continuous-time domain have recently proven successful as a non-adversarial generative model. Training such models relies on denoising score matching, which can be seen as multi-scale denoising autoencoders. Here, we augment the denoising score matching framework to enable representation learning without any supervised signal. GANs and VAEs learn representations by directly transforming latent codes to data samples. In contrast, the introduced diffusion-based representation learning relies on a new formulation of the denoising score matching objective and thus encodes the information needed for denoising. We illustrate how this difference allows for manual control of the level of details encoded in the representation. Using the same approach, we propose to learn an infinite-dimensional latent code that achieves improvements of state-of-the-art models on semi-supervised image classification. We also compare the quality of learned representations of diffusion score matching with other methods like autoencoder and contrastively trained systems through their performances on downstream tasks.

Diffusion-Based Representation Learning

TL;DR

Abstract

Diffusion-Based Representation Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (13)

Theorems & Definitions (3)