Parallel Backpropagation for Inverse of a Convolution with Application to Normalizing Flows

Sandeep Nagar; Girish Varma

Parallel Backpropagation for Inverse of a Convolution with Application to Normalizing Flows

Sandeep Nagar, Girish Varma

TL;DR

This work tackles the computational bottleneck of backpropagating through the inverse of a convolution, a key operation in Normalizing Flows and related imaging tasks. It derives a fast parallel backpropagation algorithm with running time $O(mk^2)$ on square images and provides a CUDA GPU implementation. By pairing this with Inverse-Flow, a multi-scale flow that uses the inverse of convolution in the forward pass and conventional convolution for sampling, the authors achieve significantly faster sampling while maintaining competitive bits-per-dimension. Experiments on MNIST and CIFAR-10 demonstrate substantial speedups in sampling without sacrificing density estimation quality, highlighting the practical impact for scalable, fast generative modeling.

Abstract

The inverse of an invertible convolution is an important operation that comes up in Normalizing Flows, Image Deblurring, etc. The naive algorithm for backpropagation of this operation using Gaussian elimination has running time $O(n^3)$ where $n$ is the number of pixels in the image. We give a fast parallel backpropagation algorithm with running time $O(\sqrt{n})$ for a square image and provide a GPU implementation of the same. Inverse of Convolutions are usually used in Normalizing Flows in the sampling pass, making them slow. We propose to use the Inverse of Convolutions in the forward (image to latent vector) pass of the Normalizing flow. Since the sampling pass is the inverse of the forward pass, it will use convolutions only, resulting in efficient sampling times. We use our parallel backpropagation algorithm to optimize the inverse of the convolution layer, resulting in fast training times. We implement this approach in various Normalizing Flow backbones, resulting in our Inverse-Flow models. We benchmark Inverse-Flow on standard datasets and show significantly improved sampling times with similar bits per dimension compared to previous models.

Parallel Backpropagation for Inverse of a Convolution with Application to Normalizing Flows

TL;DR

on square images and provides a CUDA GPU implementation. By pairing this with Inverse-Flow, a multi-scale flow that uses the inverse of convolution in the forward pass and conventional convolution for sampling, the authors achieve significantly faster sampling while maintaining competitive bits-per-dimension. Experiments on MNIST and CIFAR-10 demonstrate substantial speedups in sampling without sacrificing density estimation quality, highlighting the practical impact for scalable, fast generative modeling.

Abstract

where

is the number of pixels in the image. We give a fast parallel backpropagation algorithm with running time

for a square image and provide a GPU implementation of the same. Inverse of Convolutions are usually used in Normalizing Flows in the sampling pass, making them slow. We propose to use the Inverse of Convolutions in the forward (image to latent vector) pass of the Normalizing flow. Since the sampling pass is the inverse of the forward pass, it will use convolutions only, resulting in efficient sampling times. We use our parallel backpropagation algorithm to optimize the inverse of the convolution layer, resulting in fast training times. We implement this approach in various Normalizing Flow backbones, resulting in our Inverse-Flow models. We benchmark Inverse-Flow on standard datasets and show significantly improved sampling times with similar bits per dimension compared to previous models.

Parallel Backpropagation for Inverse of a Convolution with Application to Normalizing Flows

TL;DR

Abstract

Parallel Backpropagation for Inverse of a Convolution with Application to Normalizing Flows

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (4)