Fast & Efficient Normalizing Flows and Applications of Image Generative Models

Sandeep Nagar

Fast & Efficient Normalizing Flows and Applications of Image Generative Models

Sandeep Nagar

TL;DR

This work tackles two intertwined goals: making normalizing flows more efficient and leveraging image-generative models for real-world computer vision tasks. It introduces CInC Flow and Inverse-Flow to enable fast, parallelized inversion of convolutions and scalable training, plus Affine-StableSR to combine diffusion priors with lightweight NF-inspired encoding for super-resolution. The applications span automated seed-quality assessment, privacy-preserving anonymization in driving datasets, unsupervised geological mapping with stacked autoencoders, diffusion-based art restoration, and robust traffic-sign detection under missing-sign scenarios. Collectively, the contributions deliver both theoretical advances in flow-based models and practical systems for efficiency, privacy, and real-world CV tasks. The work demonstrates substantial gains in sampling speed, parameter efficiency, and applicability to diverse domains, signaling a meaningful step toward deploying principled generative models at scale.

Abstract

This thesis presents novel contributions in two primary areas: advancing the efficiency of generative models, particularly normalizing flows, and applying generative models to solve real-world computer vision challenges. The first part introduce significant improvements to normalizing flow architectures through six key innovations: 1) Development of invertible 3x3 Convolution layers with mathematically proven necessary and sufficient conditions for invertibility, (2) introduction of a more efficient Quad-coupling layer, 3) Design of a fast and efficient parallel inversion algorithm for kxk convolutional layers, 4) Fast & efficient backpropagation algorithm for inverse of convolution, 5) Using inverse of convolution, in Inverse-Flow, for the forward pass and training it using proposed backpropagation algorithm, and 6) Affine-StableSR, a compact and efficient super-resolution model that leverages pre-trained weights and Normalizing Flow layers to reduce parameter count while maintaining performance. The second part: 1) An automated quality assessment system for agricultural produce using Conditional GANs to address class imbalance, data scarcity and annotation challenges, achieving good accuracy in seed purity testing; 2) An unsupervised geological mapping framework utilizing stacked autoencoders for dimensionality reduction, showing improved feature extraction compared to conventional methods; 3) We proposed a privacy preserving method for autonomous driving datasets using on face detection and image inpainting; 4) Utilizing Stable Diffusion based image inpainting for replacing the detected face and license plate to advancing privacy-preserving techniques and ethical considerations in the field.; and 5) An adapted diffusion model for art restoration that effectively handles multiple types of degradation through unified fine-tuning.

Fast & Efficient Normalizing Flows and Applications of Image Generative Models

TL;DR

Abstract

Fast & Efficient Normalizing Flows and Applications of Image Generative Models

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (46)

Theorems & Definitions (9)