Debiasing Kernel-Based Generative Models
Tian Qin, Wei-Min Huang
TL;DR
Debiasing Kernel-Based Generative Models (DKGM) tackles the challenge of high-quality image generation without explicit density estimation by coupling kernel density estimation (KDE) with a novel debiasing procedure. The method first generates KDE-based initial samples and then applies an iterative debiasing process, inspired by stochastic approximation and expressed as a stochastic differential equation, to recover sharp, high-fidelity images. The two-stage design leverages a KDE-driven stage and a time-step embedded U‑Net for refinement, yielding competitive results on CIFAR‑10 and strong performance on CelebA and LSUN when compared to diffusion and GAN baselines. DKGM also explores the roles of KDE bandwidth and the debiasing noise level, discusses connections to score-based models, and highlights potential for data augmentation, albeit with computational trade-offs in stage-2 training.
Abstract
We propose a novel two-stage framework of generative models named Debiasing Kernel-Based Generative Models (DKGM) with the insights from kernel density estimation (KDE) and stochastic approximation. In the first stage of DKGM, we employ KDE to bypass the obstacles in estimating the density of data without losing too much image quality. One characteristic of KDE is oversmoothing, which makes the generated image blurry. Therefore, in the second stage, we formulate the process of reducing the blurriness of images as a statistical debiasing problem and develop a novel iterative algorithm to improve image quality, which is inspired by the stochastic approximation. Extensive experiments illustrate that the image quality of DKGM on CIFAR10 is comparable to state-of-the-art models such as diffusion models and GAN models. The performance of DKGM on CelebA 128x128 and LSUN (Church) 128x128 is also competitive. We conduct extra experiments to exploit how the bandwidth in KDE affects the sample diversity and debiasing effect of DKGM. The connections between DKGM and score-based models are also discussed.
