KernelFusion: Assumption-Free Blind Super-Resolution via Patch Diffusion

Oliver Heinimann; Assaf Shocher; Tal Zimbalist; Michal Irani

KernelFusion: Assumption-Free Blind Super-Resolution via Patch Diffusion

Oliver Heinimann, Assaf Shocher, Tal Zimbalist, Michal Irani

TL;DR

KernelFusion tackles blind super-resolution under unknown, complex downscaling kernels by a zero-shot diffusion-based approach that learns an image-specific patch distribution from a single LR image and jointly reconstructs the HR image while estimating the SR-kernel. The method operates in two phases: Phase 1 trains a patch-diffusion model on LR data, and Phase 2 performs reverse diffusion at high resolution with a consistency loss, using an implicit neural representation to model the SR-kernel. This yields state-of-the-art results on challenging degradations and demonstrates robust kernel recovery for non-Gaussian kernels, marking a shift toward an assumption-free Blind-SR paradigm. The approach enables accurate SR without external priors or pre-trained models, albeit with per-image training time and some limitations with severe LR artifacts, paving the way for hybrid methods that integrate external information.

Abstract

Traditional super-resolution (SR) methods assume an ``ideal'' downscaling SR-kernel (e.g., bicubic downscaling) between the high-resolution (HR) image and the low-resolution (LR) image. Such methods fail once the LR images are generated differently. Current blind-SR methods aim to remove this assumption, but are still fundamentally restricted to rather simplistic downscaling SR-kernels (e.g., anisotropic Gaussian kernels), and fail on more complex (out of distribution) downscaling degradations. However, using the correct SR-kernel is often more important than using a sophisticated SR algorithm. In ``KernelFusion'' we introduce a zero-shot diffusion-based method that makes no assumptions about the kernel. Our method recovers the unique image-specific SR-kernel directly from the LR input image, while simultaneously recovering its corresponding HR image. KernelFusion exploits the principle that the correct SR-kernel is the one that maximizes patch similarity across different scales of the LR image. We first train an image-specific patch-based diffusion model on the single LR input image, capturing its unique internal patch statistics. We then reconstruct a larger HR image with the same learned patch distribution, while simultaneously recovering the correct downscaling SR-kernel that maintains this cross-scale relation between the HR and LR images. Empirical results show that KernelFusion vastly outperforms all SR baselines on complex downscaling degradations, where existing SotA Blind-SR methods fail miserably. By breaking free from predefined kernel assumptions, KernelFusion pushes Blind-SR into a new assumption-free paradigm, handling downscaling kernels previously thought impossible.

KernelFusion: Assumption-Free Blind Super-Resolution via Patch Diffusion

TL;DR

Abstract

KernelFusion: Assumption-Free Blind Super-Resolution via Patch Diffusion

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)