KBNet: Kernel Basis Network for Image Restoration
Yi Zhang, Dasong Li, Xiaoyu Shi, Dailan He, Kangning Song, Xiaogang Wang, Hongwei Qin, Hongsheng Li
TL;DR
KBNet tackles adaptive spatial information aggregation for image restoration by introducing Kernel Basis Attention (KBA), which uses learnable kernel bases and per-pixel fusion to capture diverse local patterns. It couples KBA with a Multi-axis Feature Fusion (MFF) block to jointly encode channel-wise, spatial-invariant, and pixel-adaptive features, all integrated into a U-Net backbone. The approach delivers state-of-the-art results across denoising, deraining, and deblurring benchmarks while reducing computational cost relative to prior SOTA methods. Together, these components provide an efficient framework that blends convolutional inductive biases with adaptive spatial processing for robust low-level vision tasks.
Abstract
How to aggregate spatial information plays an essential role in learning-based image restoration. Most existing CNN-based networks adopt static convolutional kernels to encode spatial information, which cannot aggregate spatial information adaptively. Recent transformer-based architectures achieve adaptive spatial aggregation. But they lack desirable inductive biases of convolutions and require heavy computational costs. In this paper, we propose a kernel basis attention (KBA) module, which introduces learnable kernel bases to model representative image patterns for spatial information aggregation. Different kernel bases are trained to model different local structures. At each spatial location, they are linearly and adaptively fused by predicted pixel-wise coefficients to obtain aggregation weights. Based on the KBA module, we further design a multi-axis feature fusion (MFF) block to encode and fuse channel-wise, spatial-invariant, and pixel-adaptive features for image restoration. Our model, named kernel basis network (KBNet), achieves state-of-the-art performances on more than ten benchmarks over image denoising, deraining, and deblurring tasks while requiring less computational cost than previous SOTA methods.
